How can one convert genome sequence files of microsoft office into genbank format?

More Muhammad Ali's questions See All

Why crystal like structures are formed on NGM plates cultured with E. coli OP50?

When I prepare NGM plate for the growth of C. elegans, the plates are clear. However, when I spread culture of E. coli OP50 on NGM plates and incubate at 37 C overnight, I observed crystal like...

10 November 2017 853 4 View

Any advice on the difference in the Ct value of reference gene in treated and control samples?

I am using 16S primers for qPCR. However I got different Ct values in may treated and control sample (30.08 and 28.85). Would it lead to considerable error in final results? In my opinion,...

06 July 2015 3,446 10 View

Is it necessary to dilute cDNA before qPCR? How much dilution should be made? How to determine best dilution factor?

Kindly explane importance of cDNA dilution before qPCR and how one can determine best dilution factor for cDNA samples? How it will effect results of qPCR? In other words, how much quantity of...

05 June 2015 1,608 22 View

Is it possibile to determine genomic DNA contamination in cDNA sample on the basis of qPCR results?

Is there any possible way to find out genomic DNA contamination/residues in cDNA by analyzing results of qPCR? Can we get any information related to genomic DNA by Ct value?

05 June 2015 880 4 View

Which are experimentally proved Best house keeping genes to study gene expression in Pseudomonas syringae?

I need some information about house keeping genes to be used as internal control particularly for Pseudomonas syringae. Kindly let me know which house keeping genes are good to study gene...

31 December 2014 5,713 4 View

How can I avoid contamination of Genomic DNA during RNA extraction?

It is observed that RNA samples are often contaminated with genomic DNA. How can this contamination be avoided? What are the possible causes for genomic DNA contamination during mRNA extraction.

31 December 2014 5,145 3 View

We want to remove the gaps/errors of our laboratory genomes with some reference genomes. Which software would be the best option?

09 October 2014 9,575 15 View

Which are the best expression vectors for Pseudomonas syringae?

I need some good expression vectors for Pseudomonas syringae for over expression of proteins and for the construction of complementary strain. along with that, I also need the source/link from...

08 September 2014 5,469 4 View

What software/database could be used for identification of pan genome/core genome of user's own sequence data?

1) The sequence data is in contigs form. 2) Please suggest windows based tools. 3) Important literature/source would also be appreciated.

08 September 2014 8,449 18 View

What techniques are best to confirm construction of complementary strain?

After constructing a mutant bacterial strain, I have to construct complementary strain. How can I confirm expression of gene of interest in complementary strain? Which techniques can be used for...

05 June 2014 10,098 1 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

How to confirm the site-directed mutagenesis result without performing NGS?

I'm cloning a fragment of 3200 nts into plasmid. The cloning was successful, however, 02 amino acids were mutated. Now I want to fix these 02 aa by site-directed mutagenesis technique using...

08 August 2024 4,645 2 View

How to convert a privately loaded document into a public document?

I attempted to make a privately uploaded text public but a window appeared that said an error occurred. There was no explanation provided as to why there was an error or what might be done to...

05 August 2024 8,025 7 View

Why did the authors extrapolate a phenotype that they experimentally proved in one bacterial strain across the whole genus of the organism?

I aim to be as skeptical as possible regarding whether a pair of orthologous genes results in the same phenotype in their different but related bacterial organisms under similar environmental...

05 August 2024 6,787 4 View

Who of all the Global Scientific community will help me Prof. Dr. Yoshida make way for TPEOM, MEC ~EMC to return the atmospheric gases to the norma ?

TEP presentation caption (The Environmental Project) Re: Why should Washington’s DC, or any country government point of location think of as nowadays of as to being 'tomorrow as to come! if it...

03 August 2024 2,484 1 View

Does anyone have issues using Prepman Ultra reagent for MicroSeq ID bacterial, fungal and yeast sample preparation?

I have been attempting to extract DNA from Bacterial, Fungal and Yeast banked samples (>1e7 cells) using Prepman Ultra reagent and I seem to be struggling to obtain a sequence. Although the...

01 August 2024 2,079 0 View

How is the bacterial genome's high protein count verified as genuine despite 800+ contigs and good metrics (98.55%completeness, 0.68% contamination)?

Given that the bacterial genome has over 800 contigs, but its quality metrics are good, with a completeness of 98.55% and a contamination of 0.68% as assessed by CheckM, what specific validation...

01 August 2024 1,514 1 View

Seeking Software Recommendations for SELEX NGS Data Analysis?

I am looking for software to help analyze SELEX NGS data, including alignment, sequence enrichment, and other related tasks. Can anyone recommend suitable tools or software? Best wishes, Waleed

30 July 2024 1,061 5 View

Request for Advice: Starch Metabolism Research Project?

I am currently considering a research project focusing on a comparative analysis of starch metabolism in orchids and roses. I am particularly interested in identifying the types and quantities of...

30 July 2024 4,267 2 View

Hi there, someone has the SeinFit software for windows because I cannot download it?

DOS version.

29 July 2024 6,064 1 View

Theresa Wohlever Popular answer

The simplest option may be a bit of a combination of the two answers above, that is something along the lines of the following:

Open your sequence data in Word

Choose to Save As, and select the drop down format option Plain text from the drop down menu. Now, your saved sequence will be in plain text format. We don't know what sequence format it is in, but that might not be relevant.

Navigate to http://www.ebi.ac.uk/Tools/sfc/readseq/ and upload your plain text sequence file with the Choose file button. Make sure that Input Format selection is set to Auto-detected.

Convert this to GenBank, or any format that can be imported into the sequence editing software of your choice, that can also export to GenBank

Depending on the sequence editing software you use, you may then be able to directly import the annotation data such that it is applied directly to your sequence

Finally, you could export the annotated sequence from the sequence editing software in GenBank format.

http://www.ebi.ac.uk/Tools/sfc/readseq/

https://www.google.com/webhp?ion=1&espv=2&es_th=1&ie=UTF-8#q=%22sequence+editing%22+annotation+GenBank

Alfonso Benitez-Paez

Hi Muhammad,

I do not understand how you got MS word files from genome sequence. I recommend you to work on Lunix/Unix environment using plain text files. Anyway, you have READSEQ tool at EBI that can convert almost all type of files.

Good luck!

David W Waite

It depends a bit of exactly what layout your spreadsheet is in, but there are tools like tbl2asn2 that can stitch your sequence/annotation data together for a GenBank submission.

Like Alfonso, I'm surprised that they chose a Word document for your sequence data. If you want to extract the data from this, doc and docx files can be opened with an archive manager (7zip or similar) and you can usually find a plain text file inside that has the content. Alternatively, most scripting languages have libraries to extract the text from Office files (including Excel spreadsheets) so if you have some scripting knowledge, or know a bioinformatician who can do this, that's an option too.

Theresa Wohlever

Saurabh Gayali

i believe excel files annotation follow gff3 guidelines.

please read about gff3 file format and columns (which column holds what data)

if they match with your data then you have to add 3 lines at the start of notepad file (get any gff3 file from net and you will notice 2-3 lines at start starting with '#'.

Now paste your data on this notepad file and save as gff3 extension.

You will need your master chromosome sequence fast file to convert gff3 to genbank as genbank file contains sequences too. gff3 might contain sequences too but the tool we will use uses external fasta

the tool to convert gff3 to genbank is seqret

http://emboss.open-bio.org/rel/rel6/apps/seqret.html

please read the following discussion too

https://www.biostars.org/p/72220/

Muhammad Ali

Thank you Saurabh Gayali