Isn't a prokaryotic gene continuous?

Satyabrata Nanda

To my knowledge, prokaryotic genes are usually continuous. The CDS (coding DNA sequence) is just the part of a gene that codes for the resulting protein (for example, a transcription factor). And yes, CDSs may also overlap, because many prokaryotic genes overlap and share a common promoter for transcription. As prokaryotic genes generally have no introns, a CDS and its corresponding gene can have the same coordinates.

Ravi Kanth Reddy Sathi

@Satyabrata Yes, I agree with you completely.

But how can a gene and its CDS have the same coordinates? That is what is bothering me. And it is not just one or two cases; there are many. Is it a mistake in the NCBI annotation system, or are the definitions of gene and CDS more or less the same?

Shouldn't a gene include the promoter and terminator sequences along with the CDS? Calling that a gene would be more apt. Then why do annotation systems annotate gene and CDS in the same way? :(

@Manual

Why don't annotation systems include the promoter and terminator of a CDS in the gene? With such an annotation there is no distinction between gene and CDS.

Marc Muller

The CDS is the protein-coding region of a gene, so when no introns or UTRs are annotated it naturally has the same coordinates as the gene. Another important feature is the transcribed region, that is, the CDS plus the 5' and 3' UTRs (untranslated regions) at either end. Often the precise ends of the transcript are not known (they cannot be predicted and have to be determined experimentally), and there may even be several initiation and termination sites. As for introns (additional non-coding regions within the CDS), they are very rare in prokaryotes, but they do exist (I seem to remember that the nitrogen fixation genes in A. tumefaciens have introns). The fact that this is annotated as a pseudogene is also important. I would suggest searching for homologues in the same or other species to see whether these introns are a general feature of this gene.

Jonathan Perreault

Concerning continuous genes: although the vast majority of genes in prokaryotes are continuous, a few contain introns. These introns are self-splicing and are akin to mobile elements (read up on group I and group II introns if you want to find out more).

Joao M.P. Alves

Ravi,

As said above, the CDS is the coding sequence, and the gene is the whole gene. You will notice the difference more easily (or at all) when you look at eukaryotic genes.

The CDS is therefore made up of the parts of the sequence that will be in the protein, while the gene also contains introns and UTRs, if defined. Because UTRs are relatively rarely defined in genomic sequences, the start and end coordinates of CDS and gene features often coincide, but this need not be the case.

As an example, look at this gene:

http://www.ncbi.nlm.nih.gov/nuccore/KF112870.1

Here, the coordinates of gene and CDS are not the same:

gene 1285..45402

CDS join(2073..2550,10331..10378,35742..35807,39696..39900,40291..40354,44111..44227)
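To make the difference concrete, here is a minimal Python sketch that compares the two features above. The helper names `parse_location` and `span` are made up for illustration, and the parser assumes simple forward-strand locations only (no `complement(...)`, no `<` or `>` partial markers).

```python
import re

def parse_location(location):
    """Return (start, end) pairs, 1-based and inclusive, from a simple
    GenBank location such as '1285..45402' or 'join(10..20,30..40)'.
    Assumption: forward strand only, no partial markers."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def span(segments):
    """Outermost start and end covered by a list of segments."""
    return min(s for s, _ in segments), max(e for _, e in segments)

gene = parse_location("1285..45402")
cds = parse_location(
    "join(2073..2550,10331..10378,35742..35807,"
    "39696..39900,40291..40354,44111..44227)"
)

gene_span = span(gene)   # outer coordinates of the gene feature
cds_span = span(cds)     # outer coordinates of the CDS feature
cds_length = sum(e - s + 1 for s, e in cds)  # coding bases only, introns excluded

print("gene span:", gene_span)
print("CDS span: ", cds_span)
print("same coordinates:", gene_span == cds_span)
print("coding length:", cds_length, "nt")
```

For a typical prokaryotic record, where the CDS is a single interval and no UTRs are annotated, the same comparison would report identical spans.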

Also, refer to the NCBI definition (from http://www.ncbi.nlm.nih.gov/books/NBK21106/):

CDS

coding region, coding sequence. CDS refers to the portion of a genomic DNA sequence that is translated, from the start codon to the stop codon, inclusively, if complete. A partial CDS lacks part of the complete CDS (it may lack either or both the start and stop codons). Successful translation of a CDS results in the synthesis of a protein.

Notice they do not get into the hairy business of trying to define gene. ;-)
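The complete-versus-partial distinction in the NCBI definition quoted above can be sketched as a small check. This is only an illustration: the function name `classify_cds` is hypothetical, the start codon set is an assumption limited to the most common bacterial start codons, and the in-frame length test is an extra sanity check that is not part of the quoted definition.

```python
# Assumption: the most common bacterial start codons; real translation
# tables allow a few more alternatives.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}

def classify_cds(seq):
    """Label a coding sequence as 'complete' or 'partial'.

    'complete' means it begins with a start codon, ends with a stop
    codon, and its length is a multiple of three."""
    seq = seq.upper()
    has_start = seq[:3] in START_CODONS
    has_stop = len(seq) >= 6 and seq[-3:] in STOP_CODONS
    in_frame = len(seq) % 3 == 0
    if has_start and has_stop and in_frame:
        return "complete"
    return "partial"

print(classify_cds("ATGAAACCCTAA"))  # start and stop codon both present
print(classify_cds("AAACCCTAA"))     # lacks the start codon
print(classify_cds("ATGAAACCC"))     # lacks the stop codon
```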

Cheers

Ole Skovgaard

I agree with Joao. The reason that UTRs and promoter regions are not included in most prokaryotic annotations is that precise mapping is (still) largely missing; the main technical problem is the instability of mRNA in bacteria.

My first advice is always: look at what you have in your hands!

Here that translates into: BLAST the intervening sequence (use BLASTX) and learn that it is either phage-related or IS-related; then BLAST (use TBLASTN) your translation of the CDS and look at the results; they will tell you.

Go ahead!
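As a rough sketch of the first step, the intervening sequence can be cut out from the CDS segment coordinates before it is pasted into BLASTX. The function names, the toy sequence, and the example coordinates below are all hypothetical; coordinates are assumed to be 1-based and inclusive, as in GenBank feature tables, and on the forward strand.

```python
def intervening_intervals(segments):
    """Return the gaps between consecutive CDS segments.

    segments: list of (start, end) pairs, 1-based and inclusive.
    Adjacent or overlapping segments produce no gap."""
    ordered = sorted(segments)
    gaps = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps

def extract(sequence, interval):
    """Slice a sequence string using a 1-based inclusive interval."""
    start, end = interval
    return sequence[start - 1:end]

# Hypothetical split CDS: two coding segments with an insertion between them.
toy_genome = "ATGAAACCC" + "GGGTTTGGG" + "AAACCCTAA"
cds_segments = [(1, 9), (19, 27)]

for gap in intervening_intervals(cds_segments):
    print(gap, extract(toy_genome, gap))
```

The printed sequence is what would go into the BLASTX query; the translated CDS for the TBLASTN step has to be produced separately.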