Can anyone clarify a calculation question with regards to dN/dS?

More C. S. Mukhopadhyay's questions See All

How might I enter new miRNA sequences to miRBase?

We have obtained some novel miRNA sequences of buffalo, following NGS of sRNA sequences (Ion Torrent platform). I have contacted the miRBase however, no communications received. Please let me know...

05 June 2015 9,053 9 View

How can I proceed with isoMir Data?

We have got an extensive list of IsoMir (cow genome used as map-file) after analyzing miRNA-seq data by miRAnalyzer (data were custom analyzed by s sequencing agency). Now we have the list in the...

05 June 2015 4,728 2 View

How can I proceed for Annotation and Characterization of novel miRNA?

We have sequenced miRNA of bubaline/buffalo leukocytes. Some novel miRNAs have been obtained. One example file has been attached. We are now blasting each of the novel miRNA sequence against the...

02 March 2015 5,983 5 View

How can I proceed for Annotation and Characterization of novel miRNA?

We have sequenced miRNA of bubaline/buffalo leukocytes. Some novel miRNAs have been obtained. One example file has been attached. We are now blasting each of the novel miRNA sequence against the...

02 March 2015 459 2 View

What is the mistake in this PERL code?

I am a novice in this field. Please help me identify the mistake in this code to use GetOptions. I am using Windows OS (Win8.1) #!/C:/Perl64/bin/perl require 5.16.3; # use strict; use...

05 June 2014 302 6 View

Which type of sequence should be used for phylogenetic tree construction, conserved or non conserved?

Suppose I have got a pair of sequences (100 amino acid each), one is conserved cds and another one is a non-conserved coding sequence. 1. Which one should be used for phylogeny? 2. Do these two...

05 June 2014 4,340 12 View

Why are common ancestors extinct?

Look at any phylogenetic tree we will find the internal nodes. However, do we have the common ancestors which are occupying those internal nodes in the real world? Sometimes we come up with...

03 April 2014 8,534 6 View

How is bioinformatics different from biostatistics?

It is well known that bioinformatics uses the knowledge of molecular biology, statistics, computer science and information technology (a part of computer science only). Can we say bioinformatics...

03 April 2014 7,059 7 View

Why is culturing Lymphocytes only possible in vitro?

Why can lymphocytes be cultures in vitro, but not the other type of blood cells, like leukocytes (viz. granulocytes and monocytes) and erythrocytes? Although certain reports are there regarding...

07 August 2013 2,679 0 View

Can someone recommend links to bioinformatics sites?

How useful is this site? http://bioinformaticssoftwareandtools.co.in/

07 August 2013 7,489 33 View

Which Scopus Journal provides the most affordable fees?

"PUBLISHING IN A SCOPUS JOURNAL" Researchers are now at a cross road. The critical need to publish in a Scopus or ISI, etc journal is ever vital. Journal Publication fees must be submitted....

10 August 2024 8,621 1 View

Seeking Advice on Viability and Execution of Undergraduate Thesis Topic?

Hello everyone, I am currently developing a thesis proposal and would appreciate your input on its viability and how to effectively carry it out. My proposed topic is: "Does the perceived threat...

10 August 2024 8,992 0 View

Who will be moral responsible for the death of thousands of people in the event of an earthquake?

Who will bear moral responsibility for the deaths of thousands of people in the event of an earthquake? Weeks and months remain before the onset of strong earthquakes that bring death to...

08 August 2024 6,134 12 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

The Bigger You Are, the Harder You Fall (some lessons from Dinosaurs)?

Evolutionary fitness is based on an organism’s ability to adapt rapidly to changing environmental circumstances. Large-bodied mammals have been equipped with large brains (and hence a high...

06 August 2024 4,849 2 View

Are there any instruments for studying time similar to the way it is in space?

There are a huge number of methods for studying objects in space, according to the senses (and not only). Mechanical, thermal, optical, acoustic, electrical, magnetic, based on particle beams,...

06 August 2024 7,102 0 View

Measuring the Intelligence of a Species?

Larger brains, which typically contain more neurons, store and transfer more information (Tehovnik and Chen 2015), but the precise relationship between number of neurons and information has yet to...

05 August 2024 1,238 2 View

The Curse of Evolution and Complexity?

Brain and body mass together are positively correlated with lifespan (Hofman 1993). The duration of neural development is one of the best predictors of brain size, and conception is the best...

05 August 2024 6,247 3 View

In the case of a wound l recurrence after radical breast cancer and sentinel lymph node biopsy. Are the sentinel lymph node procedure recommended?

In the case of a wound l recurrence after radical breast cancer and sentinel lymph node biopsy. Are the sentinel lymph node procedure recommended? If no axillary lymph node dissection was not...

05 August 2024 8,056 1 View

Regarding a model for simulating battery charge and discharge, what do you consider to be high fidelity?

Regarding a model for simulating battery charge and discharge, what do you consider to be high fidelity? What is the acceptable percentage of error (regardless of the metric)? Could you suggest...

03 August 2024 5,358 0 View

Dziedzom Komi de Souza

I guess the more sequences you have, the more confidence you have in your results, especially if you are dealing with means. With one sequence you cannot tell if there are differences between dN or dS in different sequences for the same species. But if you have only one sequence for a species, you just have to go with the assumption that that sequence is uniform irrespective of the number of sequences, and that there are no mutations that may change your dN/dS ratio.

C. S. Mukhopadhyay

Thanks.

Brian Thomas Foley

A single sequence from one species has to be compared to sequences from other species. You cannot calculate changes from one sequence. It has often been claimed that a dN:dS ratio of 1.0 indicates neutral drift or no selection pressure on a protein, but nearly all proteins are under negative and/or positive selection pressures with some codons being selected for changes while other codons are selected to remain constant. The HIV-1 envelope protein, for example, has quite a few absolutely invariant sites and some sites that evolve very rapidly due to host immune selection pressure.

If there is a 3D structure known for your protein, or a very similar protein (similar enough to do a sequence alignment of your sequence to the 3D structure sequence) you can use the ConSurf tool (http://consurf.tau.ac.il/overview.html ) to color the 3D structure by rate of evolution. For most proteins, the key catalytic sites are invariant while the rest of the molecule is free to evolve.

Even within a species, you may need to carefully consider how you want to calculate the scores for the data. If you have ten sequences from each of two isolated sub-populations, for a hypothetical example, and one population has GAA = Glu for one codon and the other has GAG = Glu for that same codon, it is more likely that this was a single G to A or A to G mutation in one of the founders of one of the populations, than many independent events. The phylogenetic methods like PAML as discussed by Adam Retchless, aim to help with this. But the calculation may assume that the data came from random sampling of a single well-mixed population. Very often, the data we have to work with violates the simple assumptions of the models used to analyze the data. This does not male the analyses worthless, but does mean that we should carefully consider how the results may be biased by such things as non-random sampling, lack of mixing in the population, etc...

Frantz Depaulis

I agree with Adam that intraspecific variation/polymorphism may behave differently from interspecific /divergence one. So that having several sequence from a species is a way to set appart fixed differences from polymorphisms. However it would be relevant only if you do a branch specific dN:dS analyses. Moreover, intraspecific dN:dS component may be largely affected by recombination, since methods assume a single tree. As an alternative to PAML, I recommend datamonkey, the user friendly servor HYPHY that is also more flexible.

Graham P Wallis

Replicates within species give a timescale for the type of selection that you are observing. If your gene is like MHC, with balancing selection maintaining lots of variation within species, as well as adaptive evolution among taxa, variants within species should also show the effect, provided sample size is large enough. In contrast, deep comparisons can start to show loss of significance, as synonymous substitutions start to accumulate while AA substitutions start to saturate. I think the best solution is to analyse character state change in a phylogenetic context, breaking up any long branches as far as is possible.

Michael Philip Schwarz

We have been examining very large numbers of haplotypes for mt sequences in some very recently diverged species. We find that synonymous changes are very frequent within species and nonsynonymous changes are mostly restricted to species level divergences. This suggests purifying selection within species and perhaps some adaptive changes surrounding speciation. So, in terms of your original question, you might get some interesting data if you sequence multiple specimens. But as other commentators have noted, if you have only limited sequence data you might be unlucky to encounter bp changes that have not yet been subjected to selection. Also, be aware that inferences of dN/dS depend strongly on your ability to infer ancestral states. A small number of outgroup sequences will compromise this ability. You need to step carefully here and be aware of the limitations that small data sets impose. More is better, so get as many sequences as you can.

Thanks for your answers