It was suggested to me that the ratio of the mean number of non-synonymous substitutions per non-synonymous site (dN) to the mean number of synonymous substitutions per synonymous site (dS) should be calculated using at least three sequences from each species. However, I did not understand the explanation given. Can anyone please explain why we should use multiple sequences from each species when calculating dN/dS?

My second question: what should be done if only one sequence is available for some of the species for the particular coding sequence under study?
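
To make the quantity I am asking about concrete, here is a minimal sketch of how I understand a mean dN/dS between two species to be obtained. It assumes Nei-Gojobori-style counts (non-synonymous differences and sites, synonymous differences and sites) are already available for each between-species sequence pair, and it applies the Jukes-Cantor correction to each proportion. The function names and the numbers in the usage lines are hypothetical, chosen only for illustration; they are not taken from any real data set or package.

```python
import math


def jc_correct(p):
    """Jukes-Cantor correction of a proportion of differences p (needs p < 0.75)."""
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def dn_ds(nonsyn_diffs, nonsyn_sites, syn_diffs, syn_sites):
    """Return (dN, dS) for one sequence pair from its difference and site counts."""
    dn = jc_correct(nonsyn_diffs / nonsyn_sites)
    ds = jc_correct(syn_diffs / syn_sites)
    return dn, ds


def mean_dn_ds(pair_counts):
    """Average dN and dS over all sequence pairs, then take the ratio.

    pair_counts: list of (nonsyn_diffs, nonsyn_sites, syn_diffs, syn_sites),
    one tuple per between-species sequence pair.
    """
    estimates = [dn_ds(*counts) for counts in pair_counts]
    mean_dn = sum(dn for dn, _ in estimates) / len(estimates)
    mean_ds = sum(ds for _, ds in estimates) / len(estimates)
    return mean_dn, mean_ds, mean_dn / mean_ds


# Hypothetical counts: one sequence per species gives a single pair...
single_pair = [(10, 700, 20, 200)]
# ...while several sequences per species give several pairs to average over.
several_pairs = [(10, 700, 20, 200), (12, 700, 18, 200), (9, 700, 22, 200)]

print(mean_dn_ds(single_pair))
print(mean_dn_ds(several_pairs))
```

With only one sequence per species, the "mean" is just the single pairwise estimate, which is the situation my second question refers to.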
