I recently wrote an algorithm that compares multiple protein sequences in a multialignment and assigns a value for each amino acid position depending on the variation in that position. If only one single amino acid is found in a certain position, "100%" is assigned (= 100% conservation). If more than one amino acid is found in a certain position, this number drops. It drops stronger if amino acids of very different character occupy the position (e.g. arginine and tryptophan). It drops lesser if relatively similar amino acids occupy the position (e.g. tyrosine and phenylalanine).

My question is: Can I claim that my results depict the "conservation" of the protein sequence? With other words, is the term "conservation" strictly defined? Or is it a vague term carrying a lot of flexibility and requiring definition in a certain context?

I am asking because people may argue, for instance, over questions like "what is a similar amino acid"? Probably everybody would agree that valine is similar to leucine (both hydrophobic and aliphatic). But is valine also similar to tryptophan? In an way yes, because both are hydrophobic. In a way no, because one is aliphatic and the other is aromatic. For my algorithm I had to make some arbitrary decisions about similarity. Not everybody might agree with my decisions. Therefore, I am wondering, is the term "similar" clearly defined in the field? Or do evolutionary biologists define the term "similar" in a flexible way, according to their needs?

Wouldn't, for instance, someone interested in tyrosine phosphorylation claim that tyrosine is vastly different from phenylalanine, whereas someone interested in protein folding might consider those amino acids very similar?

Similar questions and discussions