I have noticed on multiple occasions that some genes do not have an ATG start codon and instead start with TTG. Read literally from the codon table, the protein should therefore start with L (leucine) and not M (methionine).

However, even when the sequence starts with TTG, NCBI never has L as the first amino acid, instead reverting to M. I have had to change my submission on occasion because their submission system would not accept an L at the start.

Are alternative start codons underrepresented in databases? A case in point is OXA-20. See https://www.ncbi.nlm.nih.gov/nuccore/AY307114?from=1243&to=2043, where the start codon is TTG, which is translated as M in their submission (I have sequenced this gene too, and it starts with TTG). Or do bacteria read TTG as M when it is at the start of translation and as L thereafter?
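To make that second possibility concrete, here is a minimal Python sketch of what such a rule could look like in a translation tool: the first codon is reported as M if it is a recognised start codon, and every later TTG is read as L. This is only an illustration, not NCBI's actual code. The names `translate_cds` and `START_CODONS` and the short test sequence are made up for the example, and the start set is limited to ATG, GTG and TTG, which are among the start codons listed in NCBI's bacterial translation table 11.

```python
from itertools import product

# Standard amino-acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption for this sketch: only the three most common bacterial starts.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna, first_codon_as_start=True):
    """Translate a coding sequence, stopping at the first stop codon.

    If first_codon_as_start is True and the first codon is in START_CODONS,
    it is reported as M regardless of its usual meaning.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and first_codon_as_start and codon in START_CODONS:
            aa = "M"  # initiator position: reported as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical toy sequence: TTG AAA TTG TAA
print(translate_cds("TTGAAATTGTAA"))                              # MKL
print(translate_cds("TTGAAATTGTAA", first_codon_as_start=False))  # LKL
```

With the start rule switched on, the leading TTG comes out as M while the internal TTG stays L, which is the behaviour I seem to be seeing in the NCBI records.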
