Transcripts assembled from RNA-Seq data are often incomplete and sometimes lack the beginning or the end, so a transcript may not contain the regular start codon. What is the best or most convenient way to identify a premature stop codon in such transcripts? Should I translate each transcript into protein first, pick the longest protein sequence, and then locate the stop codon?
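
To make the idea concrete, here is a rough plain-Python sketch of what I mean. It assumes the standard genetic code and that the transcript is already in the sense orientation, so only the three forward frames are scanned. The function names `translate` and `scan_frames` are made up for this illustration. Without a reference coding sequence a stop is only "premature" relative to the expected protein length, so the sketch just reports every in-frame stop and the longest stop-free stretch per frame; it does not require a start codon, since the 5' end may be missing.

```python
from itertools import product

# Standard genetic code, built in the usual TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq, frame=0):
    """Translate one forward frame; '*' marks a stop, 'X' an unknown codon."""
    seq = seq.upper().replace("U", "T")
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def scan_frames(seq):
    """Report stops and the longest stop-free peptide for frames 0, 1 and 2."""
    results = []
    for frame in range(3):
        protein = translate(seq, frame)
        # 0-based nucleotide position of the first base of each stop codon.
        stops = [frame + 3 * i for i, aa in enumerate(protein) if aa == "*"]
        # Longest stretch between stops; no start codon is required.
        longest = max(protein.split("*"), key=len)
        results.append({
            "frame": frame,
            "protein": protein,
            "stop_positions": stops,
            "longest": longest,
        })
    return results


if __name__ == "__main__":
    for hit in scan_frames("ATGGCCAAATGACCC"):
        print(hit["frame"], hit["protein"], hit["stop_positions"], hit["longest"])
```

A stop that falls well before the end of the frame with the longest open stretch, or well before the stop position in a homologous reference protein, would then be a candidate premature stop codon.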
