Good day!

I have some predicted sequences that was not previously annotated in plant genome.

Than It was found that it's possible to BLAST these sequences using RNAseq data found in ncbi SRA archieve and to get the .sam file containing aligned results.

The question is: how to understand if a sequence is transcribed?

In the first case all the query sequence is covered by RNAseq reads, so I think it's definitelly YES.

But what about other cases?

What percentage of sequence must be covered to understand it's definitelly transcribed? Could there be gaps?

Similar questions and discussions