Hi all,

I have a fasta file containing thousands of gene sequences. I want to create a GTF annotation file that describes the locations of these genes, their exons etc.

I tried to turn the fasta into a fake fastq file with BBMap, then upload to galaxy, use BWA aligner (all of which worked) and then use Cufflinks to assemble a GTF, but it didn't work. I think the problem might be that the sizes of the fake reads varies greatly; When I use the default parameters I get a gtf file with only one gene in it, and when I try to take the parameters to the extreme so that all lengths of reads will be taken into account, it gives me an error (return code -11, if that helps).

Does anyone have an alternative strategy, or a way to make this one work?

Thanks!

Similar questions and discussions