Dear colleagues,
we now have the information of genomic features for many organisms, including the protein-coding genes, miRNAs, lncRNAs, et al, but new genes, especially the newly defined classes of genes, are identified continuously, for example, ascribed to the high tissue- and stage- specificity of lncRNAs, independent work would always identify hundreds or even thousands of new transcripts. Thus, we should resolve the problem, that when we annotated a large number of genes/transcripts ( include the annotated and unannotated genes/transcripts), how to merge these information with the exist annotation?
I have noticed the cuffmerge could do it in a way, but it would generate new transcripts when merging the GTFs, which is not expected sometimes.
Are there some other tools with a better performance? or any suggestion on parameter setting of cuffmerge?