I am working with an in-silico reference made simply by concatenating a couple of thousand genes that I know belong to the same organism. Aligning my seq data against this reference, I get very good mapping across most genes. But for some, only a small portion of the gene is seen mapped.
Assuming that the genes I use for the reference are not completely annotated, if, say, only 25% of a particular gene is mapped, it could either mean it is a splice variant, or simply that the reads that do show mapped there, actually belong somewhere else along the genome. Is there a way to identify what the true case might be?