Hi everyone,
I'm currently working on the splicing variants of a protein. For the analysis, I extracted variants from two databases in Fasta format. Before downstream processing, I want to make sure that no variant is repeated. So, how can I check this? I m using standalone BLAST on Linux. I tried using MegaBlast, but I'm not very confident about the results... in results e value is mostly zero and show 100 percent identity among sequences, bit score goes very high sometimes... the only thing which differ is query coverage... the start as well as end sites position vary in query and database sequence... I had to ask u... in the splice variant case can I make the query start and end difference as the basis of the difference of splice variants. I shall be waiting for ur response.
Mazhar Hussain