I got a 370mers sequence found in humanReferenceGenome and a RetrovirusGenome. Could any one tellme steps to evaluate possible role in the given genome?
You can do 6-frame translation and see if any proteins can be coded by the sequence. See the attached screenshot for the 6-translated frames. Then you can search the obtained amino-acid sequences for possible protein matches (I used CS-BLAST) . You get a perfect match in your 5'3' Frame 3 translated sequence, with a human and retrovirus protein. I think these are the sequences you are looking for. Just read about the protein in the two organisms to find out its roles, it looks like it's the reverse transcriptase, so it is responsible for translating viral RNA into DNA.
No 1 (human)
>gb|AAD51793.1| Gag-Pro-Pol-Env protein [Homo sapiens]
I made an algorithm that runs in a intel corei7 searching for virus subsequences along human or animal genomes. Whenever it finds something i try to understand what role could be playing the just-found sequence