Hello, I am working with closely related endogenous retrovirus (ERV) sequences. I suspect these might have been the result of several integration events from different but related exogenous retroviruses according to host phylogeny and geography. Basically, distant hosts that share geographical distribution have higher ERV identity between theirs than with closer species. ERVs from a single species that is closely related to a few others and has the same habitat, is however very different from its neighbors.
So two possibilities: each ERV clade comes from a different virus or all of them descend from the same insertion event.
But it remains as a hunch. Is there some sort of statistical test or other kind of test that might at least support or oppose this claim?
Thank you in advance.