Hi! I am trying to build a phylogeny for 100+ Hemipteran taxa. I was able to make a super-matrix tree using 13 genes, but there was a problem with it. One of the subgroups should be monophyletic but it was coming out as paraphyletic. Eventually I was able to solve that using two different approaches.

The first was that checked the trees produced using each gene on its own and I examined which genes were giving the group as paraphyletic. After excluding these genes, I got the expected topology, but I lost 20+ taxa. Among the genes I had to remove, was CO1 and I am not sure if that's ok or not.

The second approach was that I created a constrain tree by hand which was separating my taxa in 3 groups; the outgroup, and the two main groups which were making up my sample of Hemipteran taxa. I left every other relationship on this tree as a polytomy, to be resolved by RAxML. These worked and gave a tree similar to the one I got using the previous approach, allowing me to keep all my taxa and the CO1 sequences.

Are these two approaches acceptable? Is one of them better than the other?

Thank you all for taking the time to read my question. Any feedback will be greatly appreciated!

More Ioannis Dimopoulos's questions See All
Similar questions and discussions