10 December 2018 4 730 Report

I am making a tree with the whole genome of over 100 samples. They are very similar, with ~95% of sites identical. If I want to make a tree with the whole genome to compare their overall similarity, can I only use sites that are not identical to all? I mean, 95% of sites in the sequences alignment contains only identical bases, which cannot tell the differences among them.

More Xiaolong Cao's questions See All
Similar questions and discussions