I want to collect sequences from different regions to do phylogenetic analysis. Do I need to perform a statistical analysis before or after collecting sequences to avoid biased data?
Different regions mean different geographical locations. Sorry for the ambiguity.
I want to investigate the epidemic and evolution of Human Enterovirus 71 based on data collected from different locations. In location A , there is few sequences available , but in location B , sequences are abundant. To draw rigorous conclusion , I wander the necessary to peform statistical analysis , to support that the collected sequences can perform consequential phylogenetic analysis in perspective.