I have whole-genome data sets of SARS-CoV-2 from different countries. I want to exclude genome sequences that are 100% identical to another sequence in the set, and I would like to do this with a clustering analysis.
1) Can anyone suggest a tool that works well for clustering viral whole genomes?
2) Is there any other method I should know about?
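To make the goal concrete, here is a minimal Python sketch of what I mean by excluding 100% identical sequences. It assumes plain FASTA input and treats two genomes as identical only if their sequence strings match exactly (ignoring case), so genomes that differ only in length, gaps, or ambiguous bases would still be kept as distinct. The function names `read_fasta` and `drop_exact_duplicates` are made up for illustration and do not come from any existing tool.

```python
import hashlib
from typing import Dict, Iterable, Iterator, List, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def drop_exact_duplicates(
    records: Iterable[Tuple[str, str]],
) -> Tuple[List[Tuple[str, str]], List[List[str]]]:
    """Keep the first record of each group of identical sequences.

    Returns the unique records and, for each unique sequence, the list
    of headers that share it (in input order).
    """
    unique: List[Tuple[str, str]] = []
    groups: Dict[str, List[str]] = {}
    for header, seq in records:
        # Hash the upper-cased sequence so memory use stays small
        # even for many ~30 kb genomes.
        key = hashlib.sha256(seq.upper().encode("ascii")).hexdigest()
        if key not in groups:
            groups[key] = []
            unique.append((header, seq))
        groups[key].append(header)
    return unique, list(groups.values())


if __name__ == "__main__":
    # Toy input; real data would be read with open("genomes.fasta").
    example = [">a", "ACGT", "acgt", ">b", "ACGTACGT", ">c", "ACGTAC"]
    kept, clusters = drop_exact_duplicates(read_fasta(example))
    for header, seq in kept:
        print(f">{header}\n{seq}")
    print(clusters)
```

This only handles exact string identity; what I am asking about is whether a clustering tool can do the same thing in a more robust way.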
A big thank you in advance.