I have whole-genome data sets of SARS-CoV-2 from different countries. I want to exclude genome sequences that are 100% identical to one another, and I would like to do this by clustering analysis.

1) Can anyone suggest a tool that works well for clustering viral whole genomes?

2) Is there any other method that I should know about?
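To make the goal concrete, here is a minimal Python sketch of what "excluding 100% identical sequences" could mean in its simplest form: hashing each genome and keeping the first record per hash. It is only an illustration under stated assumptions, not a clustering tool. The function names `read_fasta` and `drop_exact_duplicates` are hypothetical, the input is assumed to be plain FASTA, and "identical" here means the same string after upper-casing, so genomes that differ only in length, in ambiguous bases such as N, or where one is contained in another would not be merged.

```python
import hashlib
from typing import Iterable, Iterator, List, Tuple

Record = Tuple[str, str]  # (header without '>', sequence)


def read_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def drop_exact_duplicates(records: Iterable[Record]) -> List[Record]:
    """Keep the first record for each distinct sequence (case-insensitive).

    Assumption: two genomes count as duplicates only if their full
    sequences are the same string after upper-casing.
    """
    seen = set()
    unique = []
    for header, seq in records:
        # Hash the sequence so only a short digest is stored per genome.
        digest = hashlib.sha256(seq.upper().encode("ascii")).hexdigest()
        if digest in seen:
            continue  # exact duplicate of an earlier record
        seen.add(digest)
        unique.append((header, seq))
    return unique


if __name__ == "__main__":
    # Tiny made-up example; real input would come from open("genomes.fasta").
    demo = [">A", "ACGT", "ACGT", ">B", "acgtacgt", ">C", "ACGA"]
    for name, seq in drop_exact_duplicates(read_fasta(demo)):
        print(name, len(seq))
```

A file handle can be passed directly to `read_fasta`, since it iterates line by line. Anything looser than exact string identity (for example, treating N as a wildcard) would need alignment or a dedicated clustering tool, which is what the questions above are asking about.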

A big thank you in advance.
