I am trying to find the best metric to determine the diversity of protein-dataset, the dataset has 5000 proteins. Can someone please help me in identifying the best.
I think.. your's dataset is too high matric to detect determine the diversity of protein, maybe you can set it 1500 - 3500, sorry if it does not match with your result, I just think the diversity of protein is not too mattered to our health, thanks.