I have hundreds of samples with different sample sizes/counts (3,000 to 150,000). Due to the uneven sizes, comparing the richness between samples can be tricky without rarefying. However, if I rarefy the data to 3,000 counts per sample, many samples' richness would drop significantly, which is also tricky. I wonder if I should use the rarefied richness or the normal richness for data interpretation.

I personally don't like rarefying, which might throw away many rare taxa. I know Shannon which takes into account the relative abundance of each species can to some extent solve this problem. But what if I'm specifically interested in the richness?

I hope someone could provide some enlightening comments on this.

More Haitao Wang's questions See All
Similar questions and discussions