I have recently identified several snp's from one gene in one population. I want to correlate the genotype with a reference like HapMap. However, the distribution of all snps is heterogenous. Moreover, some snps have a very low prevalence (> 0.01). Such a frequency could potentially bias the HWE as it only shows a difference of 1 compared to zero reference. Therefore, I was wondering what would be the minimal amount/percentage to compare and use for the HWE?