I have a question about the best way to test for violations of Hardy-Weinberg Equilibrium (HWE) among microsatellite loci for a species that is continuously distributed across a study area and showing IBD. We are looking at how landscape features affect gene flow among Eastern Indigo Snakes across a 25 x 50 km study area. We have 110 samples and about half are clustered in the southern half of the study area. A spatial correlogram of individual genetic distance shows that spatial autocorrelation among samples becomes non-significant at 5-10 km. We have used COLONY to identify full-sibs and found about 15 full-sib families although family size was usually two (max. four). There is significant IBD within our study area. STRUCTURE identifies K=4 with all 110 samples but when we randomly exclude all but one full-sib from each full-sib family STRUCTURE identifies K=1-2. We suspect that these STRUCTURE results are the result of neighborhood effects and IBD, respectively.
When we test for violation of HWE at our 15 loci, four have significant violations of HWE. Estimated null allele frequencies at these four loci are 6-15%. When we randomly excluded all but one member from each full-sib family, these four loci were still significantly out of HWE.
In a situation such as this, is it appropriate to test for HWE using all samples? I know that in systems with discrete populations researchers often test for HWE within each population, since violations may represent a mixture of multiple populations. But any designations of “populations” in our study area seem very arbitrary (e.g., driven by sampling intensity rather than the distribution of individuals).
Does anyone have any suggestions about the appropriate way(s) to test for HWE in a system such as ours?
Thanks,
Javan Bauder