I have 83 samples of wild boar (a large, moderately mobile mammal). The sampling locations were assigned by placing coordinates at random within green areas (identified from Google Earth imagery) of the municipality of collection. As a result, when more than one animal was sampled in the same municipality, all of them share the same coordinates.

Since I want to perform a landscape genomics analysis with a population-based approach (because of the uncertainty associated with the individual coordinates), I identified 9 “clusters” by drawing a 10 km radius buffer around each unique location (i.e. one record per municipality), grouping together all individuals whose buffers overlapped, and discarding the stand-alone samples. I also verified that each cluster includes only individuals with at least 50% assignment to the same genetic cluster according to the ADMIXTURE analysis.
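
To make the clustering step concrete, here is a minimal sketch of how I understand it, not the exact GIS procedure I used. It assumes that two 10 km buffers "overlap" when their centres are less than 20 km apart (great-circle distance), that overlaps are chained transitively, and that "stand-alone" means a location whose buffer overlaps no other. The coordinates and the names `haversine_km` and `cluster_locations` are hypothetical.

```python
import math
from collections import defaultdict

EARTH_RADIUS_KM = 6371.0088
BUFFER_KM = 10.0  # buffer radius from the description above


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def cluster_locations(locations, buffer_km=BUFFER_KM):
    """Group unique locations whose circular buffers overlap (union-find).

    locations: dict mapping a municipality name to (lat, lon).
    Returns a list of sets of names; groups with a single location are dropped.
    """
    names = list(locations)
    parent = {n: n for n in names}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path halving
            n = parent[n]
        return n

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            # Assumption: two circles of radius r overlap when centres are < 2r apart.
            if haversine_km(locations[a], locations[b]) < 2 * buffer_km:
                parent[find(a)] = find(b)

    groups = defaultdict(set)
    for n in names:
        groups[find(n)].add(n)
    return [g for g in groups.values() if len(g) > 1]


# Hypothetical coordinates, for illustration only.
example = {
    "A": (43.00, 11.00),
    "B": (43.10, 11.00),  # roughly 11 km north of A
    "C": (43.20, 11.00),  # roughly 11 km north of B, linked to A through B
    "D": (45.00, 9.00),   # far from the others: stand-alone, dropped
}
clusters = cluster_locations(example)
```

One consequence of chaining overlaps this way is that a cluster can span more than 20 km end to end, since A and C are grouped through B even though their own buffers do not touch.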

Now, how should I report the environmental variables from the WorldClim database for each cluster? Should I first find the centroid of each cluster and retrieve the value for that location only, or should I retrieve the value for each sample included in a cluster and compute the mean? Any suggestion would be much appreciated!
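
To show what I mean by the two options, here is a small sketch. It does not read WorldClim: `toy_bio1` is a made-up, deliberately non-linear function standing in for a raster lookup, and the cluster coordinates and animal counts are hypothetical. The centroid is taken as the plain average of latitude and longitude, which I assume is acceptable only because a cluster covers a small area.

```python
from statistics import mean


def toy_bio1(lat, lon):
    """Hypothetical stand-in for a raster lookup (NOT real WorldClim data)."""
    return 25.0 - 0.5 * (lat - 40.0) ** 2


# Hypothetical cluster: (lat, lon, number of animals) per unique location.
cluster = [
    (42.0, 11.0, 5),
    (42.1, 11.1, 1),
    (42.2, 11.0, 2),
]

# Option 1: value at the centroid of the unique locations.
centroid_lat = mean(lat for lat, _, _ in cluster)
centroid_lon = mean(lon for _, lon, _ in cluster)
value_at_centroid = toy_bio1(centroid_lat, centroid_lon)

# Option 2a: mean of the values at the unique locations (one per municipality).
mean_over_locations = mean(toy_bio1(lat, lon) for lat, lon, _ in cluster)

# Option 2b: mean over individual samples, i.e. each location is weighted
# by the number of animals that share its coordinates.
n_total = sum(n for _, _, n in cluster)
mean_over_samples = (
    sum(n * toy_bio1(lat, lon) for lat, lon, n in cluster) / n_total
)

print(value_at_centroid, mean_over_locations, mean_over_samples)
```

In this toy case the three numbers differ: the centroid gives a single-point value, while averaging over samples pulls the result towards the municipality with the most animals. This is exactly the choice I am unsure about.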
