Hi there,

Let me start by describing the problem at hand.

We have an 19-loci autosomal STR dataset that we would like to 'compare' with a number of other population datasets, each with a different autosomal STR loci coverage that are not in common with the 19-loci dataset at all loci. Among others, one way to do this kind of analysis is of course to carry out 'population differentiation tests', such as through the Arlequin software. In other words, using this method, one can do locus-by-locus (pairwise) comparisons of allele frequencies among a collection of x number of population datasets, and since each dataset may have a different loci coverage, one can end up with a total of y number of comparisons among these x number of datasets at one loci, and z number of comparison at a different loci.

Now, appreciating the fact that that there has already been some prior discussion on related subject matter at Research Gate, and that 'population differentiation tests' may neither be the sole option nor the best option for that matter, we would like to hear other colleagues' opinion on:

If a Bonferroni correction, such as for Type I errors and/or multiple testing,  would be calculated in this case, whether this should take into consideration the total number of loci analyzed (which varies from one population dataset to another) or whether the number of populations used for each set of analysis (e.g. the total number of population datasets with data at a given locus that are used for comparisons at that locus) should be used.

Once again,  'population differentiation tests' will not be the only method used to compare our dataset with those from other populations. In any case, prior to any corrections,  such 'population differentiation tests' suggest a number of statisticall significant p-values (i.e. below the threshold of 0.05), a good proportion of which are also below 0.0005

Many thanks in advance for your answers.

More Cemal Gurkan's questions See All
Similar questions and discussions