So i am performing correlation test (pearson) on 500 genes. and getting the r value and p value. In total we get 500*499/2=124750 tests.

I know next step is to perform multiple comparaison check using FDR or Bonferroni procedures. lets say we have chosen FDR for getting adjusted p value.

My question is if we first filter the comparisons based on a specific r value like 0.4. say we have now filtered 1000 comparisons out because the absolute r value was greater than 0.4. Now we run fdr for multiple comparisons. then it is going to use 1000 p values only of course. Are we being biased here? can we do this actually? because actually we performed 124750 tests and I am not sure if i am going the right way?

More Amnah Siddiqa's questions See All
Similar questions and discussions