09 September 2016 9 10K Report

Hi everyone! I need to run a series of simple mean comparisons between two groups from a very large sample (n = 986). One of the variables of interest is gender, and sample sizes are extremely unbalanced between men and women (men: 156, women: 830). A typical Students T test for independent samples turns out to be significant, but the effect is clearly spurious and relies solely on sample size. Therefore, I wanted to ask what would be the correct approach to compare means between two groups with this kind of N. Should I choose a random subset from the women group and run the comparison using this data as representative sample? I've seen this kind of strategy on other works, but I'm not sure it is the most appropiate.

Thank you so much!

More Angel Tabullo's questions See All
Similar questions and discussions