Hi friends,
I have 140 samples and want to see how the relationship between the IV and the DV differs when the IV is at low, medium, and high levels. However, the sample sizes for the three levels are different: low is 75, medium is 29, and high is 35.
My rough idea is to randomly choose 30 of the 75 in the low group, keep all 29 in the medium group, and randomly choose 30 of the 35 in the high group. After that, I would run a linear regression in SPSS for each group and compare the R² and p-values. My questions are: Is the random sampling necessary? Can I compare R² when the sample sizes of the groups are different?
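To make the plan concrete, here is a rough Python sketch of what I mean (the real analysis would be in SPSS). The data are made up (DV = 2*IV + noise), and the names `make_group` and `r_squared` are just placeholders I invented for illustration. It only computes R² for each group twice, once on the full group and once on a random subsample of at most 30, so the two can be put side by side; it does not compute p-values.

```python
import random


def r_squared(x, y):
    """R^2 of a simple linear regression of y on x (the squared Pearson r)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy ** 2 / (sxx * syy)


def make_group(n, rng):
    # Hypothetical data only: DV = 2 * IV + Gaussian noise. Not my real data.
    xs = [rng.uniform(0, 10) for _ in range(n)]
    return [(x, 2.0 * x + rng.gauss(0, 5)) for x in xs]


rng = random.Random(0)  # fixed seed so the subsample is reproducible
sizes = {"low": 75, "medium": 29, "high": 35}  # group sizes from my post
groups = {name: make_group(n, rng) for name, n in sizes.items()}

results = {}
for name, rows in groups.items():
    k = min(30, len(rows))          # 30 for low/high, all 29 for medium
    subsample = rng.sample(rows, k)  # random draw without replacement
    full_r2 = r_squared(*zip(*rows))
    sub_r2 = r_squared(*zip(*subsample))
    results[name] = (len(rows), full_r2, k, sub_r2)
    print(f"{name}: n={len(rows)} R2={full_r2:.3f} | n={k} R2={sub_r2:.3f}")
```

Rerunning with a different seed gives a different subsample, so the subsampled R² for the low and high groups changes from run to run, while the full-group R² does not.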
If my idea is wrong, how should I treat the data? Many thanks!