Is the mean difference in age of people who smoke and people who do not smoke statistically significant (in my sample)?
The sample size is 320 (39 smokers and 281 non-smokers). The age in this sample is not normally distributed (highly positively skewed). Mean age of smokers is 34.4 (SD-10.4) and in non smokers the mean age is 31.0 (SD- 10.5).
When I perform an independent t-test I get the p value of p=.065. So, I then transform my data to resemble normal distribution (which was done successfully by ln10 (age)) and perform the t test again and now the p value is .037.
In the end I perform Mann Whitney U test and the p value is .029.
So are the smokers significantly older in this sample than the non-smokers? I thought that when data were non-normal and I perform a parametric test I was increasing the chances of obtaining a significant result (where there was none) and not the other way around (not obtaining a stat. sig result where there was a difference). Any thoughts on this are very welcomed.