I am building a model with:
- Non-normal continuous dependent variable (a score on a questionnaire which does not have a literal clinical meaning). It appears fairly normal on a histogram, but there are outliers that make it non-normal on the Shapiro-Wilk test (swilk).
- All categorical independent variables (more than 3)
The sample size is 257.
No transformation makes the DV normal. However, if I square it, the multiple linear regression produces normally distributed residuals, although there is a slight violation of homoscedasticity.
I am assessing these options:
- Use the multiple linear regression model with robust standard errors.
- Use a factorial ANOVA. Is there a robust correction for this in Stata? Can I get away with a non-normal DV?
- Use the square transformation. Would the coefficients then be difficult to interpret?
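For reference, this is roughly what I mean by the regression and square-transformation options in Stata. It is only a sketch: the variable names (score, iv1 to iv4, score_sq, resid) are placeholders rather than my real variables, and I am not sure this is the right way to set it up.

```stata
* Placeholder names: score is the DV, iv1-iv4 are the categorical IVs

* Plain OLS fit, used here only to inspect the residuals
regress score i.iv1 i.iv2 i.iv3 i.iv4
predict double resid, residuals
swilk resid          // Shapiro-Wilk test on the residuals
estat hettest        // Breusch-Pagan / Cook-Weisberg test for heteroskedasticity
rvfplot              // residual-versus-fitted plot

* Same model with heteroskedasticity-robust standard errors
regress score i.iv1 i.iv2 i.iv3 i.iv4, vce(robust)

* Joint Wald test of all levels of one factor, based on the robust variance estimate
testparm i.iv1

* Squared DV, fitted with the same predictors
generate double score_sq = score^2
regress score_sq i.iv1 i.iv2 i.iv3 i.iv4
```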
A few more questions on assumptions:
- How do I test for independence of the residuals in a simple way?
- How do I test for a linear relationship between (1) the IVs and the residuals and (2) the IVs and the DV, when the IVs are all categorical?