I am building a model with:

- A non-normal continuous dependent variable (DV): a score on a questionnaire that has no direct clinical meaning. It looks fairly normal on a histogram, but outliers make it fail the Shapiro-Wilk test (swilk in Stata)

- Categorical independent variables (IVs) only (more than 3)

The sample size is 257

No transformation makes the DV normal. If I square it, however, the multiple linear regression produces normally distributed residuals, although there is a slight violation of homoscedasticity.

I am assessing these options:

Use the multiple linear regression model with robust standard errors

Use a factorial ANOVA. Is there a robust correction for this in Stata? Can I get away with a non-normal DV?

Use the squared DV. Would the coefficients then be difficult to interpret?
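To make the first option concrete, here is a minimal sketch of what I understand it to involve: dummy-code the categorical IVs, fit ordinary least squares, and compute heteroskedasticity-robust standard errors. It uses the HC1 sandwich estimator, which is my assumption about what "robust" would mean here. The data, the factor names (`sex`, `site`), and the helper functions are all made up for illustration and are not from my study.

```python
import numpy as np

def dummy_code(levels):
    """Return 0/1 indicator columns for every level except the first (reference)."""
    cats = sorted(set(levels))
    return np.column_stack(
        [[1.0 if v == c else 0.0 for v in levels] for c in cats[1:]]
    )

def ols_robust(X, y):
    """OLS coefficients with HC1 heteroskedasticity-robust standard errors."""
    n, k = X.shape
    xtx_inv = np.linalg.inv(X.T @ X)
    beta = xtx_inv @ X.T @ y
    resid = y - X @ beta
    # Sandwich "meat": sum over observations of e_i^2 * x_i x_i'
    meat = X.T @ (X * (resid ** 2)[:, None])
    # HC1 applies the small-sample factor n / (n - k)
    cov = xtx_inv @ meat @ xtx_inv * n / (n - k)
    return beta, np.sqrt(np.diag(cov))

# Simulated data only (hypothetical factors and effect sizes)
rng = np.random.default_rng(0)
n = 257
sex = rng.choice(["f", "m"], size=n)
site = rng.choice(["a", "b", "c"], size=n)
score = 50 + 3 * (sex == "m") + 2 * (site == "c") + rng.normal(0, 5, size=n)

# Design matrix: intercept plus indicators for the non-reference levels
X = np.column_stack([np.ones(n), dummy_code(sex), dummy_code(site)])
beta, se = ols_robust(X, score)

for name, b, s in zip(["const", "sex=m", "site=b", "site=c"], beta, se):
    print(f"{name:8s} coef={b:7.3f}  robust_se={s:6.3f}")
```

With this coding, each coefficient is an adjusted difference from the reference level on the scale of the DV, which is part of why I am hesitant about squaring the DV.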

A few more questions on assumptions:

How do I test for independence of the residuals in a simple way?

How do I test for a linear relationship between (1) the IVs and the residuals and (2) the IVs and the DV, when the IVs are all categorical?
