10 November 2015 12 3K Report

I would like to model a depression scale score, which is rather heavily right skewed, from a larger number of predictor variables (~15), including one three-way interaction (and the corresponding subordinate two-way interaction terms).

Because of the outcome distribution, I do not think a standard multiple regression is appropriate. Transforming the variable is also not a sensible option since there is only a limited number of outcome values (between 0 and 15). I tried median regression, but it seems like my sample size (about 700) is not large enough to give stable estimates with the three-way interaction included (I get rather strange results, whereas it seems to work without this three-way interaction).

My question then is, is there any modelling technique which you could recommend in this situation? I am currently thinking about robust regression techniques (e.g. Huber M-estimation), but maybe there are other, better options?

More Hans Bauer's questions See All
Similar questions and discussions