I have several predictor variables and an outcome variable, diabetes self-management which has several questions related to diet, physical activity etc. Each question has a discrete response. For eg, how many of the last seven days did you follow a diet plan? The response ranges from 1-7. But the responses are averaged to get a mean score to calculate the total diabetes self-management score.

Since the outcome data is not continuous, can I still use linear regression? I did a preliminary check on my outcome data and it is concentrated towards 7 and 0.My search of the literature suggested that the type of data is not as important as the assumption validation of the residuals. I also checked the residual assumptions of simple linear regression? But it doesn't look like it meets any assumption.

Any information on how to move forward will be highly appreciated?

More Monika Shrestha's questions See All
Similar questions and discussions