I have several predictor variables and an outcome variable, diabetes self-management which has several questions related to diet, physical activity etc. Each question has a discrete response. For eg, how many of the last seven days did you follow a diet plan? The response ranges from 1-7. But the responses are averaged to get a mean score to calculate the total diabetes self-management score.
Since the outcome data is not continuous, can I still use linear regression? I did a preliminary check on my outcome data and it is concentrated towards 7 and 0.My search of the literature suggested that the type of data is not as important as the assumption validation of the residuals. I also checked the residual assumptions of simple linear regression? But it doesn't look like it meets any assumption.
Any information on how to move forward will be highly appreciated?