Hello everyone

I have extensively read throughout the platform about the usage of different models to analyze likert scale and ordered dependent variables. I wanted to share my plans and see your opinions if it is the best model.

My context is the following: We asked how comfortable they would feel downloading an app with different characteristics (factors) from 1 - 11, 1 being under no circumstance I would download such app and 11 being I would download and use that app everyday. There are three factors,with 2, 3 and 7 levels accordingly (open-source, security and app provider). We deckerized with open source, as our previous research showed it wasn't significant, meaning that respondents were asked to evaluate a set of vignettes either from open-source or non-open source. We used clustered sampling and our sample data is representative of our objective population (with 600 answers).

I have read from sociological methodology that given that the likert scale is of 11 points (bringing a number of benefits) and it is set in an experimental manner, you can use ANOVA, OLS and Random Intercepts Models. However, I feel a bit uncomfortable using these, as some assumptions are broken. Thus, I decided to use an ordered logit regression (OLR) , as for me the dependent variable (willingness to download) is ordered. The parallel line assumptions isn't broken and all variable as significant, so that gives me confidence I can use this model. However, I started doubting if maybe a multinomial logistic regression.

I'm using R for the analysis, with the MASS packaged (specifically the polr function for the OLR and rant and poTest for checking the parallel assumptions). I have crossed checked I get the same results with STATA and it fits.

On the article I plan on also including the ANOVA, OLS and Random Intercepts Model to add robustness to the analysis. What's interesting is that, although some specific coefficients change from OLS to OLM, the conclusion are the same.

Thus: Should I used the multinomial logistic regression or not? Comments on what to report and improvements?

Edit PS: Through my ANOVA, it shows that the ind.var don't interact. Should I still include them in the OLR? Currently it is like dep.var ~ x1 + x2 . Would you suggest dep.var ~ x1 + x2 + x1:x2 as a better fit, even if the ANOVA with interaction says the interaction isn't significant? And if you think that the OLR should include the interaction, do you exactly know how to know if it is significant?

More Claudia Negri-Ribalta's questions See All
Similar questions and discussions