Summary: I am wondering which statistical test is most appropriate for my data. For each of my four experimental conditions, separate participants rated my stimuli. I have about 250 stimuli, each rated on multiple ordinal scales (10-point Likert scales) in each of the four conditions.
Question: I hope to find out whether the experimental conditions explain differences in participants' predictions on these scales. Can anybody recommend which test to consider? I would also appreciate advice and, if possible, some resources on the fundamentals of these models and their application in Python.
More details/my considerations: At first I made the mistake of assuming that I could treat the Likert scale as a continuous variable, but I realized this is not the case when I plotted the distributions and Q-Q plots and read more about this 'discussion'. Now I am considering a Chi-square test or a likelihood model, since one of my resources recommends these for categorical variables (A. Field, 3rd edition). However, while reading up on possible models, I came across ordinal regression (something that is not covered in my books).
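To make the Chi-square option concrete, here is a minimal sketch of how I understand it would apply: the ratings on one scale are cross-tabulated against condition, and the Pearson statistic is computed by hand. The function name `chi_square_independence`, the `(condition, rating)` input format, and the toy data are my own assumptions for illustration, not taken from any book or library. Note that this treats the ten rating levels as unordered categories and treats every rating as an independent observation, which may not fit my design and is part of why ordinal regression caught my attention.

```python
from collections import Counter


def chi_square_independence(pairs, n_levels=10):
    """Pearson chi-square statistic for a condition x rating table.

    pairs: iterable of (condition, rating) tuples, with every rating
           assumed to lie in 1..n_levels (hypothetical input format).
    Returns (statistic, degrees_of_freedom).
    """
    counts = Counter(pairs)
    conditions = sorted({cond for cond, _ in counts})
    # Keep only rating levels that were actually used, so that no
    # expected count is zero.
    levels = [r for r in range(1, n_levels + 1)
              if any(counts[(c, r)] for c in conditions)]

    row_tot = {c: sum(counts[(c, r)] for r in levels) for c in conditions}
    col_tot = {r: sum(counts[(c, r)] for c in conditions) for r in levels}
    total = sum(row_tot.values())

    stat = 0.0
    for c in conditions:
        for r in levels:
            expected = row_tot[c] * col_tot[r] / total
            stat += (counts[(c, r)] - expected) ** 2 / expected
    dof = (len(conditions) - 1) * (len(levels) - 1)
    return stat, dof


# Made-up toy data: two conditions, ratings on a 10-point scale.
toy = [("A", 3)] * 6 + [("A", 7)] * 4 + [("B", 3)] * 2 + [("B", 7)] * 8
stat, dof = chi_square_independence(toy)
print(stat, dof)
```

The statistic would then be compared against a chi-square distribution with `dof` degrees of freedom. As far as I can tell, `scipy.stats.chi2_contingency` does the same computation and also returns a p-value, so the hand-rolled version is only meant to show what the test actually compares.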