This sounds like a really basic question, but I am looking for some informed feedback from fellow scientists with knowledge of statistics. Recently I had a debate with reviewers of a manuscript over the choice of statistical analysis for comparing two sets of ordinal data. These data are from psychological clinical scales and are similar to a Likert scale (in this case, a rating from 1-7).
An obvious choice for comparing scale results from two populations of patients would be a singed ranks test like Mann-Whitney U (or a t-test [?!] according to the reviewers), however the scoring distributions are non-normal, which I argued invalidate interpretation of any Mann-Whitney U test results. I opted, instead, to build contingency tables and analyze the groups with a Pearson's Chi Squared test. They did not like that.
In a world of computer statistics packages, it seems to me that many people are quick to dismiss simple statistical tests, despite their being robust and the question asked being simple. Do any of you have opinions regarding a better approach to analyzing ordinal data?
The dataset in question consisted of Likert scale-like assessments, rated 1-7. There were up to 30 questions/observational assessments per scale. I chose to compare each question separately between patient groups. Some questions are arranged in clusters of based on symptom type (for example, in PANSS there are clusters for positive [7], negative [7], and general [16] psychopathology symptoms), but these are general groupings and each assessment still stands on its own to rate a particular issue. For example, in PANSS positive symptom set, there is a rating for delusions and one for grandiosity, both of which are positive symptoms and interact but are distinct behavioral features.
So, what is the best way to analyze these types of data? Thanks.