This is a one sample test design where proficiency for was assessed pre and post, with the sample size being over 1000. There are 4 categories that were used to score them: unsatisfactory, developing, proficient, and exemplary. These categories are dependent of each other. What statistical test should be used here?