I have obtained two ratings (creativity and technicality) from six expert judges based on art-related outcomes. I am following Amabile's (1979) Consensual assessment technique to identify assess creativity. Research suggests that (ideally) ratings of creativity should not correlate with that of technicality in order to demonstrate discriminant validity (i.e. they are not measuring the same thing). Since the data collected is ordinal in nature, I have looked at Spearman's/ Kendall’s Tau correlations. However, I am concerned with the correlation and ICC approach as it does not capture paired differences (what I mean by paired here is same product two independent ratings). Correlations are also problematic given the very small sample size (8 ratings). Could anyone suggest what might be the best approach to identify where ratings for creativity and technicality differ (taking into account each rated product)? Thanks!