Hello,

I am developing 2 AI models and for both models I have compared the models predictions to that of 5 raters. I have calculated the cohen's kappa coefficient between each rater and model1 and then calculated a mean kappa (or median still unsure how to present this?) value for this model. I then repeated this process for model2.

So now I have 2 average cohen kappa coefficient values:

Average rater-model1 kappa values + SD

average rate-model2 kappa value + SD

How do I determine if these 2 kappa values are statistically significantly different?

More Oreoluwa Mohammed's questions See All
Similar questions and discussions