Based on a set of evaluation criteria, a set of pairs of raters have assessed students' projects. Each pair worked on evaluating one set of projects (i.e.: Rater1,2 -->projectset1, Rater 3,4-->projectset2...etc). Which measure of inter-rater can be used to measure agreement and consistency among raters?