I want to establish degree of agreement of individual items in the designed scale, with multiple judges (at least 5) given all items. So it’s a full crossed design, but I’m unable to decide whether to select inter class coefficients (ICC) or Kendall’s coefficient of concordance measure (W)? What are the assumptions of these tests to judge which statistics is best suitable for my data?