I'm calculating an intraclass correlation coefficient (ICC) for the inter-rater reliability of a measurement made on radiographs. Rater 1 read each radiograph 3 times, blinded to the other reads, so that we could calculate intra-rater reliability. Raters 2 and 3 have now each read every radiograph once, and we need to calculate inter-rater reliability. Should I calculate the ICC using the average of rater 1's three measurements together with the single measurements from raters 2 and 3, giving three values per radiograph? Or should I use all 3 of rater 1's measurements plus the 2 others, for a total of 5 values per radiograph?
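
To make the two layouts I'm asking about concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the numbers are made up (not my real measurements), the names `icc2_1`, `option_a`, and `option_b` are hypothetical, and I've picked the two-way random-effects, absolute-agreement, single-measurement form (Shrout and Fleiss ICC(2,1)) only as an example, since I haven't settled on an ICC form either.

```python
import numpy as np

def icc2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single measurement.

    `ratings` is an (n subjects) x (k columns) array with no missing values.
    """
    x = np.asarray(ratings, dtype=float)
    n, k = x.shape
    grand = x.mean()
    # Sums of squares from a two-way ANOVA without replication
    ss_rows = k * ((x.mean(axis=1) - grand) ** 2).sum()   # between radiographs
    ss_cols = n * ((x.mean(axis=0) - grand) ** 2).sum()   # between columns (raters)
    ss_err = ((x - grand) ** 2).sum() - ss_rows - ss_cols  # residual
    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)

# Made-up values for 4 radiographs (illustration only, not real data)
rater1_reads = np.array([[10.1, 10.4,  9.9],
                         [12.0, 12.3, 12.1],
                         [ 8.7,  8.5,  8.9],
                         [11.2, 11.0, 11.5]])  # rater 1: 3 repeat reads per radiograph
rater2 = np.array([10.6, 12.5, 8.2, 11.9])     # rater 2: 1 read per radiograph
rater3 = np.array([ 9.8, 11.7, 9.0, 11.1])     # rater 3: 1 read per radiograph

# Layout 1: average rater 1's reads, giving 3 columns (one per rater)
option_a = np.column_stack([rater1_reads.mean(axis=1), rater2, rater3])

# Layout 2: keep all of rater 1's reads, giving 5 columns
option_b = np.column_stack([rater1_reads, rater2, rater3])

print("3-column layout:", icc2_1(option_a))
print("5-column layout:", icc2_1(option_b))
```

In the first layout each column corresponds to one rater; in the second, each of rater 1's repeat reads gets its own column alongside raters 2 and 3. The sketch only shows how the two inputs differ; it does not by itself say which one is appropriate.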