I have a set of symptoms that may belong to an A condition, B condition, C (both A+B) or D (neither of the options) and I asked two groups of clinicians to rate the belongingness of each symptom to one of the four conditions. I used Gwet´s AC1 to assess interrater agreement of the overall 16 symptoms over each group and ran a paired t-test to evaluate group differences in their ratings (two groups assessing the same set of symptoms). I am unsure about three issues: