I am grading quality of studies with a questionnaire which has 20 questions. Scores vary from 0 to 1 for all but two for which score varies from 0 to 2. I need to evaluate 50 studies. This will be done by 2 reviewers. To find the test retest reliability or inter rater reliability, should I add total score of 20 questions for each study, So I have 50 values from reviewers 1 and 50 values from reviewer 2? How can I find correlation between individual scores of 20 questions of each study?