Dear RG community,

I've coded N = 500 professional development courses for teachers for the presence of certain topics (0 = topic was not part of the course; 1 = topic was part of the course). I'd like to have the reliability of my coding checked by a second rater. What is the appropriate measure under these circumstances, and how many of the 500 courses would a second rater have to rate?

So far, I've come to the conclusion that Cohen's kappa may not be the preferred choice and that the Matthews correlation coefficient (MCC) might be more appropriate. Perhaps even simple percent agreement would be suitable in my case, since there are only two raters in total and the coding is binary? I've been unable to find anything on the minimum number of courses the second rater would need to code.
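For concreteness, here is a minimal sketch (in Python, using scikit-learn) of how I would compute the three candidate measures on a double-coded subsample for a single topic; the rater data and the subsample size of 100 below are made up purely for illustration:

```python
# Toy comparison of agreement measures for two raters and binary codes.
import numpy as np
from sklearn.metrics import accuracy_score, cohen_kappa_score, matthews_corrcoef

rng = np.random.default_rng(42)

# Hypothetical subsample of n courses re-coded by a second rater
# (0 = topic not part of the course, 1 = topic part of the course).
n = 100
rater1 = rng.integers(0, 2, size=n)
# In this toy example the second rater agrees about 90% of the time.
rater2 = np.where(rng.random(n) < 0.9, rater1, 1 - rater1)

print("Percent agreement:", accuracy_score(rater1, rater2))
print("Cohen's kappa:    ", cohen_kappa_score(rater1, rater2))
print("MCC:              ", matthews_corrcoef(rater1, rater2))
```

My question is which of these (or which other measure) is the right one to report, and how large that subsample needs to be.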

Any help is greatly appreciated.

Best

Marcel
