Measuring inter-rater reliability for nominal data - which coefficients and confidence intervals are appropriate?
Antonia Zapf, Stefanie Castell, Lars Morawietz, André Karch. Published in: BMC Medical Research Methodology (2016)
Fleiss' K and Krippendorff's alpha with bootstrap confidence intervals are equally suitable for the analysis of reliability of complete nominal data. The asymptotic confidence interval for Fleiss' K should not be used. In the case of missing data or data of higher than nominal order, Krippendorff's alpha is recommended. Together with this article, we provide an R-script for calculating Fleiss' K and Krippendorff's alpha and their corresponding bootstrap confidence intervals.
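
To make the recommendation concrete, the following is a minimal Python sketch of Fleiss' K with a percentile bootstrap confidence interval for complete nominal data. It is not the R-script that accompanies the article. The function names `fleiss_kappa` and `bootstrap_ci`, the input layout (one list of category labels per subject), the choice to resample subjects with replacement, and the percentile method are all assumptions made for illustration.

```python
import random
from collections import Counter


def fleiss_kappa(ratings):
    """Fleiss' K for complete nominal data.

    ratings: list of subjects, each a list with one category label per rater.
    Assumes every subject has the same number of ratings (no missing data).
    """
    n_subjects = len(ratings)
    n_raters = len(ratings[0])
    if n_raters < 2:
        raise ValueError("at least two ratings per subject are required")
    if any(len(row) != n_raters for row in ratings):
        raise ValueError("all subjects need the same number of ratings")

    category_totals = Counter()
    observed = 0.0
    for row in ratings:
        counts = Counter(row)
        category_totals.update(counts)
        # Proportion of agreeing rater pairs for this subject.
        agreeing = sum(c * c for c in counts.values()) - n_raters
        observed += agreeing / (n_raters * (n_raters - 1))
    observed /= n_subjects

    total = n_subjects * n_raters
    # Chance agreement from the pooled category proportions.
    expected = sum((c / total) ** 2 for c in category_totals.values())
    if expected == 1.0:
        return float("nan")  # only one category observed: K is undefined
    return (observed - expected) / (1.0 - expected)


def bootstrap_ci(ratings, n_boot=1000, level=0.95, seed=0):
    """Percentile bootstrap interval, resampling subjects (an assumption)."""
    rng = random.Random(seed)
    n = len(ratings)
    estimates = []
    for _ in range(n_boot):
        sample = [ratings[rng.randrange(n)] for _ in range(n)]
        k = fleiss_kappa(sample)
        if k == k:  # drop undefined (NaN) replicates
            estimates.append(k)
    if not estimates:
        raise ValueError("no bootstrap replicate produced a defined K")
    estimates.sort()
    alpha = 1.0 - level
    last = len(estimates) - 1
    return estimates[int(alpha / 2 * last)], estimates[int((1 - alpha / 2) * last)]


# Hypothetical toy data: 4 subjects, 3 raters, categories "a" and "b".
toy = [["a", "a", "b"], ["a", "b", "b"], ["a", "a", "a"], ["b", "b", "b"]]
kappa = fleiss_kappa(toy)
lower, upper = bootstrap_ci(toy, n_boot=500)
```

With only four subjects the interval is very wide, which is expected; the toy data only shows the mechanics. The sketch does not handle missing ratings or ordinal and higher scales, the cases for which the text recommends Krippendorff's alpha.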