An Application of Hierarchical Kappa-type Statistics in the Assessment of Majority Agreement among Multiple Observers

1 June 1977

journal article
research article
Published by JSTOR in Biometrics

Vol. 33 (2), 363-374
https://doi.org/10.2307/2529786

Abstract

This paper presents a general statistical methodology for the analysis of multivariate categorical data involving agreement among more than two observers. Since these situations give rise to very large contingency tables in which most of the observed cell frequencies are zero, procedures based on indicator variables of the raw data for individual subjects are used to generate first-order margins and main diagonal sums from the conceptual multidimensional contingency table. From these quantities, estimates are generated to reflect the strength of an internal majority decision on each subject. Moreover, a subset of observers who demonstrate a high level of interobserver agreement can be identified by using pairwise agreement statistics between each observer and the internal majority standard opinion on each subject. These procedures are all illustrated within the context of a clinical diagnosis example involving seven pathologists.

This publication has 7 references indexed in Scilit:

A General Methodology for the Analysis of Experiments with Repeated Measurement of Categorical Data
Biometrics, 1977
The Measurement of Observer Agreement for Categorical Data
Biometrics, 1977
Comparing the Joint Agreement of Several Raters with Another Rater
Biometrics, 1976
A review of statistical methods in the analysis of data arising from observer reliability studies (Part II)*
Statistica Neerlandica, 1975
A review of statistical methods in the analysis of data arising from observer reliability studies (Part I)*
Statistica Neerlandica, 1975
VARIABILITY IN CLASSIFICATION OF CARCINOMA IN SITU OF UTERINE CERVIX
1967
Measures of Association for Cross Classifications
Journal of the American Statistical Association, 1954

Cited by 2282 articles