A critical discussion of intraclass correlation coefficients

15 December 1994

journal article
research article
Published by Wiley in Statistics in Medicine

Vol. 13 (23-24), 2465-2476
https://doi.org/10.1002/sim.4780132310

Abstract

In general, intraclass correlation coefficients (ICC's) are designed to assess consistency or conformity between two or more quantitative measurements. They are claimed to handle a wide range of problems, including questions of reliability, reproducibility and validity. It is shown that care must be taken in choosing a suitable ICC with respect to the underlying sampling theory. For this purpose a decision tree is developed. It may be used to choose a coefficient which is appropriate for a specific study setting. We demonstrate that different ICC's may result in quite different values for the same data set, even under the same sampling theory. Other general limitations of ICC's are also addressed. Potential alternatives are presented and discussed, and some recommendations are given for the use of an appropriate method.

Keywords

This publication has 21 references indexed in Scilit:

A statistical assessment of clinical equivalence
Statistics in Medicine, 1988
STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT
The Lancet, 1986
A nonparametric measure of intraclass correlation
Biometrika, 1979
The Non-Null Distribution of the Spearman Rank Correlation Coefficient
Journal of the American Statistical Association, 1974
Improved Approximation to the Non-Null Distribution of the Correlation Coefficient
Journal of the American Statistical Association, 1973
The Equivalence of Weighted Kappa and the Intraclass Correlation Coefficient as Measures of Reliability
Educational and Psychological Measurement, 1973
Bivariate Agreement Coefficients for Reliability of Data
Sociological Methodology, 1970
A NEW VIEW OF INTER‐OBSERVER AGREEMENT¹
Personnel Psychology, 1963
Intra-class rank correlation
Biometrika, 1949

Cited by 532 articles