Reliability of staging, prognosis, and comorbidity data collection in the National Comprehensive Cancer Network (NCCN) non-Hodgkin lymphoma (NHL) multicenter outcomes database

Open Access

1 December 2008

journal article
research article
Published by Wiley in Cancer

Vol. 113 (11), 3209-3212
https://doi.org/10.1002/cncr.23911

Abstract

Clinical trials and outcomes studies often rely on nonphysicians to abstract complex data from medical records, but the reliability of these data are rarely assessed. We used standardized charts of patients with non-Hodgkin lymphoma to assess the reliability of key clinical data elements abstracted by 6 clinical research associates (CRAs), 3 project staff, and 3 medical oncologists. We assessed reliability on 5 variables: MD-reported and rater-determined disease stage; International Prognostic Index (IPI; low-low intermediate, intermediate-high, high); Charlson comorbidity index score; and presence of any item from the Charlson index. Intraclass correlation coefficients (ICCs) of 0-0.20 were indicative of "slight", 0.21-0.40 indicated "fair", 0.41-0.60 indicated "moderate", 0.61-0.80 "substantial" and >0.80 "almost perfect" reliability. By outcome, the ICC (95% confidence interval) values for MD-reported stage, rater-determined stage, and IPI were 0.86 (0.67, 0.94), 0.82 (0.59, 0.93), and 0.80 (0.55, 0.92), respectively. In contrast, the ICC (95% confidence interval) of the Charlson score, or presence of any Charlson comorbidity item was 0.47 (0.03, 0.75) and 0.61 (0.23, 0.83), respectively. Reliability varied by rater group; no rater group was consistently more reliable than others. Trained CRAs abstracted key clinical variables with a very high degree of reliability, and performed at a level similar to study trainers and oncologists. Elements of the Charlson index were less reliable than other data types, possibly because of inherent ambiguity in the index itself.

Keywords

This publication has 12 references indexed in Scilit:

Interrater reliability of measurements of comorbid illness should be reported
Journal of Clinical Epidemiology, 2006
Reliability of PRISM and PIM scores in paediatric intensive care
Archives of Disease in Childhood, 2005
Inter-observer variability in APACHE II scoring: effect of strict guidelines and training
Intensive Care Medicine, 2001
Accuracy and reliability of APACHE II scoring in two intensive care units
Anaesthesia, 2001
Development of a comorbidity index using physician claims data
Journal of Clinical Epidemiology, 2000
Sample size and optimal designs for reliability studies
Statistics in Medicine, 1998
Comorbidity measures for stroke outcome research: A preliminary study
Archives of Physical Medicine and Rehabilitation, 1997
Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases
Journal of Clinical Epidemiology, 1992
A new method of classifying prognostic comorbidity in longitudinal studies: Development and validation
Journal of Chronic Diseases, 1987
Fisher's Z to R
Educational and Psychological Measurement, 1983

Cited by 11 articles