Reliability of staging, prognosis, and comorbidity data collection in the National Comprehensive Cancer Network (NCCN) non-Hodgkin lymphoma (NHL) multicenter outcomes database
Open Access
- 1 December 2008
- Vol. 113 (11), 3209-3212
- https://doi.org/10.1002/cncr.23911
Abstract
Clinical trials and outcomes studies often rely on nonphysicians to abstract complex data from medical records, but the reliability of these data are rarely assessed. We used standardized charts of patients with non-Hodgkin lymphoma to assess the reliability of key clinical data elements abstracted by 6 clinical research associates (CRAs), 3 project staff, and 3 medical oncologists. We assessed reliability on 5 variables: MD-reported and rater-determined disease stage; International Prognostic Index (IPI; low-low intermediate, intermediate-high, high); Charlson comorbidity index score; and presence of any item from the Charlson index. Intraclass correlation coefficients (ICCs) of 0-0.20 were indicative of "slight", 0.21-0.40 indicated "fair", 0.41-0.60 indicated "moderate", 0.61-0.80 "substantial" and >0.80 "almost perfect" reliability. By outcome, the ICC (95% confidence interval) values for MD-reported stage, rater-determined stage, and IPI were 0.86 (0.67, 0.94), 0.82 (0.59, 0.93), and 0.80 (0.55, 0.92), respectively. In contrast, the ICC (95% confidence interval) of the Charlson score, or presence of any Charlson comorbidity item was 0.47 (0.03, 0.75) and 0.61 (0.23, 0.83), respectively. Reliability varied by rater group; no rater group was consistently more reliable than others. Trained CRAs abstracted key clinical variables with a very high degree of reliability, and performed at a level similar to study trainers and oncologists. Elements of the Charlson index were less reliable than other data types, possibly because of inherent ambiguity in the index itself.Keywords
This publication has 12 references indexed in Scilit:
- Interrater reliability of measurements of comorbid illness should be reportedJournal of Clinical Epidemiology, 2006
- Reliability of PRISM and PIM scores in paediatric intensive careArchives of Disease in Childhood, 2005
- Inter-observer variability in APACHE II scoring: effect of strict guidelines and trainingIntensive Care Medicine, 2001
- Accuracy and reliability of APACHE II scoring in two intensive care unitsAnaesthesia, 2001
- Development of a comorbidity index using physician claims dataJournal of Clinical Epidemiology, 2000
- Sample size and optimal designs for reliability studiesStatistics in Medicine, 1998
- Comorbidity measures for stroke outcome research: A preliminary studyArchives of Physical Medicine and Rehabilitation, 1997
- Adapting a clinical comorbidity index for use with ICD-9-CM administrative databasesJournal of Clinical Epidemiology, 1992
- A new method of classifying prognostic comorbidity in longitudinal studies: Development and validationJournal of Chronic Diseases, 1987
- Fisher's Z to REducational and Psychological Measurement, 1983