Performance of reclassification statistics in comparing risk prediction models

Abstract
Concerns have been raised about the use of traditional measures of model fit in evaluating risk prediction models for clinical use, and reclassification tables have been suggested as an alternative means of assessing the clinical utility of a model. Several measures based on the table have been proposed, including the reclassification calibration (RC) statistic, the net reclassification improvement (NRI), and the integrated discrimination improvement (IDI), but the performance of these measures in practical settings has not been fully examined. We used simulations to estimate the type I error and power of these statistics under a number of scenarios in which a new marker is added to an established or reference model, and to assess the impact of the number and type of risk categories. The type I error was reasonable in most settings, and power was highest for the IDI and similar to that of the test of association for the new marker. The relative power of the RC statistic, a test of calibration, and of the NRI, a test of discrimination, varied depending on the model assumptions. These tools provide unique but complementary information.
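The abstract does not define the NRI and IDI themselves; for orientation, a minimal sketch of the standard category-based NRI and the IDI (in the sense of Pencina et al.) is given below. The function name, interface, and risk cutoffs are illustrative assumptions, not the settings used in the simulations reported here.

```python
import numpy as np

def nri_idi(p_old, p_new, y, cutoffs=(0.05, 0.10, 0.20)):
    """Category-based NRI and IDI for nested risk models.

    p_old, p_new : predicted event probabilities from the reference
                   and expanded models; y : 0/1 event indicator.
    cutoffs : illustrative risk thresholds defining the categories.
    """
    p_old, p_new, y = map(np.asarray, (p_old, p_new, y))
    bins = [0.0, *cutoffs, 1.0]
    c_old = np.digitize(p_old, bins)  # risk category under each model
    c_new = np.digitize(p_new, bins)
    up, down = c_new > c_old, c_new < c_old
    ev, ne = y == 1, y == 0
    # NRI: net upward reclassification among events plus
    # net downward reclassification among non-events
    nri = (up[ev].mean() - down[ev].mean()) + (down[ne].mean() - up[ne].mean())
    # IDI: gain in mean predicted risk for events minus the
    # corresponding gain for non-events
    idi = (p_new[ev].mean() - p_old[ev].mean()) - (p_new[ne].mean() - p_old[ne].mean())
    return nri, idi
```

In a simulation of the kind described above, `p_old` and `p_new` would come from fitting the reference model and the model with the added marker to the same data, and the two statistics would be computed on each simulated dataset to estimate type I error and power.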