Evaluating diagnostic tests: The area under the ROC curve and the balance of errors
Open Access
- 19 January 2010
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 29 (14), 1502-1510
- https://doi.org/10.1002/sim.3859
Abstract
Because accurate diagnosis lies at the heart of medicine, it is important to be able to evaluate the effectiveness of diagnostic tests. A variety of accuracy measures are used. One particularly widely used measure is the AUC, the area under the receiver operating characteristic (ROC) curve. This measure has a well-understood weakness when comparing ROC curves which cross. However, it also has the more fundamental weakness of failing to balance different kinds of misdiagnoses effectively. This is not merely an aspect of the inevitable arbitrariness in choosing a performance measure, but is a core property of the way the AUC is defined. This property is explored, and an alternative, the H measure, is described. Copyright © 2010 John Wiley & Sons, Ltd.Keywords
This publication has 14 references indexed in Scilit:
- Using Relative Utility Curves to Evaluate Risk PredictionJournal of the Royal Statistical Society Series A: Statistics in Society, 2009
- Performance criteria for plastic card fraud detection toolsJournal of the Operational Research Society, 2008
- Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyondStatistics in Medicine, 2007
- Use and Misuse of the Receiver Operating Characteristic Curve in Risk PredictionCirculation, 2007
- Classifier Technology and the Illusion of ProgressStatistical Science, 2006
- Good practice in retail credit scorecard assessmentJournal of the Operational Research Society, 2005
- On criteria for evaluating models of absolute riskBiostatistics, 2005
- Overestimation of the receiver operating characteristic curve for logistic regressionBiometrika, 2002
- A class of logistic-type discriminant functionsBiometrika, 2002
- Analyzing a Portion of the ROC CurveMedical Decision Making, 1989