Dichotomizing continuous predictors in multiple regression: a bad idea

Top Cited Papers

11 October 2005

journal article
research article
Published by Wiley in Statistics in Medicine

Vol. 25 (1), 127-141
https://doi.org/10.1002/sim.2331

Abstract

In medical research, continuous variables are often converted into categorical variables by grouping values into two or more categories. We consider in detail issues pertaining to creating just two groups, a common approach in clinical research. We argue that the simplicity achieved is gained at a cost; dichotomization may create rather than avoid problems, notably a considerable loss of power and residual confounding. In addition, the use of a data‐derived ‘optimal’ cutpoint leads to serious bias. We illustrate the impact of dichotomization of continuous predictor variables using as a detailed case study a randomized trial in primary biliary cirrhosis. Dichotomization of continuous data is unnecessary for statistical analysis and in particular should not be applied to explanatory variables in regression models. Copyright © 2005 John Wiley & Sons, Ltd.

Keywords

This publication has 29 references indexed in Scilit:

A new approach to modelling interactions between treatment and continuous covariates in clinical trials by using fractional polynomials
Statistics in Medicine, 2004
Confidence intervals for the effect of a prognostic factor after selection of an ‘optimal’ cutpoint
Statistics in Medicine, 2004
A new measure of prognostic separation in survival data
Statistics in Medicine, 2004
Stability of multivariable fractional polynomial models with selection of variables and transformations: a bootstrap investigation
Statistics in Medicine, 2003
On the practice of dichotomization of quantitative variables.
Psychological Methods, 2002
Treatment of continuous data as categoric variables in obstetrics and gynecology
Obstetrics & Gynecology, 1997
PRACTICALp-VALUE ADJUSTMENT FOR OPTIMALLY SELECTED CUTPOINTS
Statistics in Medicine, 1996
Dangers of Using "Optimal" Cutpoints in the Evaluation of Prognostic Factors
JNCI Journal of the National Cancer Institute, 1994
The concept of residual confounding in regression models and some applications
Statistics in Medicine, 1992
The Cost of Dichotomization
Applied Psychological Measurement, 1983

Cited by 1731 articles