Inflation of the type I error rate when a continuous confounding variable is categorized in logistic regression analyses
- 23 March 2004
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 23 (7), 1159-1178
- https://doi.org/10.1002/sim.1687
Abstract
This paper demonstrates an inflation of the type I error rate that occurs when testing the statistical significance of a continuous risk factor after adjusting for a correlated continuous confounding variable that has been divided into a categorical variable. We used Monte Carlo simulation methods to assess the inflation of the type I error rate when testing the statistical significance of a risk factor after adjusting for a continuous confounding variable that has been divided into categories. We found that the inflation of the type I error rate increases with increasing sample size, as the correlation between the risk factor and the confounding variable increases, and with a decrease in the number of categories into which the confounder is divided. Even when the confounder is divided in a five‐level categorical variable, the inflation of the type I error rate remained high when both the sample size and the correlation between the risk factor and the confounder were high. Copyright 2004 John Wiley & Sons, Ltd.Keywords
This publication has 25 references indexed in Scilit:
- Estimating linear regression models in the presence of a censored independent variableStatistics in Medicine, 2004
- Moderate Alcohol Intake during Pregnancy and the Risk of Stillbirth and Death in the First Year of LifeAmerican Journal of Epidemiology, 2002
- Measuring Quantity, Frequency, and Volume of DrinkingAlcohol, Clinical and Experimental Research, 1998
- Alcohol and mortality: A reviewJournal of Clinical Epidemiology, 1995
- Dangers of Using "Optimal" Cutpoints in the Evaluation of Prognostic FactorsJNCI Journal of the National Cancer Institute, 1994
- Regression Using Fractional Polynomials of Continuous Covariates: Parsimonious Parametric ModellingJournal of the Royal Statistical Society Series C: Applied Statistics, 1994
- The concept of residual confounding in regression models and some applicationsStatistics in Medicine, 1992
- Effects of mismodelling and mismeasuring explanatory variables on tests of their association with a response variableStatistics in Medicine, 1988
- Maximally Selected Chi Square StatisticsBiometrics, 1982