From single‐race reporting to multiple‐race reporting: using imputation methods to bridge the transition
- 14 April 2003
- journal article
- Published by Wiley in Statistics in Medicine
- Vol. 22 (9), 1571-1587
- https://doi.org/10.1002/sim.1512
Abstract
In 1997, the Office of Management and Budget issued revised standards for the collection of race information within the Federal statistical system. One revision allows individuals to choose more than one race group when responding to Federal surveys and other Federal data collections. This paper explores methods that impute single‐race categories for those who have given multiple‐race responses. Such imputations would be useful when it is desired to conduct analyses involving only single‐race categories, such as when trends over time are being examined by race group so that data collected under the old and new standards are being combined. The National Health Interview Survey has allowed multiple‐race responses for several years, while also asking respondents to specify one race as their primary race. Exploratory analyses of data from the survey suggest that imputation methods that use demographic and contextual covariate information to predict primary race can have advantages with respect to lower bias and improved variance estimation compared to simpler methods discussed by the Office of Management and Budget. It also appears, however, that the relationships between primary race and covariates might be changing over time. Thus, caution should be exercised if an imputation model fitted to data from one time period is to be applied to data from another time period. Published in 2003 by John Wiley & Sons, Ltd.Keywords
This publication has 8 references indexed in Scilit:
- Disease classification: measuring the effect of the Tenth Revision of the International Classification of Diseases on cause‐of‐death data in the United StatesStatistics in Medicine, 2003
- Race/ethnicity and the 2000 census: implications for public healthAmerican Journal of Public Health, 2000
- Multiple-Imputation Inferences with Uncongenial Sources of InputStatistical Science, 1994
- Analyses of Public Use Decennial Census Data with Multiply Imputed Industry and Occupation CodesJournal of the Royal Statistical Society Series C: Applied Statistics, 1993
- Multiple imputation in health‐are databases: An overview and some applicationsStatistics in Medicine, 1991
- Multiple Imputation of Industry and Occupation Codes in Census Public-use Samples Using Bayesian Logistic RegressionJournal of the American Statistical Association, 1991
- Multiple Imputation for Nonresponse in SurveysWiley Series in Probability and Statistics, 1987