Fully conditional specification in multivariate imputation
Top Cited Papers
- 1 December 2006
- journal article
- research article
- Published by Taylor & Francis in Journal of Statistical Computation and Simulation
- Vol. 76 (12), 1049-1064
- https://doi.org/10.1080/10629360600810434
Abstract
The use of the Gibbs sampler with fully conditionally specified models, where the distribution of each variable given the other variables is the starting point, has become a popular method to create imputations in incomplete multivariate data. The theoretical weakness of this approach is that the specified conditional densities can be incompatible, and therefore the stationary distribution to which the Gibbs sampler attempts to converge may not exist. This study investigates practical consequences of this problem by means of simulation. Missing data are created under four different missing data mechanisms. Attention is given to the statistical behavior under compatible and incompatible models. The results indicate that multiple imputation produces essentially unbiased estimates with appropriate coverage in the simple cases investigated, even for the incompatible models. Of particular interest is that these results were produced using only five Gibbs iterations starting from a simple draw from observed marginal distributions. It thus appears that, despite the theoretical weaknesses, the actual performance of conditional model specification for multivariate imputation can be quite good, and therefore deserves further study.Keywords
This publication has 23 references indexed in Scilit:
- Statistical Analysis with Missing DataWiley Series in Probability and Statistics, 2002
- Conditionally Specified Distributions: An Introduction (with comments and a rejoinder by the authors)Statistical Science, 2001
- Treatments of Missing Data: A Monte Carlo Comparison of RBHDI, Iterative Stochastic Regression Imputation, and Expectation-MaximizationStructural Equation Modeling: A Multidisciplinary Journal, 2000
- Analysis of Incomplete Multivariate DataPublished by Taylor & Francis ,1997
- Multiple Imputation After 18+ YearsJournal of the American Statistical Association, 1996
- Effect on Secondary Data Analysis of Common Imputation MethodsSociological Methodology, 1989
- Multiple Imputation for Nonresponse in SurveysWiley Series in Probability and Statistics, 1987
- A Comparison of Methods for Treating Incomplete Data in Selection ResearchEducational and Psychological Measurement, 1987
- Estimation for the multiple factor model when data are missingPsychometrika, 1979
- A proposal for handling missing dataPsychometrika, 1975