5. Three Likelihood-Based Methods for Mean and Covariance Structure Analysis with Nonnormal Missing Data
Top Cited Papers
- 1 August 2000
- journal article
- Published by SAGE Publications in Sociological Methodology
- Vol. 30 (1), 165-200
- https://doi.org/10.1111/0081-1750.00078
Abstract
Survey and longitudinal studies in the social and behavioral sciences generally contain missing data. Mean and covariance structure models play an important role in analyzing such data. Two promising methods for dealing with missing data are a direct maximum-likelihood and a two-stage approach based on the unstructured mean and covariance estimates obtained by the EM-algorithm. Typical assumptions under these two methods are ignorable nonresponse and normality of data. However, data sets in social and behavioral sciences are seldom normal, and experience with these procedures indicates that normal theory based methods for nonnormal data very often lead to incorrect model evaluations. By dropping the normal distribution assumption, we develop more accurate procedures for model inference. Based on the theory of generalized estimating equations, a way to obtain consistent standard errors of the two-stage estimates is given. The asymptotic efficiencies of different estimators are compared under various assumptions. We also propose a minimum chi-square approach and show that the estimator obtained by this approach is asymptotically at least as efficient as the two likelihood-based estimators for either normal or nonnormal data. The major contribution of this paper is that for each estimator, we give a test statistic whose asymptotic distribution is chisquare as long as the underlying sampling distribution enjoys finite fourth-order moments. We also give a characterization for each of the two likelihood ratio test statistics when the underlying distribution is nonnormal. Modifications to the likelihood ratio statistics are also given. Our working assumption is that the missing data mechanism is missing completely at random. Examples and Monte Carlo studies indicate that, for commonly encountered nonnormal distributions, the procedures developed in this paper are quite reliable even for samples with missing data that are missing at random.Keywords
This publication has 48 references indexed in Scilit:
- Structural Equation Modeling with Small Samples: Test StatisticsMultivariate Behavioral Research, 1999
- Asymptotic Chi-Square Tests for a Large Class of Factor Analysis ModelsThe Annals of Statistics, 1990
- Pseudo-Maximum Likelihood Estimation of Mean and Covariance Structures with Missing DataJournal of the American Statistical Association, 1990
- Pseudo maximum likelihood estimation and a test for misspecification in mean and covariance structure modelsPsychometrika, 1989
- Structural Equations with Latent VariablesPublished by Wiley ,1989
- The Asymptotic Normal Distribution of Estimators in Factor Analysis under General ConditionsThe Annals of Statistics, 1988
- Estimation of Linear Models with Incomplete DataSociological Methodology, 1987
- Some contributions to efficient statistics in structural models: Specification and estimation of moment structuresPsychometrika, 1983
- Asymptotic comparison of missing data procedures for estimating factor loadingsPsychometrika, 1983
- Maximum Likelihood Estimates for a Multivariate Normal Distribution when Some Observations are MissingJournal of the American Statistical Association, 1957