Three Likelihood-Based Methods for Mean and Covariance Structure Analysis with Nonnormal Missing Data

1 August 2000

journal article
Published by SAGE Publications in Sociological Methodology

Vol. 30 (1), 165-200
https://doi.org/10.1111/0081-1750.00078

Abstract

Survey and longitudinal studies in the social and behavioral sciences generally contain missing data. Mean and covariance structure models play an important role in analyzing such data. Two promising methods for dealing with missing data are a direct maximum-likelihood approach and a two-stage approach based on the unstructured mean and covariance estimates obtained by the EM algorithm. Typical assumptions under these two methods are ignorable nonresponse and normality of data. However, data sets in the social and behavioral sciences are seldom normal, and experience with these procedures indicates that normal-theory-based methods applied to nonnormal data very often lead to incorrect model evaluations. By dropping the normal distribution assumption, we develop more accurate procedures for model inference. Based on the theory of generalized estimating equations, a way to obtain consistent standard errors of the two-stage estimates is given. The asymptotic efficiencies of different estimators are compared under various assumptions. We also propose a minimum chi-square approach and show that the estimator obtained by this approach is asymptotically at least as efficient as the two likelihood-based estimators for either normal or nonnormal data. The major contribution of this paper is that for each estimator, we give a test statistic whose asymptotic distribution is chi-square as long as the underlying sampling distribution enjoys finite fourth-order moments. We also give a characterization for each of the two likelihood ratio test statistics when the underlying distribution is nonnormal. Modifications to the likelihood ratio statistics are also given. Our working assumption is that the missing data mechanism is missing completely at random. Examples and Monte Carlo studies indicate that, for commonly encountered nonnormal distributions, the procedures developed in this paper are quite reliable even for samples with missing data that are missing at random.
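
The first stage of the two-stage approach mentioned in the abstract relies on unstructured mean and covariance estimates obtained by the EM algorithm. The abstract does not give an implementation, so the following is only a minimal sketch of the standard normal-theory EM iteration, assuming NumPy, missing values coded as NaN, and ignorable nonresponse. The function name `em_mean_cov` and its parameters are hypothetical, and the sketch does not cover the second-stage model fitting or the corrected standard errors and test statistics that the paper develops.

```python
import numpy as np

def em_mean_cov(X, n_iter=200, tol=1e-8):
    """EM estimates of an unstructured mean and covariance from data with NaNs.

    Hypothetical helper: uses a multivariate normal working model and
    assumes the missingness is ignorable.
    """
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    observed = ~np.isnan(X)
    # Start from available-case means and a diagonal covariance.
    mu = np.nanmean(X, axis=0)
    sigma = np.diag(np.nanvar(X, axis=0))
    for _ in range(n_iter):
        sum_x = np.zeros(p)
        sum_xx = np.zeros((p, p))
        for i in range(n):
            o, m = observed[i], ~observed[i]
            x = X[i].copy()
            cond_cov = np.zeros((p, p))
            if m.any():
                if o.any():
                    # E-step: regress the missing coordinates on the observed ones.
                    s_om = sigma[np.ix_(o, m)]
                    coef = np.linalg.solve(sigma[np.ix_(o, o)], s_om).T
                    x[m] = mu[m] + coef @ (x[o] - mu[o])
                    cond_cov[np.ix_(m, m)] = sigma[np.ix_(m, m)] - coef @ s_om
                else:
                    # Fully missing row: fall back on the current estimates.
                    x[m] = mu[m]
                    cond_cov[np.ix_(m, m)] = sigma[np.ix_(m, m)]
            sum_x += x
            sum_xx += np.outer(x, x) + cond_cov
        # M-step: update the mean and the (divisor-n) covariance.
        new_mu = sum_x / n
        new_sigma = sum_xx / n - np.outer(new_mu, new_mu)
        change = max(np.abs(new_mu - mu).max(), np.abs(new_sigma - sigma).max())
        mu, sigma = new_mu, new_sigma
        if change < tol:
            break
    return mu, sigma
```

With complete data the iteration reduces to the sample mean and the divisor-n sample covariance, which gives a simple sanity check. For nonnormal data these estimates remain the starting point of the two-stage approach, but, as the abstract stresses, normal-theory standard errors and test statistics built on them need correction.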


Cited by 1059 articles