Improved hypothesis testing for coefficients in generalized estimating equations with small samples of clusters
- 3 February 2006
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 25 (23), 4081-4098
- https://doi.org/10.1002/sim.2502
Abstract
The sandwich standard error estimator is commonly used for making inferences about parameter estimates found as solutions to generalized estimating equations (GEE) for clustered data. The sandwich tends to underestimate the variability in the parameter estimates when the number of clusters is small, and reference distributions commonly used for hypothesis testing poorly approximate the distribution of Wald test statistics. Consequently, tests have greater than nominal type I error rates. We propose tests that use bias-reduced linearization, BRL, to adjust the sandwich estimator and Satterthwaite or saddlepoint approximations for the reference distribution of resulting Wald t-tests. We conducted a large simulation study of tests using a variety of estimators (traditional sandwich, BRL, Mancl and DeRouen's BC estimator, and a modification of an estimator proposed by Kott) and approximations to reference distributions under diverse settings that varied the distribution of the explanatory variables, the values of coefficients, and the degree of intra-cluster correlation (ICC). Our new method generally worked well, providing accurate estimates of the variability of fitted coefficients and tests with near-nominal type I error rates when the ICC is small. Our method works less well when the ICC is large, but it continues to out-perform the traditional sandwich and other alternatives. Copyright © 2006 John Wiley & Sons, Ltd.Keywords
Funding Information
- National Science Foundation (00017630)
This publication has 18 references indexed in Scilit:
- Saddlepoint Approximation and Bootstrap Inference for the Satterthwaite Class of RatiosJournal of the American Statistical Association, 2002
- Modelling and generating correlated binary variablesBiometrika, 2001
- Practical Saddlepoint ApproximationsThe American Statistician, 1999
- Small sample characteristics of generalized estimating equationsCommunications in Statistics - Simulation and Computation, 1995
- Small sample validity of latent variable models for correlated binary dataCommunications in Statistics - Simulation and Computation, 1994
- Using the jackknife to estimate the variance of regression estimators from repeated measures studiesCommunications in Statistics - Theory and Methods, 1990
- Longitudinal data analysis using generalized linear modelsBiometrika, 1986
- Approximate Tests of Independence and Goodness of Fit Based on Stratified Multistage SamplesJournal of the American Statistical Association, 1980
- The Jackknife--A ReviewBiometrika, 1974