Survey Inference for Subpopulations

Abstract
One frequently analyzes a subset of the data collected in a survey when interest focuses on individuals in a certain subpopulation of the sampled population. Although it may seem natural to eliminate from the data set all data from individuals outside the subpopulation before analysis, this procedure may yield incorrect standard errors and confidence intervals. The authors give two examples of this using data from the 1987 National Health Interview Survey and the 1986 National Mortality Followback Survey. The correct method of analysis is described, as well as a simple condition that, when satisfied, ensures that the elimination approach yields identical answers to the correct method. Am J Epidemiol 1996; 144: 102–6.