Censoring Trace-Level Environmental Data: Statistical Analysis Considerations to Limit Bias
- 23 February 2021
- journal article
- research article
- Published by American Chemical Society (ACS) in Environmental Science & Technology
- Vol. 55 (6), 3786-3795
- https://doi.org/10.1021/acs.est.0c02256
Abstract
Trace-level environmental data typically include values near or below detection and quantitation thresholds where health effects may result from low-concentration exposures to one chemical over time or to multiple chemicals. In a cook stove case study, bias in dibenzo[a,h]anthracene concentration means and standard deviations (SDs) was assessed following censoring at thresholds for selected analysis approaches: substituting threshold/2, maximum likelihood estimation, robust regression on order statistics, Kaplan–Meier, and omitting censored observations. Means and SDs for gas chromatography–mass spectrometry-determined concentrations were calculated after censoring at detection and calibration thresholds, 17% and 55% of the data, respectively. Threshold/2 substitution was the least biased. Measurement values were subsequently simulated from two log-normal distributions at two sample sizes. Means and SDs were calculated for 30%, 50%, and 80% censoring levels and compared to known distribution counterparts. Simulation results illustrated (1) threshold/2 substitution to be inferior to modern after-censoring statistical approaches and (2) all after-censoring approaches to be inferior to including all measurement data in analysis. Additionally, differences in stove-specific group means were tested for uncensored samples and after censoring. Group differences of means tests varied depending on censoring and distributional decisions. Investigators should guard against censoring-related bias from (explicit or implicit) distributional and analysis approach decisions.Keywords
This publication has 68 references indexed in Scilit:
- A Moment-Adjusted Imputation Method for Measurement Error ModelsBiometrics, 2011
- A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of DetectionEnvironmental Health Perspectives, 2011
- Inter-laboratory comparison study on measuring semi-volatile organic chemicals in standards and air samplesEnvironmental Pollution, 2010
- Epidemiologic Evaluation of Measurement Data in the Presence of Detection LimitsEnvironmental Health Perspectives, 2004
- Detection and quantification limits: origins and historical overviewAnalytica Chimica Acta, 1999
- Simulation-Extrapolation: The Measurement Error JackknifeJournal of the American Statistical Association, 1995
- Replacement of censored observations by a constant: An evaluationWater Research, 1992
- Nondetects, Detection Limits, and the Probability of DetectionJournal of the American Statistical Association, 1991
- Estimating Mean Concentrations Under Transformation for Environmental Data With Detection LimitsTechnometrics, 1989
- The detection limitEnvironmental Science & Technology, 1988