Use of Imputed Population-based Cancer Registry Data as a Method of Accounting for Missing Information: Application to Estrogen Receptor Status for Breast Cancer

Open Access

25 July 2012

journal article
research article
Published by Oxford University Press (OUP) in American Journal of Epidemiology

Vol. 176 (4), 347-356
https://doi.org/10.1093/aje/kwr512

Abstract

The National Cancer Institute's Surveillance, Epidemiology, and End Results (SEER) Program provides a rich source of data stratified according to tumor biomarkers that play an important role in cancer surveillance research. These data are useful for analyzing trends in cancer incidence and survival. These tumor markers, however, are often prone to missing observations. To address the problem of missing data, the authors employed sequential regression multivariate imputation for breast cancer variables, with a particular focus on estrogen receptor status, using data from 13 SEER registries covering the period 1992–2007. In this paper, they present an approach to accounting for missing information through the creation of imputed data sets that can be analyzed using existing software (e.g., SEER*Stat) developed for analyzing cancer registry data. Bias in age-adjusted trends in female breast cancer incidence is shown graphically before and after imputation of estrogen receptor status, stratified by age and race. The imputed data set will be made available in SEER*Stat (http://seer.cancer.gov/analysis/index.html) to facilitate accurate estimation of breast cancer incidence trends. To ensure that the imputed data set is used correctly, the authors provide detailed, step-by-step instructions for conducting analyses. This is the first time that a nationally representative, population-based cancer registry data set has been imputed and made available to researchers for conducting a variety of analyses of breast cancer incidence trends.

Keywords

This publication has 17 references indexed in Scilit:

Incidence of Breast Cancer in the United States: Current and Future Trends
JNCI Journal of the National Cancer Institute, 2011
Breast Cancer Incidence Rates in U.S. Women Are No Longer Declining
Cancer Epidemiology, Biomarkers & Prevention, 2011
Feasibility Study for Collection of HER2 Data by National Cancer Institute (NCI) Surveillance, Epidemiology, and End Results (SEER) Program Central Cancer Registries
Cancer Epidemiology, Biomarkers & Prevention, 2010
Sustained lower rates of breast cancer in the United States
Breast Cancer Research and Treatment, 2008
Race/ethnicity and breast cancer estrogen receptor status: impact of class, missing data, and modeling assumptions
Cancer Causes & Control, 2008
American Society of Clinical Oncology 2007 Update of Recommendations for the Use of Tumor Markers in Breast Cancer
Journal of Clinical Oncology, 2007
Breast Cancer Incidence, 1980 2006: Combined Roles of Menopausal Hormone Therapy, Screening Mammography, and Estrogen Receptor Status
JNCI Journal of the National Cancer Institute, 2007
Recent trends in breast cancer incidence rates by age and tumor characteristics among U.S. women
Breast Cancer Research, 2007
Much Ado About Nothing
The American Statistician, 2007
Risks and Benefits of Estrogen Plus Progestin in Healthy Postmenopausal Women: Principal Results From the Women's Health Initiative Randomized Controlled Trial
JAMA, 2002

Cited by 69 articles