Statistical Challenges in Preprocessing in Microarray Experiments in Cancer
- 30 September 2008
- journal article
- review article
- Published by American Association for Cancer Research (AACR) in Clinical Cancer Research
- Vol. 14 (19), 5959-5966
- https://doi.org/10.1158/1078-0432.ccr-07-4532
Abstract
Many clinical studies incorporate genomic experiments to investigate the potential associations between high-dimensional molecular data and clinical outcome. A critical first step in the statistical analyses of these experiments is that the molecular data are preprocessed. This article provides an overview of preprocessing methods, including summary algorithms and quality control metrics for microarrays. Some of the ramifications and effects that preprocessing methods have on the statistical results are illustrated. The discussions are centered around a microarray experiment based on lung cancer tumor samples with survival as the clinical outcome of interest. The procedures that are presented focus on the array platform used in this study. However, many of these issues are more general and are applicable to other instruments for genome-wide investigation. The discussions here will provide insight into the statistical challenges in preprocessing microarrays used in clinical studies of cancer. These challenges should not be viewed as inconsequential nuisances but rather as important issues that need to be addressed so that informed conclusions can be drawn.