A Latent Variable Approach for Meta-Analysis of Gene Expression Data from Multiple Microarray Experiments
Open Access
- 27 September 2007
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 8 (1), 364
- https://doi.org/10.1186/1471-2105-8-364
Abstract
Background: With the explosion in data generated using microarray technology by different investigators working on similar experiments, it is of interest to combine results across multiple studies. Results: In this article, we describe a general probabilistic framework, based on mixture models, for combining high-throughput genomic data from several related microarray experiments. A key feature of the model is its use of latent variables to represent quantities that can be combined across diverse platforms. We consider two methods for estimating an index termed the probability of expression (POE). The first, reported in previous work by the authors, uses Markov chain Monte Carlo (MCMC) techniques. The second is a faster procedure based on the expectation-maximization (EM) algorithm. The methods are illustrated with a meta-analysis of datasets for metastatic cancer. Conclusion: The statistical methods described in the paper are available as an R package, metaArray 1.8.1, distributed through Bioconductor (http://www.bioconductor.org/).
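To make the EM-based estimation idea concrete, the following is a minimal sketch for a single gene, not the metaArray implementation. It assumes a three-component mixture (a uniform component for under-expression, a normal component for baseline expression, and a uniform component for over-expression) and, as a simplification, fixes the uniform supports on either side of the sample median. The function name `poe_em`, its parameters, and the toy data are hypothetical. The returned index is the posterior probability of over-expression minus that of under-expression, so it lies in [-1, 1] and is unit-free, which is what allows it to be compared across platforms.

```python
import numpy as np

def poe_em(x, n_iter=500, tol=1e-8):
    """EM for a uniform/normal/uniform mixture on one gene's expression values.

    Returns P(over | x) - P(under | x) for each sample (a POE-like index).
    Assumption: the uniform supports are fixed at [min, median) and [median, max].
    """
    x = np.asarray(x, dtype=float)
    c = np.median(x)                          # fixed split point (assumption)
    lo, hi = x.min() - 1e-6, x.max() + 1e-6   # pad so every point has support
    f_under = np.where(x < c, 1.0 / (c - lo), 0.0)
    f_over = np.where(x >= c, 1.0 / (hi - c), 0.0)

    mu, sigma = c, x.std() + 1e-6             # initial baseline parameters
    pi = np.array([0.1, 0.8, 0.1])            # weights: under, baseline, over
    prev_ll = -np.inf
    for _ in range(n_iter):
        # E-step: posterior membership probabilities for each sample
        f_base = np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))
        joint = pi[:, None] * np.vstack([f_under, f_base, f_over])
        total = joint.sum(axis=0)
        resp = joint / total

        # M-step: update mixing weights and the normal component only
        pi = resp.mean(axis=1)
        w = resp[1]
        mu = np.sum(w * x) / np.sum(w)
        sigma = max(np.sqrt(np.sum(w * (x - mu) ** 2) / np.sum(w)), 1e-3)

        # Stop when the observed-data log-likelihood no longer changes
        ll = np.log(total).sum()
        if abs(ll - prev_ll) < tol:
            break
        prev_ll = ll
    return resp[2] - resp[0]

# Toy data (hypothetical): 40 baseline samples followed by 10 elevated samples
x = np.concatenate([np.linspace(-0.5, 0.5, 40), np.linspace(2.8, 3.2, 10)])
poe = poe_em(x)
```

In this toy example the last ten entries of `poe` correspond to the elevated samples and should sit near the top of the scale. Applying the same transformation gene by gene within each study would put all studies on a common scale before they are combined.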