Context-specific infinite mixtures for clustering gene expression profiles across diverse microarray dataset
Open Access
- 18 May 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 22 (14), 1737-1744
- https://doi.org/10.1093/bioinformatics/btl184
Abstract
Motivation: Identifying groups of co-regulated genes by monitoring their expression over various experimental conditions is complicated by the fact that such co-regulation is condition-specific. Ignoring the context-specific nature of co-regulation significantly reduces the ability of clustering procedures to detect co-expressed genes due to additional ‘noise’ introduced by non-informative measurements. Results: We have developed a novel Bayesian hierarchical model and corresponding computational algorithms for clustering gene expression profiles across diverse experimental conditions and studies that accounts for context-specificity of gene expression patterns. The model is based on the Bayesian infinite mixtures framework and does not require a priori specification of the number of clusters. We demonstrate that explicit modeling of context-specificity results in increased accuracy of the cluster analysis by examining the specificity and sensitivity of clusters in microarray data. We also demonstrate that probabilities of co-expression derived from the posterior distribution of clusterings are valid estimates of statistical significance of created clusters. Availability: The open-source package gimm is available at Author Webpage Contact: Mario.Medvedovic@uc.edu Supplementary information: Author WebpageKeywords
This publication has 17 references indexed in Scilit:
- Bayesian mixture model based clustering of replicated microarray dataBioinformatics, 2004
- The KEGG resource for deciphering the genomeNucleic Acids Research, 2004
- A Gene-Coexpression Network for Global Discovery of Conserved Genetic ModulesScience, 2003
- Microarray analysis of gene expression during the cell cycleCell & Chromosome, 2003
- Module networks: identifying regulatory modules and their condition-specific regulators from gene expression dataNature Genetics, 2003
- Splitting vessels: Keeping lymph apart from bloodNature Medicine, 2003
- Transcriptional Regulatory Networks in Saccharomyces cerevisiaeScience, 2002
- Context-Specific Bayesian Clustering for Gene Expression DataJournal of Computational Biology, 2002
- Sampling-Based Approaches to Calculating Marginal DensitiesJournal of the American Statistical Association, 1990
- Sampling-Based Approaches to Calculating Marginal DensitiesJournal of the American Statistical Association, 1990