Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis
Open Access
- 27 November 2007
- journal article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (2), 192-201
- https://doi.org/10.1093/bioinformatics/btm583
Abstract
Motivation: The field of microarray data analysis is shifting emphasis from methods for identifying differentially expressed genes to methods for identifying differentially expressed gene categories. The latter approaches use a priori information about genes to group them into categories and enhance the interpretation of experiments aimed at identifying expression differences across treatments. While almost all of the existing approaches for identifying differentially expressed gene categories are practically useful, they suffer from a variety of drawbacks. Perhaps most notably, many popular tools are based exclusively on gene-specific statistics that cannot detect many types of multivariate expression change.
Results: We have developed a nonparametric multivariate method for identifying gene categories whose multivariate expression distribution differs across two or more conditions. We illustrate our approach and compare its performance to several existing procedures via the analysis of a real data set and a unique data-based simulation study designed to capture the challenges and complexities of practical data analysis. We show that our method has good power for differentiating between differentially expressed and non-differentially expressed gene categories, and we use a resampling-based strategy for controlling the false discovery rate when testing multiple categories.
Availability: R code (www.r-project.org) for implementing our approach is available from the first author by request.
Contact: dnett@iastate.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
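The abstract does not name the test statistic, so the following is only a minimal sketch of what a nonparametric multivariate test for a single gene category could look like. It assumes a distance-based permutation test: the statistic is the size-weighted average of within-group mean pairwise Euclidean distances, and small values suggest that the conditions differ. The function names, the choice of Euclidean distance, and the weighting are illustrative assumptions, not the authors' implementation (their R code is available on request). The resampling-based false discovery rate step for multiple categories is not shown.

```python
import math
import random
from itertools import combinations


def mean_within_group_distance(samples, labels):
    """Size-weighted average of within-group mean pairwise Euclidean distances.

    samples: list of expression vectors, one per array, restricted to the
             genes of one category (hypothetical input layout).
    labels:  condition label for each array.
    """
    groups = {}
    for row, lab in zip(samples, labels):
        groups.setdefault(lab, []).append(row)

    n = len(samples)
    total = 0.0
    for rows in groups.values():
        pairs = list(combinations(rows, 2))
        if not pairs:  # a group with a single array contributes nothing
            continue
        mean_d = sum(math.dist(a, b) for a, b in pairs) / len(pairs)
        total += (len(rows) / n) * mean_d
    return total


def category_permutation_pvalue(samples, labels, n_perm=999, seed=0):
    """Permutation p-value for one gene category (assumed procedure).

    Condition labels are shuffled across arrays; the p-value is the share of
    permutations whose statistic is at least as small as the observed one.
    """
    rng = random.Random(seed)
    observed = mean_within_group_distance(samples, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        # Small tolerance guards against floating-point ties.
        if mean_within_group_distance(samples, shuffled) <= observed + 1e-12:
            hits += 1
    # Add-one correction keeps the p-value strictly positive.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Toy category with 3 genes and 5 arrays per condition (made-up numbers).
    control = [[0.1 * i, 0.2 * i, 0.1] for i in range(5)]
    treated = [[10 + 0.1 * i, 10 + 0.2 * i, 10.1] for i in range(5)]
    p = category_permutation_pvalue(control + treated, ["C"] * 5 + ["T"] * 5)
    print(f"permutation p-value: {p:.3f}")
```

Because the statistic is computed on the whole expression vector of a category rather than gene by gene, a test of this kind can in principle respond to joint changes that gene-specific statistics miss, which is the drawback the abstract highlights.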