Comparative study of gene set enrichment methods
Open Access
- 2 September 2009
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 10 (1), 275
- https://doi.org/10.1186/1471-2105-10-275
Abstract
The analysis of high-throughput gene expression data with respect to sets of genes rather than individual genes has many advantages. A variety of methods have been developed for assessing the enrichment of sets of genes with respect to differential expression. In this paper we provide a comparative study of four of these methods: Fisher's exact test, Gene Set Enrichment Analysis (GSEA), Random-Sets (RS), and Gene List Analysis with Prediction Accuracy (GLAPA). The first three methods use associative statistics, while the fourth uses predictive statistics. We first compare all four methods on simulated data sets to verify that Fisher's exact test is markedly worse than the other three approaches. We then validate the other three methods on seven real data sets with known genetic perturbations and then compare the methods on two cancer data sets where our a priori knowledge is limited.Keywords
This publication has 27 references indexed in Scilit:
- Modeling Cancer Progression via Pathway DependenciesPLoS Computational Biology, 2008
- Module-Based Outcome Prediction Using Breast Cancer CompendiaPLOS ONE, 2007
- Random-set methods identify distinct aspects of the enrichment signal in gene-set analysisThe Annals of Applied Statistics, 2007
- Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray dataBioinformatics, 2007
- Oncogenic pathway signatures in human cancers as a guide to targeted therapiesNature, 2005
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005
- Global detection of molecular changes reveals concurrent alteration of several biological pathways in nonsmall cell lung cancer cellsMolecular Genetics and Genomics, 2005
- Ontological analysis of gene expression data: current tools, limitations, and open problemsBioinformatics, 2005
- Cancer genes and the pathways they controlNature Medicine, 2004
- The Hallmarks of CancerCell, 2000