Significance analysis of functional categories in gene expression studies: a structured permutation approach
- 12 January 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (9), 1943-1949
- https://doi.org/10.1093/bioinformatics/bti260
Abstract
In high-throughput genomic and proteomic experiments, investigators monitor expression across a set of experimental conditions. To gain an understanding of broader biological phenomena, researchers have until recently been limited to post hoc analyses of significant gene lists. We describe a general framework, significance analysis of function and expression (SAFE), for conducting valid tests of gene categories ab initio. SAFE is a two-stage, permutation-based method that can be applied to various experimental designs, accounts for the unknown correlation among genes and enables permutation-based estimation of error rates. The utility and flexibility of SAFE are illustrated with a microarray dataset of human lung carcinomas and gene categories based on Gene Ontology and the Protein Family database. Significant gene categories were observed in comparisons of (1) tumor versus normal tissue, (2) multiple tumor subtypes and (3) survival times. Code to implement SAFE in the statistical package R is available from the authors. http://www.bios.unc.edu/~fwright/SAFE
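The two-stage, permutation-based idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' R implementation. The abstract does not specify the statistics, so the choices here are assumptions made for illustration: a Welch t-statistic as the gene-level (stage 1) statistic, a rank sum of absolute gene-level statistics as the category-level (stage 2) statistic, and a two-class design with 0/1 labels. The function names (`welch_t`, `local_stats`, `global_stat`, `safe_like_pvalue`) are hypothetical. Permuting the sample labels, rather than the genes, keeps each sample's expression profile intact, which is how a scheme of this kind can preserve the unknown correlation among genes.

```python
import math
import random


def welch_t(a, b):
    """Gene-level statistic (assumed choice): Welch t comparing two groups."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    va = sum((x - ma) ** 2 for x in a) / (len(a) - 1)
    vb = sum((x - mb) ** 2 for x in b) / (len(b) - 1)
    se = math.sqrt(va / len(a) + vb / len(b))
    return (ma - mb) / se if se > 0 else 0.0


def local_stats(expr, labels):
    """Stage 1: one statistic per gene (row of expr) for 0/1 sample labels."""
    stats = []
    for row in expr:
        group1 = [x for x, y in zip(row, labels) if y == 1]
        group0 = [x for x, y in zip(row, labels) if y == 0]
        stats.append(welch_t(group1, group0))
    return stats


def global_stat(stats, category):
    """Stage 2: rank sum of |gene-level statistic| over genes in the category.

    Ties are broken by position rather than with midranks (a simplification).
    """
    order = sorted(range(len(stats)), key=lambda i: abs(stats[i]))
    rank = {gene: r + 1 for r, gene in enumerate(order)}
    return sum(rank[g] for g in category)


def safe_like_pvalue(expr, labels, category, n_perm=1000, seed=0):
    """Empirical p-value for one category by permuting sample labels.

    Both stages are recomputed for every permutation, so whole samples move
    together and between-gene correlation is left untouched.
    """
    rng = random.Random(seed)
    observed = global_stat(local_stats(expr, labels), category)
    perm = list(labels)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(perm)
        if global_stat(local_stats(expr, perm), category) >= observed:
            exceed += 1
    # Add-one correction keeps the estimate strictly positive.
    return (exceed + 1) / (n_perm + 1)
```

Here `expr` is a list of per-gene expression rows, `labels` is the 0/1 class of each sample, and `category` is a list of row indices belonging to one gene category. The sketch covers a single category only. The error-rate estimation across many categories mentioned in the abstract is not shown.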