Constructing gene co-expression networks and predicting functions of unknown genes by random matrix theory
Open Access
- 14 August 2007
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 8 (1), 299
- https://doi.org/10.1186/1471-2105-8-299
Abstract
Large-scale sequencing of entire genomes has ushered in a new age in biology. One of the next grand challenges is to dissect the cellular networks consisting of many individual functional modules. Defining co-expression networks without ambiguity based on genome-wide microarray data is difficult and current methods are not robust and consistent with different data sets. This is particularly problematic for little understood organisms since not much existing biological knowledge can be exploited for determining the threshold to differentiate true correlation from random noise. Random matrix theory (RMT), which has been widely and successfully used in physics, is a powerful approach to distinguish system-specific, non-random properties embedded in complex systems from random noise. Here, we have hypothesized that the universal predictions of RMT are also applicable to biological systems and the correlation threshold can be determined by characterizing the correlation matrix of microarray profiles using random matrix theory.Keywords
This publication has 55 references indexed in Scilit:
- Application of random matrix theory to biological networksPhysics Letters A, 2006
- Global landscape of protein complexes in the yeast Saccharomyces cerevisiaeNature, 2006
- Reverse engineering of regulatory networks in human B cellsNature Genetics, 2005
- Reverse-engineering transcription control networksPhysics of Life Reviews, 2005
- Network biology: understanding the cell's functional organizationNature Reviews Genetics, 2004
- Global analysis of protein localization in budding yeastNature, 2003
- Module networks: identifying regulatory modules and their condition-specific regulators from gene expression dataNature Genetics, 2003
- Transitive functional annotation by shortest-path analysis of gene expression dataProceedings of the National Academy of Sciences, 2002
- Network motifs in the transcriptional regulation network of Escherichia coliNature Genetics, 2002
- Functional organization of the yeast proteome by systematic analysis of protein complexesNature, 2002