Unifying measures of gene function and evolution
- 29 March 2006
- journal article
- research article
- Published by The Royal Society in Proceedings Of The Royal Society B-Biological Sciences
- Vol. 273 (1593), 1507-1515
- https://doi.org/10.1098/rspb.2006.3472
Abstract
Recent genome analyses revealed intriguing correlations between variables characterizing the functioning of a gene, such as expression level (EL), connectivity of genetic and protein–protein interaction networks, and knockout effect, and variables describing gene evolution, such as sequence evolution rate (ER) and propensity for gene loss. Typically, variables within each of these classes are positively correlated, e.g. products of highly expressed genes also have a propensity to be involved in many protein–protein interactions, whereas variables between classes are negatively correlated, e.g. highly expressed genes, on average, evolve slower than weakly expressed genes. Here, we describe principal component (PC) analysis of seven genome-related variables and propose biological interpretations for the first three PCs. The first PC reflects a gene's ‘importance’, or the ‘status’ of a gene in the genomic community, with positive contributions from knockout lethality, EL, number of protein–protein interaction partners and the number of paralogues, and negative contributions from sequence ER and gene loss propensity. The next two PCs define a plane that seems to reflect the functional and evolutionary plasticity of a gene. Specifically, PC2 can be interpreted as a gene's ‘adaptability’ whereby genes with high adaptability readily duplicate, have many genetic interaction partners and tend to be non-essential. PC3 also might reflect the role of a gene in organismal adaptation albeit with a negative rather than a positive contribution of genetic interactions; we provisionally designate this PC ‘reactivity’. The interpretation of PC2 and PC3 as measures of a gene's plasticity is compatible with the observation that genes with high values of these PCs tend to be expressed in a condition- or tissue-specific manner. Functional classes of genes substantially vary in status, adaptability and reactivity, with the highest status characteristic of the translation system and cytoskeletal proteins, highest adaptability seen in cellular processes and signalling genes, and top reactivity characteristic of metabolic enzymes.Keywords
This publication has 37 references indexed in Scilit:
- Converging on a general model of protein evolutionTrends in Biotechnology, 2005
- Significant Impact of Protein Dispensability on the Instantaneous Rate of Protein EvolutionMolecular Biology and Evolution, 2005
- Systemic determinants of gene evolution and functionMolecular Systems Biology, 2005
- Determinants of Adaptive Evolution at the Molecular Level: the Extended Complexity HypothesisMolecular Biology and Evolution, 2004
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- Systematic functional analysis of the Caenorhabditis elegans genome using RNAiNature, 2003
- Rate of evolution and gene dispensabilityNature, 2003
- Modeling a Minimal Ribosome Based on Comparative Sequence AnalysisJournal of Molecular Biology, 2002
- Do essential genes evolve slowly?Current Biology, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997