Genomes in Flux: The Evolution of Archaeal and Proteobacterial Gene Content
- 14 December 2001
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 12 (1), 17-25
- https://doi.org/10.1101/gr.176501
Abstract
In the course of evolution, genomes are shaped by processes like gene loss, gene duplication, horizontal gene transfer, and gene genesis (the de novo origin of genes). Here we reconstruct the gene content of ancestral Archaea and Proteobacteria and quantify the processes connecting them to their present-day representatives based on the distribution of genes in completely sequenced genomes. We estimate that the ancestor of the Proteobacteria contained around 2500 genes, and the ancestor of the Archaea around 2050 genes. Although it is necessary to invoke horizontal gene transfer to explain the content of present-day genomes, gene loss, gene genesis, and simple vertical inheritance are quantitatively the dominant processes in shaping the genome. Together they result in a turnover of gene content such that even the lineage leading from the ancestor of the Proteobacteria to the relatively large genome of Escherichia coli has lost at least 950 genes. Gene loss, unlike the other processes, correlates fairly well with time. This clock-like behavior suggests that gene loss is under negative selection, while the processes that add genes are under positive selection.
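The abstract does not say which reconstruction algorithm was used, so the following is only a minimal sketch of one simple way to infer ancestral gene content from presence/absence data. It assumes a Dollo-style rule: each gene family originates once and may afterwards only be lost. The tree, the genome names (`A`, `B`, `C`), the gene labels (`g1` to `g4`), and the functions `genes_below` and `reconstruct` are all hypothetical and chosen for illustration. Because this rule ignores horizontal gene transfer, it will tend to place patchily distributed genes in deep ancestors and explain their absence elsewhere as loss.

```python
from collections import Counter

# Hypothetical toy tree: internal node -> list of child nodes.
TREE = {"root": ["A", "n1"], "n1": ["B", "C"]}

# Hypothetical presence/absence data: leaf genome -> gene families it carries.
LEAVES = {
    "A": {"g1", "g2"},
    "B": {"g1", "g3"},
    "C": {"g3", "g4"},
}


def genes_below(node):
    """Return every gene family found in any leaf genome under `node`."""
    if node in LEAVES:
        return set(LEAVES[node])
    found = set()
    for child in TREE[node]:
        found |= genes_below(child)
    return found


def reconstruct(node="root", inherited=frozenset(), out=None):
    """Assign gene content, gains, and losses to every node (Dollo-style).

    A gene is present at an internal node if it occurs under at least two
    of its children (the node is at or above a split between carriers), or
    if the parent had it and some descendant leaf still carries it.
    """
    out = {} if out is None else out
    if node in LEAVES:
        present = set(LEAVES[node])  # observed content of a sequenced genome
    else:
        counts = Counter(g for c in TREE[node] for g in genes_below(c))
        shared = {g for g, n in counts.items() if n >= 2}
        present = (set(inherited) & genes_below(node)) | shared
    out[node] = {
        "present": present,
        "gained": present - inherited,       # new on the branch leading here
        "lost": set(inherited) - present,    # in the parent, absent here
    }
    for child in TREE.get(node, []):
        reconstruct(child, frozenset(present), out)
    return out


if __name__ == "__main__":
    for name, info in reconstruct().items():
        print(name, sorted(info["present"]),
              "gained:", sorted(info["gained"]),
              "lost:", sorted(info["lost"]))
```

In this toy data set, `g1` occurs in genomes `A` and `B`, which sit on opposite sides of the root, so the sketch places it in the root and records a loss on the branch leading to `C`. Summing the `lost` sets along the path from an ancestor to a leaf gives the kind of per-lineage loss count the abstract reports, under the stated no-transfer assumption.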