A DNA repair system specific for thermophilic Archaea and bacteria predicted by genomic context analysis
Open Access
- 15 January 2002
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 30 (2), 482-496
- https://doi.org/10.1093/nar/30.2.482
Abstract
During a systematic analysis of conserved gene context in prokaryotic genomes, a previously undetected, complex, partially conserved neighborhood consisting of more than 20 genes was discovered in most Archaea (with the exception of Thermoplasma acidophilum and Halobacterium NRC-1) and some bacteria, including the hyperthermophiles Thermotoga maritima and Aquifex aeolicus. The gene composition and gene order in this neighborhood vary greatly between species, but all versions have a stable, conserved core that consists of five genes. One of the core genes encodes a predicted DNA helicase, often fused to a predicted HD-superfamily hydrolase, and another encodes a RecB family exonuclease; three core genes remain uncharacterized, but one of these might encode a nuclease of a new family. Two more genes that belong to this neighborhood and are present in most of the genomes in which the neighborhood was detected encode, respectively, a predicted HD-superfamily hydrolase (possibly a nuclease) of a distinct family and a predicted, novel DNA polymerase. Another characteristic feature of this neighborhood is the expansion of a superfamily of paralogous, uncharacterized proteins, which are encoded by at least 20–30% of the genes in the neighborhood. The functional features of the proteins encoded in this neighborhood suggest that they comprise a previously undetected DNA repair system, which, to our knowledge, is the first repair system largely specific for thermophiles to be identified. This hypothetical repair system might be functionally analogous to the bacterial–eukaryotic system of translesion, mutagenic repair whose central components are DNA polymerases of the UmuC-DinB-Rad30-Rev1 superfamily, which typically are missing in thermophiles.Keywords
This publication has 74 references indexed in Scilit:
- Regulatory potential, phyletic distribution and evolution of ancient, intracellular small-molecule-binding domains11Edited by F. CohenJournal of Molecular Biology, 2001
- T-coffee: a novel method for fast and accurate multiple sequence alignment 1 1Edited by J. ThorntonJournal of Molecular Biology, 2000
- Who's your neighbor? New computational approaches for functional genomicsNature Biotechnology, 2000
- The Universal Ancestor Lived in a Thermophilic or Hyperthermophilic EnvironmentJournal of Theoretical Biology, 2000
- SMART: a web-based tool for the study of genetically mobile domainsNucleic Acids Research, 2000
- Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searchesJournal of Molecular Biology, 1999
- Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequenceNature, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Crystal Structure of a pol α Family Replication DNA Polymerase from Bacteriophage RB69Cell, 1997
- An attempt to unify the structure of polymerasesProtein Engineering, Design and Selection, 1990