Unique genes in giant viruses: Regular substitution pattern and anomalously short size
- 25 July 2007
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 17 (9), 1353-1361
- https://doi.org/10.1101/gr.6358607
Abstract
Large DNA viruses, including giant mimivirus with a 1.2-Mb genome, exhibit numerous orphan genes possessing no database homologs or genes with homologs solely in close members of the same viral family. Due to their solitary nature, the functions and evolutionary origins of those genes remain obscure. We examined sequence features and evolutionary rates of viral family-specific genes in three nucleo-cytoplasmic large DNA virus (NCLDV) lineages. First, we showed that the proportion of family-specific genes does not correlate with sequence divergence rate. Second, position-dependent nucleotide statistics were similar between family-specific genes and the remaining genes in the genome. Third, we showed that the synonymous-to-nonsynonymous substitution ratios in those viruses are at levels comparable to those estimated for vertebrate proteomes. Thus, the vast majority of family-specific genes do not exhibit an accelerated evolutionary rate, and are thus likely to specify functional polypeptides. On the other hand, these family-specific proteins exhibit several distinct properties: (1) they are shorter, (2) they include a larger fraction of predicted transmembrane proteins, and (3) they are enriched in low-complexity sequences. These results suggest that family-specific genes do not correspond to recent horizontal gene transfer. We propose that their characteristic features are the consequences of the specific evolutionary forces shaping the viral gene repertoires in the context of their parasitic lifestyles.Keywords
This publication has 61 references indexed in Scilit:
- The Mimivirus Genome Encodes a Mitochondrial Carrier That Transports dATP and dTTPJournal of Virology, 2007
- Mimivirus Giant Particles Incorporate a Large Fraction of Anonymous and Unique Gene ProductsJournal of Virology, 2006
- Sequence and annotation of the 369-kb NY-2A and the 345-kb AR158 viruses that infect Chlorella NC64AVirology, 2006
- Viruses in the seaNature, 2005
- Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolutionNature, 2004
- Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotypeNature, 2004
- Genome sequence of the Brown Norway rat yields insights into mammalian evolutionNature, 2004
- Origin and Evolution of Viral Interleukin-10 and Other DNA Virus Genes with Vertebrate HomologuesJournal of Molecular Evolution, 2002
- Improved microbial gene identification with GLIMMERNucleic Acids Research, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997