Genomic Fossils Calibrate the Long-Term Evolution of Hepadnaviruses
Open Access
- 28 September 2010
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Biology
- Vol. 8 (9), e1000495
- https://doi.org/10.1371/journal.pbio.1000495
Abstract
Because most extant viruses mutate rapidly and lack a true fossil record, their deep evolution and long-term substitution rates remain poorly understood. In addition to retroviruses, which rely on chromosomal integration for their replication, many other viruses replicate in the nucleus of their hosts' cells and are therefore prone to endogenization, a process that involves integration of viral DNA into the host's germline genome followed by long-term vertical inheritance. Such endogenous viruses are highly valuable because they provide a molecular fossil record of past viral invasions, which may be used to decipher the origins and long-term evolutionary characteristics of modern pathogenic viruses. Hepadnaviruses (Hepadnaviridae) are a family of small, partially double-stranded DNA viruses that includes the hepatitis B viruses. Here we report the discovery of endogenous hepadnaviruses in the genome of the zebra finch. We used a combination of cross-species analysis of orthologous insertions, molecular dating, and phylogenetic analyses to demonstrate that hepadnaviruses repeatedly infiltrated the germline genome of passerine birds. We provide evidence that some of the avian hepadnavirus integration events are at least 19 My old, which reveals a much deeper ancestry of Hepadnaviridae than could be inferred from the coalescence times of modern hepadnaviruses. Furthermore, the remarkable sequence similarity between endogenous and extant avian hepadnaviruses (up to 75% identity) suggests that long-term substitution rates for these viruses are on the order of 10⁻⁸ substitutions per site per year, which is 1,000-fold slower than short-term rates estimated from the sequences of circulating hepadnaviruses. Together, these results imply a drastic shift in our understanding of the time scale of hepadnavirus evolution, and suggest that the rapid evolutionary dynamics characterizing modern avian hepadnaviruses do not reflect their mode of evolution on a deep time scale.
Paleovirology is the study of ancient viruses and the way they have shaped the innate immune system of their hosts over millions of years. One way to reconstruct the deep evolution of viruses is to search for viral sequences “fossilized” at different evolutionary time points in the genome of their hosts. Besides retroviruses, few virus families are known to have deposited molecular relics in their hosts' genomes. Here we report on the discovery of multiple fragments of viruses belonging to the Hepadnaviridae family (which includes the human hepatitis B viruses) fossilized in the genome of the zebra finch. We show that some of these fragments infiltrated the germline genome of passerine birds more than 19 million years ago, which implies that hepadnaviruses are much older than previously thought. Based on this age, we can infer a long-term avian hepadnavirus substitution rate that is 1,000-fold slower than all short-term substitution rates calculated from extant hepadnavirus sequences. These results call for a reevaluation of the long-term evolution of Hepadnaviridae, and indicate that some exogenous hepadnaviruses may still be circulating today in various passerine birds.
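The order of magnitude of the long-term rate can be checked with a back-of-envelope calculation. The sketch below is not the authors' dating method; it is a minimal illustration that assumes the 25% sequence difference (from the reported maximum of 75% identity) accumulated over the reported minimum age of 19 My, and it optionally applies a Jukes-Cantor correction for multiple substitutions at the same site. The function names are hypothetical, and the choice of the Jukes-Cantor model is an assumption made here for simplicity.

```python
import math


def jukes_cantor_distance(p):
    """Jukes-Cantor corrected distance for an observed proportion p of differing sites."""
    if not 0.0 <= p < 0.75:
        raise ValueError("p must lie in [0, 0.75) for the Jukes-Cantor correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def substitution_rate(identity, age_years, correct=True):
    """Rough substitutions per site per year, given sequence identity and elapsed time.

    Assumption: all observed divergence accumulated over `age_years`.
    """
    p = 1.0 - identity
    distance = jukes_cantor_distance(p) if correct else p
    return distance / age_years


# Values taken from the abstract: up to 75% identity, integration at least 19 My old.
identity = 0.75
age_years = 19e6

rate_raw = substitution_rate(identity, age_years, correct=False)
rate_jc = substitution_rate(identity, age_years, correct=True)

print(f"uncorrected rate:   {rate_raw:.2e} substitutions/site/year")
print(f"Jukes-Cantor rate:  {rate_jc:.2e} substitutions/site/year")
```

Under these assumptions both estimates fall in the 10⁻⁸ range, in line with the figure quoted in the abstract. Because 19 My is a minimum age, dividing by it gives an upper bound on the rate for that level of divergence; an older integration event would imply an even slower rate.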