Distinct Viral Lineages from Fish and Amphibians Reveal the Complex Evolutionary History of Hepadnaviruses

Abstract
Hepadnaviruses (HBVs) are the only animal viruses that replicate their DNA by reverse transcription of an RNA intermediate. Until recently, the host range of hepadnaviruses was limited to mammals and birds. We obtained and analyzed the first amphibian HBV genome, as well as several prototype fish HBVs that allow the first comprehensive comparative genomic analysis of hepadnaviruses from four classes of vertebrates. Bluegill hepadnavirus (BGHBV) was characterized from in-house viral metagenomic sequencing. The African cichlid hepadnavirus (ACHBV) and the Tibetan frog hepadnavirus (TFHBV) were discovered using in silico analyses of the whole-genome shotgun and transcriptome Shotgun assembly databases. Residues in the hydrophobic base of the capsid (core) proteins, designated motif I, II and III, are highly conserved, suggesting that structural constraints for proper capsid folding are key to capsid protein evolution. Surface proteins in all vertebrate HBVs contain similar predicted membrane topology, characterized by three transmembrane domains. Most striking was that the BGHBV, ACHBV, and the previously described white sucker hepadnavirus did not form a fish-specific monophyletic group in the phylogenetic analysis of all three hepadnaviral genes. Notably, BGHBV was more closely related to the mammalian hepadnaviruses, indicating that cross-species transmission events have played a major role in viral evolution. Evidence of cross-species transmission was also observed with TFHBV. Hence, these data indicate that the evolutionary history of the hepadnaviruses is more complex than previously realized and combines both virus-host co-divergence over millions of years and host species jumping.