Phylogenetic Analysis of Ribonuclease H Domains Suggests a Late, Chimeric Origin of LTR Retrotransposable Elements and Retroviruses

Abstract
We have conducted a phylogenetic analysis of the Ribonuclease HI (RNH) domains present in Eubacteria, Eukarya, all long-term repeat (LTR)-bearing retrotransposons, and several late-branching clades of non-LTR retrotransposons. Analysis of this simple yet highly conserved enzymatic domain from these disparate sources provides surprising insights into the evolution of eukaryotic retrotransposons. First, it indicates that the lineage of elements leading to vertebrate retroviruses acquired a new RNH domain either from non-LTR retrotransposons or from a eukaryotic host genome. The preexisting retroviral RNH domain degenerated to become the tether (connection) domain of the reverse transcriptase (RT)–RNH complex. Second, it indicates that all LTR retrotransposons arose in eukaryotes well after the origin of the non-LTR retrotransposons. Because of the younger age of the LTR retrotransposons, their complex structure, and the absence of any prokaryotic precursors, we propose that the LTR retrotransposons originated as a fusion between a DNA-mediated transposon and a non-LTR retrotransposon. The resulting two-step mechanism of LTR retrotransposition, in which RNA is reverse transcribed away from the chromosomal target site, rather than directly onto the target site, was probably an adaptation to the uncoupling of transcription and translation in eukaryotic cells.