Conservation of intron position indicates separation of major and variant H2As is an early event in the evolution of eukaryotes

Abstract
Genomic clones ofDrosophila andTetrahymena histone H2A variants were isolated using the corresponding cDNA clones, (van Daal et al. 1988; White et al. 1988). The site corresponding to the initiation of transcription was defined by primer extension for bothDrosophila andTetrahymena genomic sequences. The sequences of the genomic clones revealed the presence of introns in each of the genes. TheDrosophila gene has three introns: one immediately following the initiation codon, one between amino acids 26 and 27 (gln and phe), and one between amino acids 64 and 65 (glu and val). TheTetrahymena gene has two introns, the positions of which are identical to the first two introns of theDrosophila gene. The chicken H2A.F variant gene has been recently sequenced and it contains four introns (Dalton et al. 1989). The first three of these are in the same positions as the introns in theDrosophila gene. The fourth intron interrupts amino acid 108 (gly). In all cases the sizes and the sequences of the introns are divergent. However, the fact that they are in conserved positions suggests that at least two of the introns were present in the ancestral gene. A phylogenetic tree constructed from the sequences of the variant and major cell cycle-regulated histone H2A proteins from several species indicates that the H2A variant proteins are evolutionarily separate and distinct from the major cell cycle-regulated histone H2A proteins. The ancestral H2A gene must have duplicated and diverged before fungi and ciliates diverged from the rest of the eukaryote lineage. In addition, it appears that the variant histone H2A proteins analyzed here are more conserved than the major histone H2A proteins.