Crystal structure of human thymine DNA glycosylase bound to DNA elucidates sequence-specific mismatch recognition

Abstract
Cytosine methylation at CpG dinucleotides produces m5CpG, an epigenetic modification that is important for transcriptional regulation and genomic stability in vertebrate cells. However, m5C deamination yields mutagenic G·T mispairs, which are implicated in genetic disease, cancer, and aging. Human thymine DNA glycosylase (hTDG) removes T from G·T mispairs, producing an abasic (or AP) site, and follow-on base excision repair proteins restore the G·C pair. hTDG is inactive against normal A·T pairs, and is most effective for G·T mispairs and other damage located in a CpG context. The molecular basis of these important catalytic properties has remained unknown. Here, we report a crystal structure of hTDG (catalytic domain, hTDGcat) in complex with abasic DNA, at 2.8 Å resolution. Surprisingly, the enzyme crystallized in a 2:1 complex with DNA, one subunit bound at the abasic site, as anticipated, and the other at an undamaged (nonspecific) site. Isothermal titration calorimetry and electrophoretic mobility-shift experiments indicate that hTDG and hTDGcat can bind abasic DNA with 1:1 or 2:1 stoichiometry. Kinetics experiments show that the 1:1 complex is sufficient for full catalytic (base excision) activity, suggesting that the 2:1 complex, if adopted in vivo, might be important for some other activity of hTDG, perhaps binding interactions with other proteins. Our structure reveals interactions that promote the stringent specificity for guanine versus adenine as the pairing partner of the target base and interactions that likely confer CpG sequence specificity. We find striking differences between hTDG and its prokaryotic ortholog (MUG), despite the relatively high (32%) sequence identity.