Identification of single nucleotide polymorphisms in human DNA repair genes

Abstract
Variation in gene coding sequence represents a significant factor in predisposition to disease, including cancer. Variants of some DNA repair genes (e.g. MLH1, MSH2 and MSH6) are known to predispose to cancer. We identified single nucleotide polymorphisms (SNPs) in five DNA repair genes in 142 healthy individuals using a DNA sequencing protocol optimized for the direct detection of single nucleotide polymorphisms. This approach, called the heterozygote sequencing protocol (HSP), enables moderate-scale population surveys of SNPs. HSP uses fluorescently tagged primers and exploits the large dynamic range and low background of automated fluorescent sequencing. HSP may be used for any sequence that can be amplified by PCR. A total of 12 SNP variants in MGMT, ERCC1, CDK7, CCNH and XRCC4 were identified, 11 at polymorphic frequencies, with an average frequency of 0.22 (95% confidence interval 0.20–0.24). Among the 82 individuals for whom complete SNP profiles were available, no one person carried the GenBank reference sequence for all five genes. The extensive heterogeneity observed in these five genes is intriguing. All variants are in Hardy–Weinberg equilibrium, although the meaning of this equilibrium is unclear. Using this approach, possible associations of sequence variation, and hence of variation in DNA repair, with disease risk can be assessed.