An estimate of unique DNA sequence heterozygosity in the human genome

Abstract
Fifteen different restriction fragment length polymorphisms (RFLPs) were detected in the human genome using 19 cloned DNA segments, derived from flow-sorted metaphase chromosomes or total genomic DNA, as hybridization probes. Because these clones were selected at random with respect to their coding potential, their analysis permitted an unbiased estimate of single-copy DNA sequence heterozygosity in the human genome. Since our estimate (h = 0.0037) is an order of magnitude higher than previous estimates derived from protein data, most of the polymorphic variation present in the genome must occur in non-coding sequences. In addition, it was confirmed that restriction enzymes whose recognition sequences contain the dinucleotide CpG detect more polymorphic variation than those whose recognition sequences do not.