Genomic structure and evolution of the human pepsinogen A multigene family

Abstract
A human cosmid library was screened with a pepsinogen A (PGA) cDNA probe, yielding 18 clones with (parts of) one, two or three PGA genes. By aligning these cosmids a restriction map of a PGA gene quadruplet was obtained in which the four genes are arranged in a highly ordered fashion in a head-to-tail orientation. Using the length in kilobases of the large polymorphic EcoRI fragment of the PGA genes, this quadruplet can be described as 15.0-12.0-12.0-16.6. An AvaII polymorphism allowed us to identify the two PGA haplotypes of the individual whose DNA had been cloned in the cosmid library to be a gene triplet and a gene quadruplet. By comparing the restriction maps of the central 12.0 genes in these multiplets to those of the flanking 15.0 and 16.6 genes, we postulate that these central genes arose from unequal but homologous crossing over between two 15.0–16.6 gene pairs. This hypothesis provides for the creation of a variety of haplotypes by additional cross overs and mutations. Southern blots of family and population material supports the existance of at least five common PGA haplotypes, including a single-gene haplotype, giving rise to a large number of different EcoRI patterns. The single PGA gene is probably the reciprocal crossing over product. Comparison between the DNA and protein polymorphisms suggests further micro-heterogeneity in the different PGA haplotypes.