Amino acid sequence of human histidine-rich glycoprotein derived from the nucleotide sequence of its cDNA

Abstract
A .lambda.Gt11 library containing cDNA inserts prepared from human liver mRNA has been screened with an affinity-purified antibody to human histidine-rich glycoprotein (HRG) and then with a restriction fragment isolated from the 5'' end of the largest cDNA insert obtained by antibody screening. A number of positive clones were identified and shown to code for HRG by DNA sequence analysis. A total of 2067 nucleotides were determined by sequencing 3 overlapping cDNA clones, which included 121 nucleotides of 5''-noncoding sequence, 54 nucleotides coding for a leader sequence of 18 amino acids, 1521 nucleotides coding for the mature protein of 507 amino acids, a stop codon of TAA, and 352 nucleotides of 3''-noncoding sequence followed by a poly(A) tail of 16 nucleotides. The length of the noncoding sequence of the 3'' end differed in several clones, but each contained a polyadenylylation or processing sequence of AATAAA followed by a poly(A) tail. More than half of the amino acid sequence of HRG consisted of five different types of internal repeats. Within the last 3 internal repeats (type V), there were 12 tandem repetitions of a 5 amino acid segment with a consensus sequence of Gly-His-His-Pro-His. This repeated portion, referred to as a "histidine-rich region", contained 53% histidine and showed a high degree of similarity to a histidine-rich region of high molecular weight kininogen.