Sequence, structure, receptor-binding domains and internal repeats of human apolipoprotein B-100

Abstract
Apolipoprotein (apo) B-100, the major protein component in low density lipoprotein (LDL), is the ligand that binds to the LDL receptor1. It is important in the metabolism of LDL and elevated plasma levels of LDL-apo B are strongly associated with increased risk of coronary artery disease. Although apo B-100 is of great clinical and biological importance its primary structure has defied chemical elucidation, mainly because of its enormous size, insolubility, and tendency to aggregate2. Less than 5% of the apo B-100 sequence has been reported, despite the efforts of many laboratories over the past twenty years. Here we report the complete amino acid sequence of human apo B-100 as deducted by sequence analysis of complementary DNA clones; 2,366 of the 4,536 residues were also confirmed by direct sequencing of apo B-100 tryptic peptides. The distribution of trypsin-accessible and -inaccessible peptides of the protein on LDL is non-random and they can be grouped into 5 hypothetical domains. Of 20 potential N-glycosylation sites identified in the sequence, 13 were foundry direct peptide sequencing to be glycosylated, and 4 unglycosylated. Examination of the primary structure of apo B-100 reveals that it contains a large number of long (>70 residues) internal repeats and an even larger number of shorter ones, suggesting that the apo B-100 sequence was derived largely from internal duplications. Finally, using synthetic peptides of a specific region of apo B-100, we have identified a potential LDL receptor-binding domain (residues 3,345–3,381) which can bind to the LDL receptor and suppress 3-hydroxy-3-methyl-glutaryl coenzyme A (HMG-CoA) reductase activities in cultured human fibroblasts.