Structure of the human retinoblastoma gene.

Abstract
Complete inactivation of the human retinoblastoma gene (RB) is believed to be an essential step in tumorigenesis of several different cancers. To provide a framework for understanding inactivation mechanisms, the structure of RB was delineated. The RB transcript is encoded in 27 exons dispersed over about 2000 kilobases (kb) of genomic DNA. The length of individual exons ranges from 31 to 1889 base pairs (bp). The largest intron spans > 60 kb and the smallest one has only 80 bp. Deletion of exons 13-17 is frequently observed in various types of tumors, including retinoblastoma, breast cancer, and osteosarcoma, and the presence of a potential "hot spot" for recombination in the region is predicted. A putative "leucine-zipper" motif is exclusively encoded by exon 20. The detailed RB structure presented here should prove useful in defining potential functional domains of its encoded protein. Transcription of RB is initiated at multiple positions and the sequences surrounding the initiation sites have a high G + C content. A typical upstream TATA box is not present. Localization of the RB promoter region was accomplished by utilizing a heterologous expression system containing a bacterial chloramphenicol acetyltransferase gene. Deletion analysis revealed that a region as small as 70 bp is sufficient for RB promoter activity, similar to other previously characterized G + C-rich gene promoters. Several direct repeats and possible stem-and-loop structures are found in the promoter region. No enhancer element was detected within the 7.3 kb of upstream sequence studied. Several features of the RB promoter are reminiscent of the characteristics associated with many "housekeeping" genes, consistent with its ubiquitous expression pattern.