Human muscle carbonic anhydrase: gene structure and DNA methylation patterns in fetal and adult tissues.

Abstract
We report the isolation and analysis of genomic clones comprising the entire gene coding for the human muscle carbonic anhydrase, CAIII. The gene spans 10.3 kb and has a seven-exon/six-intron structure. A noncanonical TATA box, a CCAAT motif, and two CCGCCC elements are present in the sequences upstream of exon 1. Although the expression of CAIII shows strict tissue specificity, the gene exhibits a number of features normally associated with housekeeping enzymes. For example, there is 48% homology with a 25-bp consensus sequence between the TATA box and the cap site and there is a CpG-rich island spanning a 469-bp sequence near to the origin of transcription. Methylation studies suggest that some CCGG sites clustered in the CpG-rich island are undermethylated in DNA from fetal and adult muscle and in other tissues irrespective of CAIII expression. In contrast, several nonclustered CCGG sites show a methylation pattern that correlates with gene expression. However DNA from differentiated type II adult muscle fibers is undermethylated at these sites even though CAIII is not expressed.