Mapping and characterization of the mouse and human SS18 genes, two human SS18-like genes and a mouse Ss18 pseudogene

Abstract
We have previously isolated and characterized a mouse cDNA orthologous to the human synovial sarcoma associated SS18 (formerly named SSXT and SYT) cDNA. Here, we report the characterization of the genomic structure of the mouse Ss18 gene. Through in silico methods with sequence information contained in the public databases, we did the same for the human SS18 gene and two human SS18 homologous genes, SS18L1 and SS18L2. In addition, we identified a mouse Ss18 processed pseudogene and mapped it to chromosome 1, band A2-3. The mouse Ss18 gene, which is subject to extensive alternative splicing, is made up of 11 exons, spread out over approximately 45 kb of genomic sequence. The human SS18 gene is also composed of 11 exons with similar intron-exon boundaries, spreading out over about 70 kb of genomic sequence. One alternatively spliced exon, which is not included in the published SS18 cDNA, corresponds to a stretch of sequence which we previously identified in the mouse Ss18 cDNA. The human SS18L1 gene, which is also made up of 11 exons with similar intron-exon boundaries, was mapped to chromosome 20 band q13.3. The smaller SS18L2 gene, which is composed of three exons with similar boundaries as the first three exons of the other three genes, was mapped to chromosome 3 band p21. Through sequence and mutation analyses this gene could be excluded as a candidate gene for 3p21-associated renal cell cancer. In addition, we created a detailed BAC map around the human SS18 gene, placing it unequivocally between the CA-repeat marker AFMc014wf9 and the dihydrofolate reductase pseudogene DHFRP1. The next gene in this map, located distal to SS18, was found to be the TBP associated factor TAFII-105 (TAF2C2). Further analogies between the mouse Ss18 gene, the human SS18 gene and its two homologous genes were found in the putative promoter fragments. All four promoters resemble the promoters of housekeeping genes in that they are TATA-less and embedded in canonical CpG islands, thus explaining the high and widespread expression of the SS18 genes.