Bioinformatic analysis of exon repetition, exon scrambling and trans-splicing in humans
Open Access
- 24 November 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 22 (6), 692-698
- https://doi.org/10.1093/bioinformatics/bti795
Abstract
Motivation: Using bioinformatic approaches we aimed to characterize poorly understood abnormalities in splicing known as exon scrambling, exon repetition and trans-splicing. Results: We developed a software package that allows large-scale comparison of all human expressed sequence tags (EST) sequences to the entire set of human gene sequences. Among 5 992 495 EST sequences, 401 cases of exon repetition and 416 cases of exon scrambling were found. The vast majority of identified ESTs contain fragments rather than full-length repeated or scrambled exons. Their structures suggest that the scrambled or repeated exon fragments may have arisen in the process of cDNA cloning and not from splicing abnormalities. Nevertheless, we found 11 cases of full-length exon repetition showing that this phenomenon is real yet very rare. In searching for examples of trans-splicing, we looked only at reproducible events where at least two independent ESTs represent the same putative trans-splicing event. We found 15 ESTs representing five types of putative trans-splicing. However, all 15 cases were derived from human malignant tissues and could have resulted from genomic rearrangements. Our results provide support for a very rare but physiological occurrence of exon repetition, but suggest that apparent exon scrambling and trans-splicing result, respectively, from in vitro artifact and gene-level abnormalities. Availability: Exon–Intron Database (EID) is available at . Programs are available at . The Laboratory website is available at Contact:afedorov@meduohio.edu Supplementary information: Supplementary file is available atKeywords
This publication has 35 references indexed in Scilit:
- Shotgun sequence assembly and recent segmental duplications within the human genomeNature, 2004
- Genomic organization of the mouse Msh4 gene producing bicistronic, chimeric and antisense mRNAGene, 2004
- Natural Trans-spliced mRNAs Are Generated from the Human Estrogen Receptor-α (hERα) GeneJournal of Biological Chemistry, 2002
- An extensive network of coupling among gene expression machinesNature, 2002
- Origin of alternative splicing by tandem exon duplicationHuman Molecular Genetics, 2001
- Creation of genome-wide protein expression libraries using random activation of gene expressionNature Biotechnology, 2001
- Heterogeneous Sp1 mRNAs in Human HepG2 Cells Include a Product of Homotypic trans-SplicingJournal of Biological Chemistry, 2000
- The Human CYP2C Locus: A Prototype for Intergenic and Exon Repetition Splicing EventsGenomics, 2000
- ThePISSLREGene: Structure, Exon Skipping, and Exclusion as Tumor Suppressor in Breast CancerGenomics, 1999
- dbEST — database for “expressed sequence tags”Nature Genetics, 1993