Comprehensive splice-site analysis using comparative genomics
Open Access
- 12 August 2006
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (14), 3955-3967
- https://doi.org/10.1093/nar/gkl556
Abstract
We have collected over half a million splice sites from five species-Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana-and classified them into four subtypes: U2-type GT-AG and GC-AG and U12-type GT-AG and AT-AC. We have also found new examples of rare splice-site categories, such as U12-type introns without canonical borders, and U2-dependent AT-AC introns. The splice-site sequences and several tools to explore them are available on a public website (SpliceRack). For the U12-type introns, we find several features conserved across species, as well as a clustering of these introns on genes. Using the information content of the splice-site motifs, and the phylogenetic distance between them, we identify: (i) a higher degree of conservation in the exonic portion of the U2-type splice sites in more complex organisms; (ii) conservation of exonic nucleotides for U12-type splice sites; (iii) divergent evolution of C.elegans 3' splice sites (3'ss) and (iv) distinct evolutionary histories of 5' and 3'ss. Our study proves that the identification of broad patterns in naturally-occurring splice sites, through the analysis of genomic datasets, provides mechanistic and evolutionary insights into pre-mRNA splicing.Keywords
This publication has 73 references indexed in Scilit:
- Understanding alternative splicing: towards a cellular codeNature Reviews Molecular Cell Biology, 2005
- NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteinsNucleic Acids Research, 2004
- How did alternative splicing evolve?Nature Reviews Genetics, 2004
- Maximum Entropy Modeling of Short Sequence Motifs with Applications to RNA Splicing SignalsJournal of Computational Biology, 2004
- Splicing double: insights from the second spliceosomeNature Reviews Molecular Cell Biology, 2003
- The splicing of U12-type introns can be a rate-limiting step in gene expressionThe EMBO Journal, 2002
- Alternative Splicing of the Adenylyl Cyclase Stimulatory G-protein Gαs Is Regulated by SF2/ASF and Heterogeneous Nuclear Ribonucleoprotein A1 (hnRNPA1) and Involves the Use of an Unusual TG 3′-Splice SiteJournal of Biological Chemistry, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- Features of spliceosome evolution and function inferred from an analysis of the information at human splice sitesJournal of Molecular Biology, 1992
- Sequence logos: a new way to display consensus sequencesNucleic Acids Research, 1990