Ordered Partitioning Reveals Extended Splice-Site Consensus Information
Open Access
- 5 January 2004
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 14 (1), 67-78
- https://doi.org/10.1101/gr.1715204
Abstract
Using recently available cDNA and genomic data (Berkeley Drosophila Genome Project; http://www.fruitfly.org), we computed a large sample of 10,057 Drosophila splice sites. An information-theoretic analysis of the nucleotide sequences adjacent to these splice sites showed a strong correlation between the sizes of introns and exons and the levels of information, which is a measure of sequence conservation. The strong correlation permitted us to determine extensive consensus sequences at the donor and acceptor sites of longer introns. These sequences were further refined and extended by examining the information in regions around splice sites that only partially matched the consensus. The correlation between length and information provided the basis for determining alternative consensus arrangements associated with shorter introns, as well as general base-composition preferences that likely promote spliceosome function. We also observed a correlation between information near splice sites and the lengths of nonadjacent introns, indicating that there are long-range effects spanning multiple introns. The ordered partitioning approach used in this analysis may become increasingly useful as large genomic data sets become available.Keywords
This publication has 40 references indexed in Scilit:
- An Upstream AG Determines Whether a Downstream AG Is Selected during Catalytic Step II of SplicingMolecular and Cellular Biology, 2001
- Localization of Sequences Required for Size-specific Splicing of a SmallDrosophilaIntronin VitroJournal of Molecular Biology, 1995
- Conserved Sequences in a Class of Rare Eukaryotic Nuclear Introns with Non-consensus Splice SitesJournal of Molecular Biology, 1994
- Features of spliceosome evolution and function inferred from an analysis of the information at human splice sitesJournal of Molecular Biology, 1992
- U5 snRNA interacts with exon sequences at 5′ and 3′ splice sitesCell, 1992
- Unexpected point mutations activate cryptic 3' splice sites by perturbing a natural secondary structure within a yeast intron.Genes & Development, 1991
- 5′ cleavage site in eukaryotic pre-mRNA splicing is determined by the overall 5′ splice region, not by the conserved 5′ GUCell, 1987
- Information content of binding sites on nucleotide sequencesJournal of Molecular Biology, 1986
- Mutations in a yeast intron demonstrate the importance of specific conserved nucleotides for the two stages of nuclear mRNA splicingCell, 1986
- A point mutation in the conserved hexanucleotide at a yeast 5′ splice junction uncouples recognition, cleavage, and ligationCell, 1985