Using RNA secondary structures to guide sequence motif finding towards single-stranded regions
Open Access
- 20 August 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (17), e117
- https://doi.org/10.1093/nar/gkl544
Abstract
RNA binding proteins recognize RNA targets in a sequence specific manner. Apart from the sequence, the secondary structure context of the binding site also affects the binding affinity. Binding sites are often located in single-stranded RNA regions and it was shown that the sequestration of a binding motif in a double-strand abolishes protein binding. Thus, it is desirable to include knowledge about RNA secondary structures when searching for the binding motif of a protein. We present the approach MEMERIS for searching sequence motifs in a set of RNA sequences and simultaneously integrating information about secondary structures. To abstract from specific structural elements, we precompute position-specific values measuring the single-strandedness of all substrings of an RNA sequence. These values are used as prior knowledge about the motif starts to guide the motif search. Extensive tests with artificial and biological data demonstrate that MEMERIS is able to identify motifs in single-stranded regions even if a stronger motif located in double-strand parts exists. The discovered motif occurrences in biological datasets mostly coincide with known protein-binding sites. This algorithm can be used for finding the binding motif of single-stranded RNA-binding proteins in SELEX or other biological sequence data.Keywords
This publication has 33 references indexed in Scilit:
- RNA sequence and secondary structure participate in high-affinity CsrA–RNA interactionRNA, 2005
- MARNA: multiple alignment and consensus structure prediction of RNAs based on sequence structure comparisonsBioinformatics, 2005
- mRNA Openers and Closers: Modulating AU‐Rich Element‐Controlled mRNA Stability by a Molecular Switch in mRNA Secondary StructureChemBioChem, 2004
- Structural Basis of Single-Stranded RNA RecognitionAccounts of Chemical Research, 2004
- RNA–protein interactionsCurrent Opinion in Structural Biology, 2002
- Specific HIV-1 TAR RNA Loop Sequence and Functional Groups Are Required for Human Cyclin T1−Tat−TAR Ternary Complex FormationBiochemistry, 2002
- RNA Destabilization by the Granulocyte Colony-Stimulating Factor Stem-Loop Destabilizing Element Involves a Single Stem-Loop That Promotes DeadenylationMolecular and Cellular Biology, 2002
- Optimized RNA Targets of Two Closely Related Triple KH Domain Proteins, Heterogeneous Nuclear Ribonucleoprotein K and αCP-2KL, Suggest Distinct Modes of RNA RecognitionJournal of Biological Chemistry, 2001
- Nucleolin is a Sequence-specific RNA-binding Protein: Characterization of Targets on Pre-ribosomal RNAJournal of Molecular Biology, 1996
- Detecting Subtle Sequence Signals: a Gibbs Sampling Strategy for Multiple AlignmentScience, 1993