R-Coffee: a method for multiple alignment of non-coding RNA
Open Access
- 16 March 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (9), e52
- https://doi.org/10.1093/nar/gkn174
Abstract
R-Coffee is a multiple RNA alignment package, derived from T-Coffee, designed to align RNA sequences while exploiting secondary structure information. R-Coffee uses an alignment-scoring scheme that incorporates secondary structure information within the alignment. It works particularly well as an alignment improver and can be combined with any existing sequence alignment method. In this work, we used R-Coffee to compute multiple sequence alignments combining the pairwise output of sequence aligners and structural aligners. We show that R-Coffee can improve the accuracy of all the sequence aligners. We also show that the consistency-based component of T-Coffee can improve the accuracy of several structural aligners. R-Coffee was tested on 388 BRAliBase reference datasets and on 11 longer CMfinder datasets. Altogether, our results suggest that the best protocol for aligning short sequences (less than 200 nt) is the combination of R-Coffee with the RNA pairwise structural aligner Consan. We also show that the simultaneous combination of the four best sequence alignment programs with R-Coffee produces alignments almost as accurate as those obtained with R-Coffee/Consan. Finally, we show that R-Coffee can also be used to align longer datasets beyond the usual scope of structural aligners. R-Coffee is freely available for download, along with documentation, from the T-Coffee web site (www.tcoffee.org).
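The abstract refers to the consistency-based component of T-Coffee, which pools pairwise alignments into a library of weighted residue pairs and then re-weights each pair according to how well it is supported through third sequences. The following is a minimal Python sketch of that general idea only. It is not R-Coffee's scoring scheme, which additionally incorporates secondary structure information and is not specified in the abstract. The function name `extend_library`, the `(sequence_name, position)` residue representation, and the example weights are all hypothetical and chosen for illustration.

```python
from collections import defaultdict
from itertools import combinations


def extend_library(primary):
    """Re-weight residue pairs by triplet consistency (illustrative sketch).

    primary maps a pair of residues to a weight, each pair listed once.
    A residue is a (sequence_name, position) tuple (assumed representation).
    Returns a dict keyed by the pair in sorted order.
    """
    # Symmetric adjacency: residue -> {aligned partner residue: weight}
    adj = defaultdict(dict)
    for (a, b), w in primary.items():
        adj[a][b] = w
        adj[b][a] = w

    extended = defaultdict(float)

    # Direct support: the weight observed in the pairwise alignments.
    for (a, b), w in primary.items():
        key = (a, b) if a <= b else (b, a)
        extended[key] += w

    # Indirect support: if a and b are both aligned to a residue c,
    # the pair (a, b) gains the weaker of the two supporting weights.
    for c, partners in adj.items():
        for a, b in combinations(sorted(partners), 2):
            if a[0] == b[0]:
                continue  # two residues of the same sequence are never paired
            extended[(a, b)] += min(partners[a], partners[b])

    return dict(extended)


# Hypothetical library built from three pairwise alignments of A, B and C.
example_library = {
    (("A", 0), ("B", 0)): 80.0,
    (("A", 0), ("C", 0)): 60.0,
    (("B", 0), ("C", 0)): 90.0,
    (("A", 1), ("C", 1)): 70.0,
    (("B", 2), ("C", 1)): 50.0,
}
example_extended = extend_library(example_library)
```

In this toy library, the pair A0/B0 keeps its direct weight and gains support through C0, while A1/B2, which no pairwise alignment contains directly, receives a weight purely from its shared partner C1. A progressive aligner could then use such extended weights in place of a substitution matrix.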