PROMALS3D: a tool for multiple protein sequence and structure alignments
Open Access
- 20 February 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (7), 2295-2300
- https://doi.org/10.1093/nar/gkn072
Abstract
Although multiple sequence alignments (MSAs) are essential for a wide range of applications from structure modeling to prediction of functional sites, construction of accurate MSAs for distantly related proteins remains a largely unsolved problem. The rapidly increasing database of spatial structures is a valuable source to improve alignment quality. We explore the use of 3D structural information to guide sequence alignments constructed by our MSA program PROMALS. The resulting tool, PROMALS3D, automatically identifies homologs with known 3D structures for the input sequences, derives structural constraints through structure-based alignments and combines them with sequence constraints to construct consistency-based multiple sequence alignments. The output is a consensus alignment that brings together sequence and structural information about input proteins and their homologs. PROMALS3D can also align sequences of multiple input structures, with the output representing a multiple structure-based alignment refined in combination with sequence constraints. The advantage of PROMALS3D is that it gives researchers an easy way to produce high-quality alignments consistent with both sequences and structures of proteins. PROMALS3D outperforms a number of existing methods for constructing multiple sequence or structural alignments using both reference-dependent and reference-independent evaluation methods.
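The idea of merging sequence and structural constraints before a consistency step can be illustrated with a small sketch. This is not the PROMALS3D implementation: the function names, the `struct_weight` parameter, and the dictionary layout are all assumptions made for illustration, and the consistency step shown is a simple triplet extension in the style of T-Coffee rather than whatever scoring PROMALS3D actually uses. A residue is written as `(sequence_name, index)`, and each constraint key is a pair of residues in sorted order.

```python
from collections import defaultdict
from itertools import combinations


def combine_constraints(seq_pairs, struct_pairs, struct_weight=2.0):
    """Merge two residue-pair constraint sets into one weighted library.

    struct_weight is a hypothetical knob that lets structure-derived
    pairs count more than sequence-derived ones.
    """
    library = defaultdict(float)
    for pair, score in seq_pairs.items():
        library[pair] += score
    for pair, score in struct_pairs.items():
        library[pair] += struct_weight * score
    return dict(library)


def extend_by_consistency(library):
    """Triplet extension: if residue x pairs with both y and z, add
    min(w_xy, w_xz) as support for pairing y with z."""
    neighbors = defaultdict(dict)
    for (x, y), w in library.items():
        neighbors[x][y] = w
        neighbors[y][x] = w
    extended = defaultdict(float, library)
    for linked in neighbors.values():
        for y, z in combinations(sorted(linked), 2):
            if y[0] != z[0]:  # only pair residues from different sequences
                extended[(y, z)] += min(linked[y], linked[z])
    return dict(extended)


# Toy input: one sequence-derived pair and two structure-derived pairs.
seq_pairs = {(("A", 0), ("B", 0)): 1.0}
struct_pairs = {(("A", 0), ("B", 0)): 0.5, (("B", 0), ("C", 1)): 1.0}

library = combine_constraints(seq_pairs, struct_pairs)
extended = extend_by_consistency(library)
```

In the toy input, residue 0 of B is constrained to both residue 0 of A and residue 1 of C, so the extension adds support for pairing A and C directly even though no input constraint links them.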
This publication has 32 references indexed in Scilit:
- Recent Evolutions of Multiple Sequence Alignment Algorithms. PLoS Computational Biology, 2007
- MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information. Nucleic Acids Research, 2006
- Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee. Nucleic Acids Research, 2006
- MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Research, 2005
- FAST: A novel protein structure alignment algorithm. Proteins-Structure Function and Bioinformatics, 2004
- 3DCoffee: Combining Protein Sequences and Structures within Multiple Sequence Alignments. Journal of Molecular Biology, 2004
- MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research, 2004
- T-coffee: a novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research, 1994