Flexible algorithm for direct multiple alignment of protein structures and sequences
- 1 December 1994
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 10 (6), 587-596
- https://doi.org/10.1093/bioinformatics/10.6.587
Abstract
The recently described equivalence between the alignment of two proteins and a conformation of a lattice chain on a two-dimensional square lattice is extended to multiple alignments. The search for the optimal multiple alignment between several proteins, which is equivalent to finding the energy minimum in the conformational space of a multi dimensional lattice chain, is studied by the Monte Carlo approach. This method, while not deterministic, andfor two- dimensional problems slower than dynamic programming, can accept arbitrary scoring functions, including non-local ones, and its speed decreases slowly with increasing number of dimensions. For the local scoring functions, the MC algorithm can also reproduce known exact solutions for the direct multiple alignments. As illustrated by examples, both for structure- and sequence-based alignments, direct multi dimensional alignments are able to capture weak similarities between divergent families much better than ones built from pairwise alignments by a hierarchical approach.Keywords
This publication has 6 references indexed in Scilit:
- A data bank merging related protein structures and sequencesProtein Engineering, Design and Selection, 1992
- β-Trefoil foldJournal of Molecular Biology, 1992
- From comparisons of protein sequences and structures to protein modelling and designTrends in Biochemical Sciences, 1990
- Definition of general topological equivalence in protein structuresJournal of Molecular Biology, 1990
- Protein structure alignmentJournal of Molecular Biology, 1989
- A multiple sequence alignment algorithm for homologous proteins using secondary structure information and optionally keying alignments to functionally important sitesBioinformatics, 1989