Multiple protein sequence alignment from tertiary structure comparison: Assignment of global and residue confidence levels
- 1 October 1992
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 14 (2), 309-323
- https://doi.org/10.1002/prot.340140216
Abstract
An algorithm is presented for the accurate and rapid generation of multiple protein sequence alignments from tertiary structure comparisons. A preliminary multiple sequence alignment is performed using sequence information, which then determines an initial superposition of the structures. A structure comparison algorithm is applied to all pairs of proteins in the superimposed set and a similarity tree calculated. Multiple sequence alignments are then generated by following the tree from the branches to the root. At each branchpoint of the tree, a structure‐based sequence alignment and coordinate transformations are output, with the multiple alignment of all structures output at the root The algorithm encoded in STAMP (Structural Alignment of Multiple Proteins) is shown to give alignments in good agreement with published structural accounts within the dehydrogenase fold domains, globins, and serine proteinases. In order to reduce the need for visual verification, two similarity indices are introduced to determine the quality of each generated structural alignment. Sc quantifies the global structural similarity between pairs or groups of proteins, whereas Pij′ provides a normalized measure of the confidence in the alignment of each residue. STAMP alignments have the quality of each alignment characterized by Sc and Pij′ values and thus provide a reproducible resource for studies of residue conservation within structural motifs.Keywords
This publication has 59 references indexed in Scilit:
- Knowledge-based prediction of protein structures and the design of novel moleculesNature, 1987
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Comparison of AMP and NADH binding to glycogen phosphorylase bJournal of Molecular Biology, 1983
- How different amino acid sequences determine similar protein structures: The structure and evolutionary dynamics of the globinsJournal of Molecular Biology, 1980
- Molecular structure of the α-lytic protease from Myxobacter 495 at 2·8 Å resolutionJournal of Molecular Biology, 1979
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977
- Structure of myoglobin refined at 2·0 Å resolutionJournal of Molecular Biology, 1977
- Chemical and biological evolution of a nucleotide-binding proteinNature, 1974
- Crystal structure analysis of sea lamprey hemoglobin at 2 Å resolutionJournal of Molecular Biology, 1973
- An improved method of testing for evolutionary homologyJournal of Molecular Biology, 1966