Prediction of functional sites by analysis of sequence and structure conservation
- 1 April 2004
- journal article
- Published by Wiley in Protein Science
- Vol. 13 (4), 884-892
- https://doi.org/10.1110/ps.03465504
Abstract
We present a method for prediction of functional sites in a set of aligned protein sequences. The method selects sites which are both well conserved and clustered together in space, as inferred from the 3D structures of proteins included in the alignment. We tested the method using 86 alignments from the NCBI CDD database, where the sites of experimentally determined ligand and/or macromolecular interactions are annotated. In agreement with earlier investigations, we found that functional site predictions are most successful when overall background sequence conservation is low, such that sites under evolutionary constraint become apparent. In addition, we found that averaging of conservation values across spatially clustered sites improves predictions under certain conditions: that is, when overall conservation is relatively high and when the site in question involves a large macromolecular binding interface. Under these conditions it is better to look for clusters of conserved sites than to look for particular conserved sites.Keywords
This publication has 34 references indexed in Scilit:
- Automatic Methods for Predicting Functionally Important ResiduesJournal of Molecular Biology, 2003
- Structural Characterisation and Functional Significance of Transient Protein–Protein InteractionsJournal of Molecular Biology, 2003
- Protein–DNA Interactions: Amino Acid Conservation and the Effects of Mutations on Binding SpecificityJournal of Molecular Biology, 2002
- Prediction of functionally important residues based solely on the computed energetics of protein structure 1 1Edited by B. HonigJournal of Molecular Biology, 2001
- Three-dimensional cluster analysis identifies interfaces and functional residue clusters in proteins11Edited by J. ThorntonJournal of Molecular Biology, 2001
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Prediction of protein-protein interaction sites using patch analysis 1 1Edited by G. von HeijneJournal of Molecular Biology, 1997
- An Evolutionary Trace Method Defines Binding Surfaces Common to Protein FamiliesJournal of Molecular Biology, 1996
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983