The ConSurf-DB: pre-calculated evolutionary conservation profiles of protein structures
Open Access
- 29 October 2008
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (suppl_1), D323-D327
- https://doi.org/10.1093/nar/gkn822
Abstract
ConSurf-DB is a repository for evolutionary conservation analysis of the proteins of known structures in the Protein Data Bank (PDB). Sequence homologues of each of the PDB entries were collected and aligned using standard methods. The evolutionary conservation of each amino acid position in the alignment was calculated using the Rate4Site algorithm, implemented in the ConSurf web server. The algorithm takes into account the phylogenetic relations between the aligned proteins and the stochastic nature of the evolutionary process explicitly. Rate4Site assigns a conservation level for each position in the multiple sequence alignment using an empirical Bayesian inference. Visual inspection of the conservation patterns on the 3D structure often enables the identification of key residues that comprise the functionally important regions of the protein. The repository is updated with the latest PDB entries on a monthly basis and will be rebuilt annually. ConSurf-DB is available online at http://consurfdb.tau.ac.il/Keywords
This publication has 23 references indexed in Scilit:
- siteFiNDER 3D: a web-based tool for predicting the location of functional sites in proteinsNucleic Acids Research, 2007
- HotPatch: A Statistical A pproach to Finding Biologically Relevant Features on Protein SurfacesJournal of Molecular Biology, 2007
- ET viewer: an application for predicting and visualizing functional sites in protein structuresBioinformatics, 2006
- ConSurf 2005: the projection of evolutionary conservation scores of residues on protein structuresNucleic Acids Research, 2005
- The ConSurf‐HSSP database: The mapping of evolutionary conservation among homologs onto PDB structuresProteins-Structure Function and Bioinformatics, 2004
- Comparison of Site-Specific Rate-Inference Methods for Protein Sequences: Empirical Bayesian Methods Are SuperiorMolecular Biology and Evolution, 2004
- Structural clusters of evolutionary trace residues are statistically significant and common in proteinsJournal of Molecular Biology, 2002
- ConSurf: an algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic informationJournal of Molecular Biology, 2001
- The HSSP database of protein structure-sequence alignments and family profilesNucleic Acids Research, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997