Prediction of protein B‐factor profiles
- 11 January 2005
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 58 (4), 905-912
- https://doi.org/10.1002/prot.20375
Abstract
The polypeptide backbones and side chains of proteins are constantly moving due to thermal motion and the kinetic energy of the atoms. The B‐factors of protein crystal structures reflect the fluctuation of atoms about their average positions and provide important information about protein dynamics. Computational approaches to predict thermal motion are useful for analyzing the dynamic properties of proteins with unknown structures. In this article, we utilize a novel support vector regression (SVR) approach to predict the B‐factor distribution (B‐factor profile) of a protein from its sequence. We explore schemes for encoding sequences and various settings for the parameters used in SVR. Based on a large dataset of high‐resolution proteins, our method predicts the B‐factor distribution with a Pearson correlation coefficient (CC) of 0.53. In addition, our method predicts the B‐factor profile with a CC of at least 0.56 for more than half of the proteins. Our method also performs well for classifying residues (rigid vs. flexible). For almost all predicted B‐factor thresholds, prediction accuracies (percent of correctly predicted residues) are greater than 70%. These results exceed the best results of other sequence‐based prediction methods. Proteins 2005.Keywords
This publication has 42 references indexed in Scilit:
- Regression Approaches for Microarray Data AnalysisJournal of Computational Biology, 2003
- Predicting intrinsic disorder from amino acid sequenceProteins-Structure Function and Bioinformatics, 2003
- Improved amino acid flexibility parametersProtein Science, 2003
- The Protein Data BankNucleic Acids Research, 2000
- Protein secondary structure prediction based on position-specific scoring matrices 1 1Edited by G. Von HeijneJournal of Molecular Biology, 1999
- Vibrational Dynamics of Folded Proteins: Significance of Slow and Fast Motions in Relation to Function and StabilityPhysical Review Letters, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Prediction of chain flexibility in proteinsThe Science of Nature, 1985
- Dynamic information from protein crystallographyJournal of Molecular Biology, 1979
- On the rigid-body motion of molecules in crystalsActa Crystallographica Section B: Structural Science, Crystal Engineering and Materials, 1968