Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach
Open Access
- 18 December 2006
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 7 (S5), S13
- https://doi.org/10.1186/1471-2105-7-s5-s13
Abstract
Metal-binding proteins play important roles in structural stability, signaling, regulation, transport, immune response, metabolism control, and metal homeostasis. Because of their functional and sequence diversity, it is desirable to explore additional methods for predicting metal-binding proteins irrespective of sequence similarity. This work explores support vector machines (SVM) as such a method. SVM prediction systems were developed by using 53,333 metal-binding and 147,347 non-metal-binding proteins, and evaluated by an independent set of 31,448 metal-binding and 79,051 non-metal-binding proteins. The computed prediction accuracy is 86.3%, 81.6%, 83.5%, 94.0%, 81.2%, 85.4%, 77.6%, 90.4%, 90.9%, 74.9% and 78.1% for calcium-binding, cobalt-binding, copper-binding, iron-binding, magnesium-binding, manganese-binding, nickel-binding, potassium-binding, sodium-binding, zinc-binding, and all metal-binding proteins respectively. The accuracy for the non-member proteins of each class is 88.2%, 99.9%, 98.1%, 91.4%, 87.9%, 94.5%, 99.2%, 99.9%, 99.9%, 98.0%, and 88.0% respectively. Comparable accuracies were obtained by using a different SVM kernel function. Our method predicts 67% of the 87 metal-binding proteins non-homologous to any protein in the Swissprot database and 85.3% of the 333 proteins of known metal-binding domains as metal-binding. These suggest the usefulness of SVM for facilitating the prediction of metal-binding proteins. Our software can be accessed at the SVMProt server http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi.Keywords
This publication has 71 references indexed in Scilit:
- Structural Basis for Diversity of the EF-hand Calcium-binding ProteinsJournal of Molecular Biology, 2006
- Artificial di-iron proteins: solution characterization of four helix bundles containing two distinct types of inter-helical loopsJBIC Journal of Biological Inorganic Chemistry, 2005
- Metal Binding Sites in Proteins: Identification and Characterization by Paramagnetic NMR RelaxationBiochemistry, 2005
- RASE: recognition of alternatively spliced exons in C.elegansBioinformatics, 2005
- The DxDxDG Motif for Calcium Binding: Multiple Structural Contexts and Implications for EvolutionJournal of Molecular Biology, 2004
- Predicting Metal-binding Site Residues in Low-resolution Structural ModelsJournal of Molecular Biology, 2004
- Proteomic identification of divalent metal cation binding proteins in plant mitochondriaFEBS Letters, 2003
- Recent advances in RNA–protein recognitionCurrent Opinion in Structural Biology, 2001
- Structure-based redesign of the Catalytic/Metal binding site of Cfr 10I restriction endonuclease reveals importance of spatial rather than sequence conservation of active centre residuesJournal of Molecular Biology, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997