Proteins of Unknown Function in the Protein Data Bank (PDB): An Inventory of True Uncharacterized Proteins and Computational Tools for Their Analysis
Open Access
- 7 October 2012
- journal article
- research article
- Published by MDPI AG in International Journal of Molecular Sciences
- Vol. 13 (10), 12761-12772
- https://doi.org/10.3390/ijms131012761
Abstract
Proteins of uncharacterized functions form a large part of many of the currently available biological databases and this situation exists even in the Protein Data Bank (PDB). Our analysis of recent PDB data revealed that only 42.53% of PDB entries (1084 coordinate files) that were categorized under “unknown function” are true examples of proteins of unknown function at this point in time. The remainder 1465 entries also annotated as such appear to be able to have their annotations re-assessed, based on the availability of direct functional characterization experiments for the protein itself, or for homologous sequences or structures thus enabling computational function inference.Keywords
This publication has 26 references indexed in Scilit:
- SPRITE and ASSAM: web servers for side chain 3D-motif searching in protein structuresNucleic Acids Research, 2012
- The Pfam protein families databaseNucleic Acids Research, 2011
- InterPro in 2011: new developments in the family and domain prediction databaseNucleic Acids Research, 2011
- PROSITE, a protein domain database for functional characterization and annotationNucleic Acids Research, 2009
- RASMOT-3D PRO: a 3D motif search webserverNucleic Acids Research, 2009
- Structure is three to ten times more conserved than sequence—A study of structural response in protein coresProteins-Structure Function and Bioinformatics, 2009
- The Pfam protein families databaseNucleic Acids Research, 2004
- The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003Nucleic Acids Research, 2003
- The Protein Data BankNucleic Acids Research, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997