LiveBench‐1: Continuous benchmarking of protein structure prediction servers
- 1 February 2001
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 10 (2), 352-361
- https://doi.org/10.1110/ps.40501
Abstract
We present a novel, continuous approach aimed at the large-scale assessment of the performance of available fold-recognition servers. Six popular servers were investigated: PDB-Blast, FFAS, T98-lib, GenTHREADER, 3D-PSSM, and INBGU. The assessment was conducted using as prediction targets a large number of selected protein structures released from October 1999 to April 2000. A target was selected if its sequence showed no significant similarity to any of the proteins previously available in the structural database. Overall, the servers were able to produce structurally similar models for one-half of the targets, but significantly accurate sequence-structure alignments were produced for only one-third of the targets. We further classified the targets into two sets: easy and hard. We found that all servers were able to find the correct answer for the vast majority of the easy targets if a structurally similar fold was present in the server's fold libraries. However, among the hard targets--where standard methods such as PSI-BLAST fail--the most sensitive fold-recognition servers were able to produce similar models for only 40% of the cases, half of which had a significantly accurate sequence-structure alignment. Among the hard targets, the presence of updated libraries appeared to be less critical for the ranking. An "ideally combined consensus" prediction, where the results of all servers are considered, would increase the percentage of correct assignments by 50%. Each server had a number of cases with a correct assignment, where the assignments of all the other servers were wrong. This emphasizes the benefits of considering more than one server in difficult prediction tasks. The LiveBench program (http://BioInfo.PL/LiveBench) is being continued, and all interested developers are cordially invited to join.Keywords
This publication has 24 references indexed in Scilit:
- CAFASP‐1: Critical assessment of fully automated structure prediction methodsProteins-Structure Function and Bioinformatics, 1999
- Beyond complete genomes: from sequence to structure and functionCurrent Opinion in Structural Biology, 1998
- Removing near-neighbour redundancy from large protein sequence collections.Bioinformatics, 1998
- Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationshipsProceedings of the National Academy of Sciences, 1998
- Assigning folds to the proteins encoded by the genome of Mycoplasma genitaliumProceedings of the National Academy of Sciences, 1997
- Do aligned sequences share the same fold?Journal of Molecular Biology, 1997
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Multiple sequence threading: an analysis of alignment quality and stabilityJournal of Molecular Biology, 1997
- Perspectives in protein-fold recognitionCurrent Opinion in Structural Biology, 1997
- Tertiary structural constraints on protein evolutionary diversity: templates, key residues and structure predictionProceedings Of The Royal Society B-Biological Sciences, 1990