Functional annotations improve the predictive score of human disease-related mutations in proteins
Open Access
- 12 May 2009
- journal article
- research article
- Published by Hindawi Limited in Human Mutation
- Vol. 30 (8), 1237-1244
- https://doi.org/10.1002/humu.21047
Abstract
Single nucleotide polymorphisms (SNPs) are the simplest and most frequent form of human DNA variation, also valuable as genetic markers of disease susceptibility. The most investigated SNPs are missense mutations resulting in residue substitutions in the protein. Here we propose SNPs&GO, an accurate method that, starting from a protein sequence, can predict whether a mutation is disease related or not by exploiting the protein functional annotation. The scoring efficiency of SNPs&GO is as high as 82%, with a Matthews correlation coefficient equal to 0.63 over a wide set of annotated nonsynonymous mutations in proteins, including 16,330 disease‐related and 17,432 neutral polymorphisms. SNPs&GO collects in unique framework information derived from protein sequence, evolutionary information, and function as encoded in the Gene Ontology terms, and outperforms other available predictive methods. Hum Mutat 30:1–8, 2009.Keywords
This publication has 53 references indexed in Scilit:
- The Human Variome ProjectScience, 2008
- In silico analysis of missense substitutions using sequence-alignment based methodsHuman Mutation, 2008
- Use of estimated evolutionary strength at the codon level improves the prediction of disease-related protein mutations in humansHuman Mutation, 2007
- The human disease networkProceedings of the National Academy of Sciences, 2007
- CanPredict: a computational tool for predicting cancer-associated missense mutationsNucleic Acids Research, 2007
- SNAP: predict effect of non-synonymous polymorphisms on functionNucleic Acids Research, 2007
- Sequence‐based prediction of pathological mutationsProteins-Structure Function and Bioinformatics, 2004
- The Swiss-Prot variant page and the ModSNP database: A resource for sequence and structure information on human protein variantsHuman Mutation, 2004
- The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003Nucleic Acids Research, 2003
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997