CanPredict: a computational tool for predicting cancer-associated missense mutations

Open Access

8 May 2007

journal article
research article
Published by Oxford University Press (OUP) in Nucleic Acids Research

Vol. 35 (Web Server), W595-W598
https://doi.org/10.1093/nar/gkm405

Abstract

Various cancer genome projects are underway to identify novel mutations that drive tumorigenesis. While these screens will generate large data sets, the majority of identified missense changes are likely to be innocuous passenger mutations or polymorphisms. As a result, it has become increasingly important to develop computational methods for distinguishing functionally relevant mutations from other variations. We previously developed an algorithm, and now present the web application, CanPredict (http://www.canpredict.org/ or http://www.cgl.ucsf.edu/Research/genentech/canpredict/), to allow users to determine if particular changes are likely to be cancer-associated. The impact of each change is measured using two known methods: Sorting Intolerant From Tolerant (SIFT) and the Pfam-based LogR.E-value metric. A third method, the Gene Ontology Similarity Score (GOSS), provides an indication of how closely the gene in which the variant resides resembles other known cancer-causing genes. Scores from these three algorithms are analyzed by a random forest classifier which then predicts whether a change is likely to be cancer-associated. CanPredict fills an important need in cancer biology and will enable a large audience of biologists to determine which mutations are the most relevant for further study.

Keywords

This publication has 17 references indexed in Scilit:

MC1R Germline Variants Confer Risk for BRAF -Mutant Melanoma
Science, 2006
Somatic Mutations of the Protein Kinase Gene Family in Human Lung Cancer
Cancer Research, 2005
Mutations in a signalling pathway
Nature, 2005
A screen of the complete protein kinase gene family identifies diverse patterns of somatic mutations in human breast cancer
Nature Genetics, 2005
LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources
Bioinformatics, 2005
Prediction of the phenotypic effects of non-synonymous single nucleotide polymorphisms using structural and evolutionary information
Bioinformatics, 2005
Large-scale analysis of non-synonymous coding region single nucleotide polymorphisms
Bioinformatics, 2004
A comparative study of machine-learning methods to predict the effects of single nucleotide polymorphisms on protein function
Bioinformatics, 2003
Human non-synonymous SNPs: server and survey
Nucleic Acids Research, 2002
Variation is the spice of life
Nature Genetics, 2001

Cited by 154 articles