CanPredict: a computational tool for predicting cancer-associated missense mutations
Open Access
- 8 May 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (Web Server), W595-W598
- https://doi.org/10.1093/nar/gkm405
Abstract
Various cancer genome projects are underway to identify novel mutations that drive tumorigenesis. While these screens will generate large data sets, the majority of identified missense changes are likely to be innocuous passenger mutations or polymorphisms. As a result, it has become increasingly important to develop computational methods for distinguishing functionally relevant mutations from other variations. We previously developed an algorithm, and now present the web application, CanPredict (http://www.canpredict.org/ or http://www.cgl.ucsf.edu/Research/genentech/canpredict/), to allow users to determine if particular changes are likely to be cancer-associated. The impact of each change is measured using two known methods: Sorting Intolerant From Tolerant (SIFT) and the Pfam-based LogR.E-value metric. A third method, the Gene Ontology Similarity Score (GOSS), provides an indication of how closely the gene in which the variant resides resembles other known cancer-causing genes. Scores from these three algorithms are analyzed by a random forest classifier which then predicts whether a change is likely to be cancer-associated. CanPredict fills an important need in cancer biology and will enable a large audience of biologists to determine which mutations are the most relevant for further study.
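The way three per-variant scores can feed a random forest may be easier to see in a small sketch. The code below is not the CanPredict implementation: it is a toy forest of depth-one trees (stumps) built with bootstrap sampling and a randomly chosen feature per tree, written with the Python standard library only. The feature order (sift, logre, goss), all function names, the score ranges, and the assumed direction of each score (low SIFT, high LogR.E-value, and high GOSS in the cancer-associated class) are illustrative assumptions, and the training data are synthetic.

```python
import random

# Hypothetical feature order for each variant: (sift, logre, goss).
FEATURES = ("sift", "logre", "goss")

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def majority(labels):
    """Most common 0/1 label; ties and empty lists resolve to 0."""
    return 1 if 2 * sum(labels) > len(labels) else 0

def fit_stump(rows, labels, feature):
    """Depth-one tree: the cut on one feature with the lowest weighted Gini."""
    values = sorted({row[feature] for row in rows})
    # Fallback when the feature has a single value: predict the overall majority.
    best = (float("inf"), None, majority(labels), majority(labels))
    for lo, hi in zip(values, values[1:]):
        cut = (lo + hi) / 2.0
        left = [y for row, y in zip(rows, labels) if row[feature] <= cut]
        right = [y for row, y in zip(rows, labels) if row[feature] > cut]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if score < best[0]:
            best = (score, cut, majority(left), majority(right))
    _, cut, left_label, right_label = best
    return feature, cut, left_label, right_label

def fit_forest(rows, labels, n_trees=25, seed=0):
    """Train each stump on a bootstrap sample and one randomly chosen feature."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]
        feature = rng.randrange(len(FEATURES))
        forest.append(
            fit_stump([rows[i] for i in idx], [labels[i] for i in idx], feature)
        )
    return forest

def cancer_vote_fraction(forest, row):
    """Fraction of trees voting 'cancer-associated' (label 1) for one variant."""
    votes = 0
    for feature, cut, left_label, right_label in forest:
        if cut is None or row[feature] <= cut:
            votes += left_label
        else:
            votes += right_label
    return votes / len(forest)

def make_synthetic(n_per_class=100, seed=1):
    """Synthetic, well-separated scores; the ranges are invented for illustration."""
    rng = random.Random(seed)
    rows, labels = [], []
    for _ in range(n_per_class):
        # Assumed cancer-associated profile: low SIFT, high LogR.E-value, high GOSS.
        rows.append((rng.uniform(0.0, 0.1), rng.uniform(0.5, 3.0), rng.uniform(2.0, 5.0)))
        labels.append(1)
        # Assumed neutral profile: the opposite tendencies.
        rows.append((rng.uniform(0.2, 1.0), rng.uniform(-1.0, 0.4), rng.uniform(-2.0, 1.5)))
        labels.append(0)
    return rows, labels

rows, labels = make_synthetic()
forest = fit_forest(rows, labels)
print(cancer_vote_fraction(forest, (0.01, 2.0, 4.0)))    # variant with a "damaging" profile
print(cancer_vote_fraction(forest, (0.80, -0.5, -1.0)))  # variant with a "neutral" profile
```

A production random forest grows deeper trees and considers several candidate features at each split; stumps are used here only to keep the voting mechanism visible. The vote fraction plays the role of the classifier's confidence that a change is cancer-associated.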