SNAP: predict effect of non-synonymous polymorphisms on function
Top Cited Papers
Open Access
- 7 May 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (11) , 3823-3835
- https://doi.org/10.1093/nar/gkm238
Abstract
Many genetic variations are single nucleotide polymorphisms (SNPs). Non-synonymous SNPs are ‘neutral’ if the resulting point-mutated protein is not functionally discernible from the wild type and ‘non-neutral’ otherwise. The ability to identify non-neutral substitutions could significantly aid targeting disease causing detrimental mutations, as well as SNPs that increase the fitness of particular phenotypes. Here, we introduced comprehensive data sets to assess the performance of methods that predict SNP effects. Along we introduced SNAP (screening for non- a cceptable p olymorphisms), a neural network-based method for the prediction of the functional effects of non-synonymous SNPs. SNAP needs only sequence information as input, but benefits from functional and structural annotations, if available. In a cross-validation test on over 80 000 mutants, SNAP identified 80% of the non-neutral substitutions at 77% accuracy and 76% of the neutral substitutions at 80% accuracy. This constituted an important improvement over other methods; the improvement rose to over ten percentage points for mutants for which existing methods disagreed. Possibly even more importantly SNAP introduced a well-calibrated measure for the reliability of each prediction. This measure will allow users to focus on the most accurate predictions and/or the most severe effects. Available at http://www.rostlab.org/services/SNAPKeywords
This publication has 54 references indexed in Scilit:
- Distinguishing Protein-Coding from Non-Coding RNAs through Support Vector MachinesPLoS Genetics, 2006
- Sequence‐based prediction of pathological mutationsProteins-Structure Function and Bioinformatics, 2004
- UniProt: the Universal Protein knowledgebaseNucleic Acids Research, 2004
- The Pfam protein families databaseNucleic Acids Research, 2004
- Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: structure-based assessment of amino acid variation11Edited by F. CohenJournal of Molecular Biology, 2001
- The Protein Data BankNucleic Acids Research, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Prediction of Protein Secondary Structure at Better than 70% AccuracyJournal of Molecular Biology, 1993
- Crystal structure of sickle-cell deoxyhemoglobin at 5 Å resolutionJournal of Molecular Biology, 1975