dbNSFP: A lightweight database of human nonsynonymous SNPs and their functional predictions
Top Cited Papers
Open Access
- 21 April 2011
- journal article
- databases
- Published by Hindawi Limited in Human Mutation
- Vol. 32 (8) , 894-899
- https://doi.org/10.1002/humu.21517
Abstract
With the advance of sequencing technologies, whole exome sequencing has increasingly been used to identify mutations that cause human diseases, especially rare Mendelian diseases. Among the analysis steps, functional prediction (of being deleterious) plays an important role in filtering or prioritizing nonsynonymous SNP (NS) for further analysis. Unfortunately, different prediction algorithms use different information and each has its own strength and weakness. It has been suggested that investigators should use predictions from multiple algorithms instead of relying on a single one. However, querying predictions from different databases/Web‐servers for different algorithms is both tedious and time consuming, especially when dealing with a huge number of NSs identified by exome sequencing. To facilitate the process, we developed dbNSFP (database for nonsynonymous SNPs' functional predictions). It compiles prediction scores from four new and popular algorithms (SIFT, Polyphen2, LRT, and MutationTaster), along with a conservation score (PhyloP) and other related information, for every potential NS in the human genome (a total of 75,931,005). It is the first integrated database of functional predictions from multiple algorithms for the comprehensive collection of human NSs. dbNSFP is freely available for download at http://sites.google.com/site/jpopgen/dbNSFP. Hum Mutat 32:894–899, 2011.Keywords
Funding Information
- the National Institutes of Health (RC2-HL02419-01, RC2 HL103010-01, 1U01HG005728-01)
This publication has 29 references indexed in Scilit:
- A method and server for predicting damaging missense mutationsNature Methods, 2010
- Single-nucleotide evolutionary constraint scores highlight disease-causing mutationsNature Methods, 2010
- Dealing with missing values in large-scale studies: microarray data imputation and beyondBriefings in Bioinformatics, 2009
- Bioinformatics approaches for genomics and post genomics applications of next-generation sequencingBriefings in Bioinformatics, 2009
- Identification of deleterious mutations within three human genomesGenome Research, 2009
- Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithmNature Protocols, 2009
- Next generation tools for the annotation of human SNPsBriefings in Bioinformatics, 2009
- F-SNP: computationally predicted functional SNPs for disease association studiesNucleic Acids Research, 2007
- Predicting the Effects of Amino Acid Substitutions on Protein FunctionAnnual Review of Genomics and Human Genetics, 2006
- Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary informationBioinformatics, 2006