The Predicted Impact of Coding Single Nucleotide Polymorphisms Database
Open Access
- 1 November 2005
- journal article
- Published by American Association for Cancer Research (AACR) in Cancer Epidemiology, Biomarkers & Prevention
- Vol. 14 (11), 2598-2604
- https://doi.org/10.1158/1055-9965.epi-05-0469
Abstract
Nonsynonymous single nucleotide polymorphisms (nsSNP) have the potential to affect the structure or function of expressed proteins and are, therefore, likely to represent modifiers of inherited susceptibility. We have classified and catalogued the predicted functionality of nsSNPs in genes relevant to the biology of cancer to facilitate sequence-based association studies. Candidate genes were identified using targeted search terms and pathways to interrogate the Gene Ontology Consortium database, Kyoto Encyclopedia of Genes and Genomes database, Iobion's Interaction Explorer PathwayAssist Program, National Center for Biotechnology Information Entrez Gene database, and CancerGene database. A total of 9,537 validated nsSNPs located within annotated genes were retrieved from National Center for Biotechnology Information dbSNP Build 123. Filtering this list and linking it to 7,080 candidate genes yielded 3,666 validated nsSNPs with minor allele frequencies ≥0.01 in Caucasian populations. The functional effect of nsSNPs in genes with a single mRNA transcript was predicted using three computational tools: the Grantham matrix, Polymorphism Phenotyping, and Sorting Intolerant from Tolerant algorithms. The resultant pool of 3,009 fully annotated nsSNPs is accessible from the Predicted Impact of Coding SNPs database at http://www.icr.ac.uk/cancgen/molgen/MolPopGen_PICS_database.htm. Predicted Impact of Coding SNPs is an ongoing project that will continue to curate and release data on the putative functionality of coding SNPs.
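The filtering step described in the abstract (validated nsSNPs, located in candidate genes, with minor allele frequency ≥0.01) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the `NsSnp` record, the `filter_nssnps` function, the gene names, and the identifiers are all hypothetical and invented for illustration. Only the 0.01 frequency cutoff comes from the abstract.

```python
from dataclasses import dataclass

MAF_THRESHOLD = 0.01  # cutoff stated in the abstract


@dataclass(frozen=True)
class NsSnp:
    """Hypothetical record layout; not the dbSNP or PICS schema."""
    rs_id: str       # placeholder identifier, not a real dbSNP entry
    gene: str        # gene symbol the variant maps to
    validated: bool  # whether the variant is marked as validated
    maf: float       # minor allele frequency in the population of interest


def filter_nssnps(snps, candidate_genes, maf_threshold=MAF_THRESHOLD):
    """Keep validated nsSNPs in candidate genes with MAF >= threshold."""
    return [
        s for s in snps
        if s.validated and s.gene in candidate_genes and s.maf >= maf_threshold
    ]


# Toy records, invented for illustration only.
snps = [
    NsSnp("rs_demo1", "GENE_A", True, 0.12),
    NsSnp("rs_demo2", "GENE_A", True, 0.004),  # below the MAF cutoff
    NsSnp("rs_demo3", "GENE_B", False, 0.30),  # not validated
    NsSnp("rs_demo4", "GENE_C", True, 0.05),   # gene not in the candidate set
    NsSnp("rs_demo5", "GENE_B", True, 0.01),   # exactly at the cutoff, kept
]
candidate_genes = {"GENE_A", "GENE_B"}

kept = filter_nssnps(snps, candidate_genes)
print([s.rs_id for s in kept])  # → ['rs_demo1', 'rs_demo5']
```

The sketch treats the cutoff as inclusive, matching the "≥0.01" wording. The functional predictions made afterwards (Grantham matrix, Polymorphism Phenotyping, Sorting Intolerant from Tolerant) are not modelled here.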