Data mining: Efficiency of using sequence databases for polymorphism discovery
- 23 January 2001
- journal article
- research article
- Published by Hindawi Limited in Human Mutation
- Vol. 17 (2) , 141-150
- https://doi.org/10.1002/1098-1004(200102)17:2<141::aid-humu6>3.0.co;2-1
Abstract
An open question in research on Single Nucleotide Polymorphisms (SNPs) is, what is the percentage of true SNPs found by in silico pre-screening? To this end, we selected 13 genes, and determined the complete collection of “true” polymorphisms, or polymorphisms experimentally detected, existing in these genes in our laboratory using Denaturing High Performance Liquid Chromatography (DHPLC) and fluorescent sequencing, or in other laboratories using comparable methods. The genes studied by our group were PTGS2, IGFBP1, IGFBP3, and CYP19. GenBank sequence information was then aligned using two methods, and sequence differences termed “candidate” polymorphisms. We then compared the series of SNPs obtained experimentally and in silico and we have found that in silico methods are relatively specific (up to 55% of candidate SNPs found by SNPFinder have been discovered by experimental procedure) but have low sensitivity (not more than 27% of true SNPs are found by in silico methods). Hum Mutat 17:141–150, 2001.Keywords
This publication has 18 references indexed in Scilit:
- A candidate gene for psoriasis near HLA-C, HCR (Pg8), is highly polymorphic with a disease-associated susceptibility allele.Human Molecular Genetics, 2000
- Identification of Candidate Coding Region Single Nucleotide Polymorphisms in 165 Human Genes Using Assembled Expressed Sequence TagsGenome Research, 1999
- Characterization of single-nucleotide polymorphisms in coding regions of human genesNature Genetics, 1999
- Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasisNature Genetics, 1999
- Reliable identification of large numbers of candidate SNPs from public EST dataNature Genetics, 1999
- Genetic polymorphism of human O6-alkylguanine-DNA alkyltransferasePharmacogenetics, 1999
- The P-selectin gene is highly polymorphic: reduced frequency of the Pro715 allele carriers in patients with myocardial infarctionHuman Molecular Genetics, 1998
- Single nucleotide polymorphism hunting in cyberspaceHuman Mutation, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Polymorphisms of the Transforming Growth Factor-β1 Gene in Relation to Myocardial Infarction and Blood PressureHypertension, 1996