Predicting cancer susceptibility from single-nucleotide polymorphism data
- 21 August 2005
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
This paper asks whether susceptibility to early-onset Multiple Myeloma (diagnosis before age 40), a particularly deadly form of cancer, can be predicted from single-nucleotide polymorphism (SNP) profiles with accuracy greater than chance. Specifically, given SNP profiles for 80 Multiple Myeloma patients -- 40 of whom we believe to have high susceptibility and 40 lower susceptibility -- we train a support vector machine (SVM) to predict age at diagnosis. We chose SVMs for this task because they are well suited to handling interactions among features and redundant features. The accuracy of the trained SVM, estimated by leave-one-out cross-validation, is 71%, significantly greater than random guessing. This result is particularly encouraging since only 3000 SNPs were used in profiling, whereas several million SNPs are known.
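To give a sense of what "significantly greater than random guessing" can mean for 80 balanced cases, the following is a minimal sketch of an exact one-sided binomial test against a 50% chance rate. It rests on assumptions not stated in the abstract: that 71% corresponds to about 57 of 80 correct leave-one-out predictions (0.71 × 80 = 56.8, rounded), and that a binomial test is an acceptable approximation even though leave-one-out predictions are not strictly independent. The abstract does not say which significance test the authors used, and the helper name `binomial_tail` is hypothetical.

```python
from math import comb


def binomial_tail(successes: int, trials: int, p: float = 0.5) -> float:
    """Return P(X >= successes) for X ~ Binomial(trials, p)."""
    return sum(
        comb(trials, k) * p**k * (1 - p) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Figures taken from the abstract.
n_patients = 80
reported_accuracy = 0.71

# Assumption: the reported 71% is a rounded fraction of the 80 patients.
n_correct = round(reported_accuracy * n_patients)

# Probability of doing at least this well by guessing on balanced classes.
p_value = binomial_tail(n_correct, n_patients)
print(f"{n_correct}/{n_patients} correct, one-sided p = {p_value:.2e}")
```

Under these assumptions the p-value falls below 0.001, which is consistent with the abstract's claim, though it should be read as a rough check and not as a reproduction of the paper's analysis.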