Tree and spline based association analysis of gene–gene interaction models for ischemic stroke
- 6 April 2004
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 23 (9) , 1439-1453
- https://doi.org/10.1002/sim.1749
Abstract
In the biology of complex disorders, such as atherothrombosis, interactions among genetic factors may play an important role, and theoretical considerations suggest that gene–gene interactions are quite common in such diseases. We used a nested case-control sample from the Physicians' Health Study, a randomized trial assessing the effects of aspirin and beta-carotene on cardiovascular disease and cancer among 22071 US male physicians, to examine these relationships for ischemic stroke. Data were available on 92 polymorphisms from 56 candidate genes related to inflammation, thrombosis and lipid metabolism, assessed in 319 incident cases of ischemic stroke and 2090 disease-free controls. We used classification and regression trees (CART) and multivariate adaptive regression spline (MARS) models to explore the presence of genetic interactions in these data. These models offer advantages over typical logistic regression methods in that they may uncover interactions among genes that do not exhibit strong marginal effects. Final models were selected using either the Bayes Information Criterion or cross-validation. Model fit was assessed using 10-fold cross-validation of the entire selection process. Both the CART and two-way MARS-logit models identified an interaction between two polymorphisms linked to inflammation, the P-selectin (val640leu) and interleukin-4 (C(582) T) genes. Internal validation of these models, however, suggested that effects of these polymorphisms are additive. Although further external validation of these models is necessary, these methods may be valuable in exploring and identifying potential gene–gene as well as gene–environment interactions in association studies. Copyright © 2004 John Wiley & Sons, Ltd.Keywords
This publication has 28 references indexed in Scilit:
- SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivationNature Genetics, 2008
- Polymorphism in the P-selectin and interleukin-4 genes as determinants of stroke: a population-based, prospective genetic analysisHuman Molecular Genetics, 2003
- Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humansHuman Molecular Genetics, 2002
- A Perspective on Epistasis: Limits of Models Displaying No Main EffectAmerican Journal of Human Genetics, 2002
- Internal validation of predictive modelsJournal of Clinical Epidemiology, 2001
- A Combinatorial Partitioning Method to Identify Multilocus Genotypic Partitions That Predict Quantitative Trait VariationGenome Research, 2001
- Polychotomous RegressionJournal of the American Statistical Association, 1997
- Polychotomous RegressionJournal of the American Statistical Association, 1997
- Who's afraid of epistasis?Nature Genetics, 1996
- Multivariate Adaptive Regression SplinesThe Annals of Statistics, 1991