Methods for interaction analyses using family‐based case‐control data: conditional logistic regression versus generalized estimating equations
- 12 June 2007
- journal article
- research article
- Published by Wiley in Genetic Epidemiology
- Vol. 31 (8) , 883-893
- https://doi.org/10.1002/gepi.20249
Abstract
A complex web of gene‐gene and gene‐environment interactions likely underlies late‐onset disease development. We compared conditional logistic regression (CLR) and generalized estimating equations (GEE) in modeling such interactions in pedigrees with missing parents. Using the simulation of linkage and association (SIMLA) program, disease genes, an environmental risk factor, gene‐gene interaction, and gene‐environment interaction were generated in family‐based data sets. Four scenarios for the relationship between the marker and disease loci were examined: linkage and association, linkage without association, association without linkage, and absence of both linkage and association. Models for CLR and GEE (with exchangeable and independence correlation matrices) were built, and type I error, power, average odds ratio (OR), standard deviation, and 95% confidence intervals were estimated. CLR and GEE were valid tests of association in the presence of linkage, but type I error was inflated for association without linkage, particularly with GEE. CLR generated estimates of the OR with lower bias but often more variability than the OR estimates observed for GEE. Further, GEE was more powerful than CLR in detecting main and interactive effects. Although GEE with both matrices had similar power, use of the independence matrix resulted in lower type I error and less biased OR estimation as compared to the exchangeable matrix. Our findings support the use of GEE in maximizing power to detect gene‐gene and gene‐environment interactions but caution its use under potential association without linkage (e.g., population stratification) and the interpretation of its OR estimates. Genet. Epidemiol. 2007.
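The abstract reports type I error, power, average OR, standard deviation, and 95% confidence intervals across simulated data sets, but does not say how those summaries were computed. The following is a minimal sketch of one common convention, not the authors' procedure: the rejection rate is the share of replicates with p below a threshold (type I error when the data are simulated under the null, power otherwise), and the interval is a normal-approximation interval for the mean OR. The function name `summarize_replicates`, the 0.05 threshold, and the interval formula are all assumptions made for illustration.

```python
import math
import statistics


def summarize_replicates(p_values, or_estimates, alpha=0.05):
    """Summarize one model term across simulated data sets (hypothetical helper).

    p_values: one p-value per simulated data set.
    or_estimates: one odds ratio estimate per simulated data set.
    alpha: significance threshold (assumed; not stated in the abstract).
    """
    n = len(p_values)
    if n != len(or_estimates) or n < 2:
        raise ValueError("need matching inputs with at least two replicates")

    # Share of replicates that reject: type I error under a null scenario,
    # power under a scenario with a true effect.
    rejection_rate = sum(p < alpha for p in p_values) / n

    mean_or = statistics.mean(or_estimates)
    sd_or = statistics.stdev(or_estimates)  # sample standard deviation

    # Assumed interval: normal approximation for the mean OR over replicates.
    half_width = 1.96 * sd_or / math.sqrt(n)

    return {
        "rejection_rate": rejection_rate,
        "mean_or": mean_or,
        "sd_or": sd_or,
        "ci95": (mean_or - half_width, mean_or + half_width),
    }


if __name__ == "__main__":
    # Made-up replicate values, used only to show the call.
    print(summarize_replicates([0.01, 0.20, 0.03, 0.60], [1.0, 2.0, 3.0, 2.0]))
```

Under this convention the same function serves all four linkage and association scenarios: only the interpretation of `rejection_rate` changes with whether the simulated marker truly affects disease risk.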