Two‐Stage sampling designs for gene association studies
- 12 November 2004
- journal article
- research article
- Published by Wiley in Genetic Epidemiology
- Vol. 27 (4) , 401-414
- https://doi.org/10.1002/gepi.20047
Abstract
We consider two‐stage case‐control designs for testing associations between single nucleotide polymorphisms (SNPs) and disease, in which a subsample of subjects is used to select a panel of “tagging” SNPs that will be considered in the main study. We propose a pseudolikelihood [Pepe and Flemming, 1991: JASA 86:108–113] that combines the information from both the main study and the substudy to test the association with any polymorphism in the original set. SNP‐tagging [Chapman et al., 2003: Hum Hered 56:18–31] and haplotype‐tagging [Stram et al., 2003a; Hum Hered 55:27–36] approaches are compared. We show that the cost‐efficiency of such a design for estimating the relative risk associated with the causal polymorphism can be considerably better than for a single‐stage design, even if the causal polymorphism is not included in the tag‐SNP set. We also consider the optimal selection of cases and controls in such designs and the relative efficiency for estimating the location of a causal variant in linkage disequilibrium mapping. Nevertheless, as the cost of high‐volume genotyping plummets and haplotype tagging information from the International HapMap project [Gibbs et al., 2003; Nature 426:789–796] rapidly accumulates in public databases, such two‐stage designs may soon become unnecessary.Genet. Epidemiol.Keywords
This publication has 44 references indexed in Scilit:
- SNPs, haplotypes, and model selection in a candidate gene region: The SIMPle analysis for multilocus dataGenetic Epidemiology, 2004
- Haplotype Block Partitioning and Tag SNP Selection Using Genotype Data and Their Applications to Association StudiesGenome Research, 2004
- The International HapMap ProjectNature, 2003
- Principal component analysis for selection of optimal SNP‐sets that capture intragenic genetic variationGenetic Epidemiology, 2003
- Logistic regression of family data from retrospective study designsGenetic Epidemiology, 2003
- Selection and Evaluation of Tagging SNPs in the Neuronal-Sodium-Channel Gene SCN1A: Implications for Linkage-Disequilibrium Gene MappingAmerican Journal of Human Genetics, 2003
- Selection of Genetic Markers for Association Analyses, Using Linkage Disequilibrium and HaplotypesAmerican Journal of Human Genetics, 2003
- Score Tests for Association between Traits and Haplotypes when Linkage Phase Is AmbiguousAmerican Journal of Human Genetics, 2002
- Sequential analysis and case-control candidate gene association studies: Reply to sobell et alAmerican Journal of Medical Genetics, 1994
- Generalized Inverses, Wald's Method, and the Construction of Chi-Squared Tests of FitJournal of the American Statistical Association, 1977