Optimal two‐stage genotyping in population‐based association studies
- 8 August 2003
- journal article
- research article
- Published by Wiley in Genetic Epidemiology
- Vol. 25 (2), 149-157
- https://doi.org/10.1002/gepi.10260
Abstract
We propose a cost-effective two-stage approach to investigate gene-disease associations when testing a large number of candidate markers using a case-control design. Under this approach, all the markers are genotyped and tested at stage 1 using a subset of affected cases and unaffected controls, and the most promising markers are genotyped on the remaining individuals and tested using all the individuals at stage 2. The sample size at stage 1 is chosen such that the power to detect the true markers of association is 1−β1 at significance level α1. The most promising markers are tested at significance level α2 at stage 2. In contrast, a one-stage approach would evaluate and test all the markers on all the cases and controls to identify the markers significantly associated with the disease. The goal is to determine the two-stage parameters (α1, β1, α2) that minimize the cost of the study such that the desired overall significance is α and the desired power is close to 1−β, the power of the one-stage approach. We provide analytic formulae to estimate the two-stage parameters. The properties of the two-stage approach are evaluated under various parametric configurations and compared with those of the corresponding one-stage approach. The optimal two-stage procedure does not depend on the signal of the markers associated with the study. Further, when there is a large number of markers, the optimal procedure is not substantially influenced by the total number of markers associated with the disease. The results show that, compared to a one-stage approach, a two-stage procedure typically halves the cost of the study. Genet Epidemiol 25:149–157, 2003.
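The cost comparison described in the abstract can be illustrated with a rough sketch. It does not reproduce the authors' analytic formulae, which are not given here. It assumes a two-sided normal (z) test, for which the required sample size scales with the squared sum of the two normal quantiles, so the effect size cancels in the ratio of stage 1 to one-stage sample size. It also assumes that cost is proportional to the number of genotypes and that the one-stage level is applied per marker. The function names `stage1_fraction` and `relative_cost` and all numeric inputs are hypothetical.

```python
from statistics import NormalDist


def stage1_fraction(alpha, beta, alpha1, beta1):
    """Stage 1 sample size as a fraction of the one-stage sample size.

    Assumption: two-sided z-test, so n is proportional to
    (z_{1-a/2} + z_{1-b})^2 and the effect size cancels in the ratio.
    alpha is the per-marker level of the one-stage test.
    """
    z = NormalDist().inv_cdf
    one_stage = z(1 - alpha / 2) + z(1 - beta)
    stage1 = z(1 - alpha1 / 2) + z(1 - beta1)
    return (stage1 / one_stage) ** 2


def relative_cost(m, d, frac1, alpha1, beta1):
    """Expected two-stage genotyping cost divided by the one-stage cost.

    Assumption: cost is proportional to (markers x individuals).
    m is the number of markers, d the number truly associated.
    """
    # Expected markers carried to stage 2: false positives plus true hits.
    promising = (m - d) * alpha1 + d * (1 - beta1)
    # All markers on the stage 1 subset, promising markers on the rest.
    return frac1 + (promising / m) * (1 - frac1)


# Hypothetical inputs: 1000 markers, 10 truly associated,
# Bonferroni-style per-marker level 0.05 / 1000, one-stage power 0.8.
frac1 = stage1_fraction(alpha=0.05 / 1000, beta=0.2, alpha1=0.1, beta1=0.05)
cost = relative_cost(m=1000, d=10, frac1=frac1, alpha1=0.1, beta1=0.05)
print(f"stage 1 fraction: {frac1:.2f}, relative cost: {cost:.2f}")
```

With these illustrative inputs the two-stage design costs roughly half as much as the one-stage design, in line with the abstract's summary. The sketch does not choose α2 or verify the overall significance and power, which the paper's optimization handles.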