Power and Sample Size for Testing Associations of Haplotypes with Complex Traits
- 31 October 2005
- journal article
- research article
- Published by Wiley in Annals of Human Genetics
- Vol. 70 (1), 116-130
- https://doi.org/10.1111/j.1529-8817.2005.00215.x
Abstract
Summary: Evaluation of the association of haplotypes with either quantitative traits or disease status is common practice, and under some situations provides greater power than the evaluation of individual marker loci. The focus on haplotype analyses will increase as more single nucleotide polymorphisms (SNPs) are discovered, either because of interest in candidate gene regions, or because of interest in genome‐wide association studies. However, there is little guidance on the determination of the sample size needed to achieve the desired power for a study, particularly when linkage phase of the haplotypes is unknown, and when a subset of tag‐SNP markers is measured. There is a growing wealth of information on the distribution of haplotypes in different populations, and it is not unusual for investigators to measure genetic markers in pilot studies in order to gain knowledge of the distribution of haplotypes in the target population. Starting with this basic information on the distribution of haplotypes, we derive analytic methods to determine sample size or power to test the association of haplotypes with either a quantitative trait or disease status (e.g., a case‐control study design), assuming that all subjects are unrelated. Our derivations cover both phase‐known and phase‐unknown haplotypes, allowing evaluation of the loss of efficiency due to unknown phase. We also extend our methods to when a subset of tag‐SNPs is chosen, allowing investigators to explore the impact of tag‐SNPs on power. Simulations illustrate that the theoretical power predictions are quite accurate over a broad range of conditions. Our theoretical formulae should provide useful guidance when planning haplotype association studies.