Efficiency and consistency of haplotype tagging of dense SNP maps in multiple samples
Open Access
- 14 September 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Human Molecular Genetics
- Vol. 13 (21) , 2557-2565
- https://doi.org/10.1093/hmg/ddh294
Abstract
Haplotype tagging is a means of retaining most of the information in high density marker maps, while reducing genotyping requirements. Estimates of the numbers of tagging SNPs required to cover the human genome have varied widely, ranging from 100 000 to 1 000 000. Tagging has been applied to a number of gene-based datasets but has not been evaluated in contexts reflecting those of genome-wide association studies—large chromosome regions and multiple samples drawn from the same population. We analysed 5000 common markers across a 10 Mb segment of human chromosome 20 in three samples (UK Caucasian, CEPH Caucasian, African American) to evaluate tagging efficiency and consistency. Overall, the results indicate a high degree of efficiency, yielding 3–5-fold savings in Caucasians and 2–3-fold savings in African Americans. These levels varied according to linkage disequilibrium (LD) levels, tagging thresholds and allele frequencies, but in high LD regions they did not vary markedly due to marker density. However, a strong positive relationship between marker density and tagging was observed, relating to the fact that increasing marker density yields greater sequence coverage in high LD, thus requiring more tag SNPs to cover a greater fraction of the genome. Encouragingly, whatever the density employed, a high level of robustness was observed between UK and CEPH samples, as most of the htSNPs selected in one sample were also appropriate as tags in the other.Keywords
This publication has 34 references indexed in Scilit:
- Does haplotype diversity predict power for association mapping of disease susceptibility?Human Genetics, 2004
- Haplotype Block Partitioning and Tag SNP Selection Using Genotype Data and Their Applications to Association StudiesGenome Research, 2004
- The International HapMap ProjectNature, 2003
- Genome scans and candidate gene approaches in the study of common diseases and variable drug responsesTrends in Genetics, 2003
- Selection and Evaluation of Tagging SNPs in the Neuronal-Sodium-Channel Gene SCN1A: Implications for Linkage-Disequilibrium Gene MappingAmerican Journal of Human Genetics, 2003
- Selection of Genetic Markers for Association Analyses, Using Linkage Disequilibrium and HaplotypesAmerican Journal of Human Genetics, 2003
- A high-resolution recombination map of the human genomeNature Genetics, 2002
- Patterns of linkage disequilibrium in the human genomeNature Reviews Genetics, 2002
- The DNA sequence and comparative analysis of human chromosome 20Nature, 2001
- Sequence Variation and Linkage Disequilibrium in the Human T-Cell Receptor β (TCRB) LocusAmerican Journal of Human Genetics, 2001