A comparison of tagging methods and their tagging space
Open Access
- 15 August 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Human Molecular Genetics
- Vol. 14 (18) , 2757-2767
- https://doi.org/10.1093/hmg/ddi309
Abstract
Single-nucleotide polymorphism (SNP) tagging is widely used as a way of saving genotyping costs in association studies. A number of different tagging methods have been developed to reduce the number of markers to be genotyped while maintaining power for detecting effects on non-assayed SNPs. How the different methods perform in different settings, the degree to which they overlap and share common tags and how they differ are important questions. We investigated these questions by comparing three widely used tagging methods/algorithms—one haplotype r2 -based method, one pair-wise r2 -based method and one method which was based on haplotype diversity but focused on major haplotypes. Tagging efficiency was defined as the number of genotyped markers divided by the number of tagging SNPs. Tagging effectiveness was defined as the proportion of un-genotyped or ‘hidden’ SNPs being detected (having a pair-wise or haplotype r2 with a set of tagging SNPs over a threshold, e.g. haplotype r2 ≥0.80). The ENCODE regions genotyped on the HapMap CEPH individuals were examined in this study. Tagging effectiveness was generally poor for rare SNPs than for common SNPs, for all three tagging methods. Inclusion of rare SNPs into initial HapMap scheme could enhance the performance of tags on rare hidden SNPs at the expense of increased genotyping cost. At a moderate tagging efficiency, more than 90% of hidden SNPs detected by tagging SNPs selected by one method were also detected by tagging SNPs selected by another method, and this figure could be increased to 100% if tagging efficiency was allowed to drop. These results indicate that the tagging space is highly concordant between different tagging methods, despite the fact that they often involve different sets of tagging SNPs.Keywords
This publication has 29 references indexed in Scilit:
- A single-nucleotide polymorphism tagging set for human drug metabolism and transportNature Genetics, 2004
- CLUSTAG: hierarchical clustering and graph methods for selecting tag SNPsBioinformatics, 2004
- Haploview: analysis and visualization of LD and haplotype mapsBioinformatics, 2004
- Optimal Haplotype Block-Free Selection of Tagging SNPs for Genome-Wide Association StudiesGenome Research, 2004
- Haplotype Block Partitioning and Tag SNP Selection Using Genotype Data and Their Applications to Association StudiesGenome Research, 2004
- The International HapMap ProjectNature, 2003
- Entropy-based SNP selection for genetic association studiesHuman Genetics, 2003
- Genome scans and candidate gene approaches in the study of common diseases and variable drug responsesTrends in Genetics, 2003
- Selection and Evaluation of Tagging SNPs in the Neuronal-Sodium-Channel Gene SCN1A: Implications for Linkage-Disequilibrium Gene MappingAmerican Journal of Human Genetics, 2003
- Selection of Genetic Markers for Association Analyses, Using Linkage Disequilibrium and HaplotypesAmerican Journal of Human Genetics, 2003