Whole population, genome-wide mapping of hidden relatedness
Open Access
- 29 October 2008
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 19 (2) , 318-326
- https://doi.org/10.1101/gr.081398.108
Abstract
We present GERMLINE, a robust algorithm for identifying segmental sharing indicative of recent common ancestry between pairs of individuals. Unlike methods with comparable objectives, GERMLINE scales linearly with the number of samples, enabling analysis of whole-genome data in large cohorts. Our approach is based on a dictionary of haplotypes that is used to efficiently discover short exact matches between individuals. We then expand these matches using dynamic programming to identify long, nearly identical segmental sharing that is indicative of relatedness. We use GERMLINE to comprehensively survey hidden relatedness both in the HapMap and in a densely typed island population of 3000 individuals. We verify that GERMLINE is in concordance with other methods when they can process the data, and also facilitates analysis of larger scale studies. We bolster these results by demonstrating novel applications of precise analysis of hidden relatedness for (1) identification and resolution of phasing errors and (2) exposing polymorphic deletions that are otherwise challenging to detect. This finding is supported by concordance of detected deletions with other evidence from independent databases and statistical analyses of fluorescence intensity not used by GERMLINE.
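The seed-and-extend idea described in the abstract can be illustrated with a minimal Python sketch. It is not the GERMLINE implementation: the function names (`seed_matches`, `extend`, `shared_segments`), the parameters (`window`, `max_mismatches`, `min_length`), and the representation of phased haplotypes as equal-length strings of `0`/`1` alleles are all assumptions made for illustration, and the extension step uses a simple greedy mismatch budget in place of the dynamic programming the abstract mentions.

```python
from collections import defaultdict
from itertools import combinations


def seed_matches(haplotypes, window):
    """Hash each fixed-size window; samples in the same bucket match exactly there."""
    n_sites = len(next(iter(haplotypes.values())))
    seeds = defaultdict(list)
    for w in range(n_sites // window):
        start = w * window
        buckets = defaultdict(list)  # the "dictionary of haplotypes" for this window
        for name, hap in haplotypes.items():
            buckets[hap[start:start + window]].append(name)
        for names in buckets.values():
            for a, b in combinations(sorted(names), 2):
                seeds[(a, b)].append(w)
    return seeds


def extend(hap_a, hap_b, start, end, max_mismatches):
    """Greedily grow [start, end) left, then right, within one shared mismatch budget."""
    budget = max_mismatches
    while start > 0:
        if hap_a[start - 1] != hap_b[start - 1]:
            if budget == 0:
                break
            budget -= 1
        start -= 1
    while end < len(hap_a):
        if hap_a[end] != hap_b[end]:
            if budget == 0:
                break
            budget -= 1
        end += 1
    return start, end


def shared_segments(haplotypes, window=4, max_mismatches=1, min_length=8):
    """Return sorted (sample_a, sample_b, start, end) segments of at least min_length sites."""
    segments = set()
    for (a, b), windows in seed_matches(haplotypes, window).items():
        for w in windows:
            s, e = extend(haplotypes[a], haplotypes[b],
                          w * window, (w + 1) * window, max_mismatches)
            if e - s >= min_length:
                segments.add((a, b, s, e))
    return sorted(segments)


# Toy data (made up): A and B differ only at site 8; C is the complement of A.
toy = {
    "A": "011010011100",
    "B": "011010010100",
    "C": "100101100011",
}
print(shared_segments(toy))
```

In this sketch the hashing pass visits each sample once per window, so candidate pairs are found without comparing every pair of samples directly. Pairs are enumerated only within a bucket, which keeps the work small as long as few samples share any given window.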