The Power of Single-Nucleotide Polymorphisms for Large-Scale Parentage Inference

Top Cited Papers

1 April 2006

journal article
Published by Oxford University Press (OUP) in Genetics

Vol. 172 (4) , 2567-2582
https://doi.org/10.1534/genetics.105.048074

Abstract

Likelihood-based parentage inference depends on the distribution of a likelihood-ratio statistic, which, in most cases of interest, cannot be exactly determined, but only approximated by Monte Carlo simulation. We provide importance-sampling algorithms for efficiently approximating very small tail probabilities in the distribution of the likelihood-ratio statistic. These importance-sampling methods allow the estimation of small false-positive rates and hence permit likelihood-based inference of parentage in large studies involving a great number of potential parents and many potential offspring. We investigate the performance of these importance-sampling algorithms in the context of parentage inference using single-nucleotide polymorphism (SNP) data and find that they may accelerate the computation of tail probabilities >1 millionfold. We subsequently use the importance-sampling algorithms to calculate the power available with SNPs for large-scale parentage studies, paying particular attention to the effect of genotyping errors and the occurrence of related individuals among the members of the putative mother–father–offspring trios. These simulations show that 60–100 SNPs may allow accurate pedigree reconstruction, even in situations involving thousands of potential mothers, fathers, and offspring. In addition, we compare the power of exclusion-based parentage inference to that of the likelihood-based method. Likelihood-based inference is much more powerful under many conditions; exclusion-based inference would require 40% more SNP loci to achieve the same accuracy as the likelihood-based approach in one common scenario. Our results demonstrate that SNPs are a powerful tool for parentage inference in large managed and/or natural populations.

Keywords

This publication has 42 references indexed in Scilit:

Localization of Cancer Susceptibility Genes by Genome-wide Single-Nucleotide Polymorphism Linkage-Disequilibrium Mapping
Cancer Research, 2004
Estimation of genotype error rate using samples with pedigree information—an application on the GeneChip Mapping 10K array
Genomics, 2004
Impact of candidate sire number and sire relatedness on DNA polymorphism‐based measures of exclusion probability and probability of unambiguous parentage
Animal Genetics, 2004
The utility of single nucleotide polymorphisms in inferences of population history
Trends in Ecology & Evolution, 2003
The Structure of Haplotype Blocks in the Human Genome
Science, 2002
Relationship Inference from Trios of Individuals, in the Presence of Typing Error
American Journal of Human Genetics, 2002
The Discovery of Single-Nucleotide Polymorphisms—and Inferences about Human Demographic History
American Journal of Human Genetics, 2001
Statistical confidence for likelihood‐based paternity inference in natural populations
Molecular Ecology, 1998
Error tolerant parent identification from a finite set of individuals
Genetics Research, 1997
A sublinear algorithm for approximate keyword searching
Algorithmica, 1994