Discovering Disease Genes: Multipoint Linkage Analysis via a New Markov Chain Monte Carlo Approach
Open Access
- 1 November 2003
- journal article
- Published by Institute of Mathematical Statistics in Statistical Science
- Vol. 18 (4), 515-531
- https://doi.org/10.1214/ss/1081443233
Abstract
Multipoint linkage analyses of data collected on related individuals are often performed as a first step in the discovery of disease genes. Through the dependence in inheritance of genes segregating at several linked loci, multipoint linkage analysis detects and localizes chromosomal regions (called trait loci) which contain disease genes. Our ability to correctly detect and position these trait loci is increased with the analysis of data observed on large pedigrees and multiple genetic markers. However, large pedigrees generally contain substantial missing data and exact calculation of the required multipoint likelihoods quickly becomes intractable. In this paper, we present a new Markov chain Monte Carlo approach to multipoint linkage analysis which greatly extends the range of models and data sets for which analysis is practical. Several advances in Markov chain Monte Carlo theory, namely joint updates of latent variables across loci or meioses, integrated proposals, Metropolis–Hastings restarts via sequential imputation and Rao–Blackwellized estimators, are incorporated into a sampling strategy which mixes well and produces accurate results in real time. The methodology is demonstrated through its application to several data sets originating from a study of early-onset Alzheimer's disease in families of Volga-German ethnic origin.
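As a rough illustration of one idea named in the abstract, the sketch below contrasts a plain Monte Carlo estimator with a Rao–Blackwellized one on a toy problem. It is not the paper's sampler and involves no pedigree or marker data: the model (a standard normal latent variable `z`, an observation `y` drawn around `z * z`) and the names `simulate`, `raw_terms` and `rb_terms` are assumptions made purely for illustration. The point is only that averaging the conditional expectation E[y | z] instead of `y` itself targets the same quantity with lower variance.

```python
import random
import statistics


def simulate(n, seed=0):
    """Draw n toy samples; return raw terms and Rao-Blackwellized terms.

    Toy model (an assumption, not taken from the paper):
        z ~ Normal(0, 1)          latent variable
        y | z ~ Normal(z * z, 1)  noisy quantity of interest
    Both lists have mean E[y] = E[z^2] = 1 in expectation.
    """
    rng = random.Random(seed)
    raw, rb = [], []
    for _ in range(n):
        z = rng.gauss(0.0, 1.0)
        y = rng.gauss(z * z, 1.0)
        raw.append(y)        # plain Monte Carlo term
        rb.append(z * z)     # E[y | z], computed analytically
    return raw, rb


raw_terms, rb_terms = simulate(20000)
raw_estimate = statistics.fmean(raw_terms)
rb_estimate = statistics.fmean(rb_terms)

print("plain estimate:", raw_estimate)
print("Rao-Blackwellized estimate:", rb_estimate)
print("per-term variance (plain):", statistics.variance(raw_terms))
print("per-term variance (Rao-Blackwellized):", statistics.variance(rb_terms))
```

In this toy setting the conditional expectation is available in closed form, which is what makes the variance reduction free; in a linkage setting the corresponding conditional quantities would have to come from the genetic model itself.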