Sequence features in regions of weak and strong linkage disequilibrium
Open Access
- 26 October 2005
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 15 (11) , 1519-1534
- https://doi.org/10.1101/gr.4421405
Abstract
We use genotype data generated by the International HapMap Project to dissect the relationship between sequence features and the degree of linkage disequilibrium in the genome. We show that variation in linkage disequilibrium is broadly similar across populations and examine sequence landscape in regions of strong and weak disequilibrium. Linkage disequilibrium is generally low within ∼15 Mb of the telomeres of each chromosome and noticeably elevated in large, duplicated regions of the genome as well as within ∼5 Mb of centromeres and other heterochromatic regions. At a broad scale (100–1000 kb resolution), our results show that regions of strong linkage disequilibrium are typically GC poor and have reduced polymorphism. In addition, these regions are enriched for LINE repeats, but have fewer SINE, DNA, and simple repeats than the rest of the genome. At a fine scale, we examine the sequence composition of “hotspots” for the rapid breakdown of linkage disequilibrium and show that they are enriched in SINEs, in simple repeats, and in sequences that are conserved between species. Regions of high and low linkage disequilibrium (the top and bottom quartiles of the genome) have a higher density of genes and coding bases than the rest of the genome. Closer examination of the data shows that whereas some types of genes (including genes involved in immune response and sensory perception) are typically located in regions of low linkage disequilibrium, other genes (including those involved in DNA and RNA metabolism, response to DNA damage, and the cell cycle) are preferentially located in regions of strong linkage disequilibrium. Our results provide a detailed analysis of the relationship between sequence features and linkage disequilibrium and suggest an evolutionary justification for the heterogeneity in linkage disequilibrium in the genome.
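For readers unfamiliar with how the "degree of linkage disequilibrium" between two markers is quantified, the following is a minimal sketch of the standard pairwise statistics D, D′, and r², computed from phased haplotypes at two biallelic SNPs. The abstract does not say which statistic or windowing scheme the authors used, so this is an illustration of the general concept only, not a reproduction of their method. The function name `pairwise_ld` and the 0/1 allele coding are assumptions made for the example.

```python
from typing import Sequence, Tuple


def pairwise_ld(haplotypes: Sequence[Tuple[int, int]]) -> Tuple[float, float, float]:
    """Return (D, D_prime, r_squared) for two biallelic SNPs.

    Each haplotype is a pair (a, b) giving the allele (coded 0 or 1)
    carried at SNP A and SNP B on the same chromosome.
    This helper is hypothetical and written for illustration.
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("need at least one haplotype")

    # Frequencies of allele 1 at each SNP and of the 1-1 haplotype.
    p_a = sum(a for a, _ in haplotypes) / n
    p_b = sum(b for _, b in haplotypes) / n
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n

    # D: deviation of the observed haplotype frequency from independence.
    d = p_ab - p_a * p_b

    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both SNPs must be polymorphic in the sample")

    # r^2: squared correlation between the alleles at the two SNPs.
    r_squared = d * d / denom

    # D': |D| scaled by its maximum attainable value given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max

    return d, d_prime, r_squared


if __name__ == "__main__":
    # Toy data: the two SNPs are always inherited together.
    linked = [(1, 1), (1, 1), (0, 0), (0, 0)]
    print(pairwise_ld(linked))
```

In this toy data set the alleles at the two SNPs always co-occur, which corresponds to strong linkage disequilibrium; a sample in which all four haplotypes are equally frequent would give D = 0, the weak end of the scale.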