Sequence variations in the public human genome data reflect a bottlenecked population history
Open Access
- 26 December 2002
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 100 (1) , 376-381
- https://doi.org/10.1073/pnas.222673099
Abstract
Single-nucleotide polymorphisms (SNPs) constitute the great majority of variations in the human genome, and as heritable variable landmarks they are useful markers for disease mapping and resolving population structure. Redundant coverage in overlaps of large-insert genomic clones, sequenced as part of the Human Genome Project, comprises a quarter of the genome, and it is representative in terms of base compositional and functional sequence features. We mined these regions to produce 500,000 high-confidence SNP candidates as a uniform resource for describing nucleotide diversity and its regional variation within the genome. Distributions of marker density observed at different overlap length scales under a model of recombination and population size change show that the history of the population represented by the public genome sequence is one of collapse followed by a recent phase of mild size recovery. The inferred times of collapse and recovery are Upper Paleolithic, in agreement with archaeological evidence of the initial modern human colonization of Europe.Keywords
This publication has 29 references indexed in Scilit:
- Human Diallelic Insertion/Deletion PolymorphismsAmerican Journal of Human Genetics, 2002
- A new hominid from the Upper Miocene of Chad, Central AfricaNature, 2002
- The Structure of Haplotype Blocks in the Human GenomeScience, 2002
- A map of human genome sequence variation containing 1.42 million single nucleotide polymorphismsNature, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- Regions of Low Single-Nucleotide Polymorphism Incidence in Human and Orangutan Xq: Deserts and Recent CoalescencesGenomics, 2001
- Genome-wide analysis of single-nucleotide polymorphisms in human expressed sequencesNature Genetics, 2000
- SNP frequencies in human genes: an excess of rare alleles and differing modes of selectionTrends in Genetics, 2000
- A Greedy Algorithm for Aligning DNA SequencesJournal of Computational Biology, 2000
- Evolutionary Rate at the Molecular LevelNature, 1968