Widely distributed noncoding purifying selection in the human genome
- 24 July 2007
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 104 (30) , 12410-12415
- https://doi.org/10.1073/pnas.0705140104
Abstract
It is widely assumed that human noncoding sequences comprise a substantial reservoir for functional variants impacting gene regulation and other chromosomal processes. Evolutionarily conserved noncoding sequences (CNSs) in the human genome have attracted considerable attention for their potential to simplify the search for functional elements and phenotypically important human alleles. A major outstanding question is whether functionally significant human noncoding variation is concentrated in CNSs or distributed more broadly across the genome. Here, we combine wholegenome sequence data from four nonhuman species (chimp, dog, mouse, and rat) with recently available comprehensive human polymorphism data to analyze selection at single-nucleotide resolution. We show that a substantial fraction of active purifying selection in human noncoding sequences occurs outside of CNSs and is diffusely distributed across the genome. This finding suggests the existence of a large complement of human noncoding variants that may impact gene expression and phenotypic traits, the majority of which will escape detection with current approaches to genome analysis.Keywords
This publication has 30 references indexed in Scilit:
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- Ubiquitous selective constraints in the Drosophila genome revealed by a genome-wide interspecies comparisonGenome Research, 2006
- A haplotype map of the human genomeNature, 2005
- Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomesGenome Research, 2005
- Conserved non-genic sequences — an unexpected feature of mammalian genomesNature Reviews Genetics, 2005
- A Model of the Statistical Power of Comparative Genome Sequence AnalysisPLoS Biology, 2005
- Megabase deletions of gene deserts result in viable miceNature, 2004
- Comparative genomics at the vertebrate extremesNature Reviews Genetics, 2004
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- Adaptive protein evolution at the Adh locus in DrosophilaNature, 1991