A genomic mutational constraint map using variation in 76,156 human genomes
- 6 December 2023
- journal article
- research article
- Published by Springer Nature in Nature
- Vol. 625 (7993) , 92-100
- https://doi.org/10.1038/s41586-023-06045-0
Abstract
The depletion of disruptive variation caused by purifying natural selection (constraint) has been widely used to investigate protein-coding genes underlying human disorders [1,2,3,4], but attempts to assess constraint for non-protein-coding regions have proved more difficult. Here we aggregate, process and release a dataset of 76,156 human genomes from the Genome Aggregation Database (gnomAD), the largest public open-access human genome allele frequency reference dataset, and use it to build a genomic constraint map for the whole genome (genomic non-coding constraint of haploinsufficient variation (Gnocchi)). We present a refined mutational model that incorporates local sequence context and regional genomic features to detect depletions of variation. As expected, the average constraint for protein-coding sequences is stronger than that for non-coding regions. Within the non-coding genome, constrained regions are enriched for known regulatory elements and variants that are implicated in complex human diseases and traits, facilitating the triangulation of biological annotation, disease association and natural selection to non-coding DNA analysis. More constrained regulatory elements tend to regulate more constrained protein-coding genes, which in turn suggests that non-coding constraint can aid the identification of constrained genes that are as yet unrecognized by current gene constraint metrics. We demonstrate that this genome-wide constraint map improves the identification and interpretation of functional human genetic variation.
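The idea of "detecting depletions of variation" can be illustrated with a small sketch. It compares the number of variants observed in a genomic window with the number expected under a mutational model and reports a signed z-score, where positive values mean fewer variants than expected (stronger constraint). The function name `depletion_z`, the square-root scaling and the example counts are assumptions made for illustration only; the abstract does not state the exact formula behind Gnocchi, and the expected count would in practice come from the mutational model, which is not reproduced here.

```python
import math


def depletion_z(observed: int, expected: float) -> float:
    """Signed z-score for depletion of variation in one genomic window.

    Hypothetical helper for illustration: positive values mean fewer
    variants were observed than the mutational model expected.
    """
    if expected <= 0:
        raise ValueError("expected must be positive")
    if observed < 0:
        raise ValueError("observed must be non-negative")
    # Deviation from expectation, scaled by an approximate standard
    # deviation of a count with mean `expected` (assumed scaling).
    return (expected - observed) / math.sqrt(expected)


# Made-up example counts: 64 variants observed where 100 were expected.
z = depletion_z(observed=64, expected=100.0)
print(f"z = {z:.2f}")
```

A window with as many variants as expected scores 0, and a window with more variants than expected scores below 0.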
This publication has 89 references indexed in Scilit:
- A copy number variation morbidity map of developmental delay. Nature Genetics, 2011
- Mapping and analysis of chromatin state dynamics in nine human cell types. Nature, 2011
- Copy-Number Variations Involving the IHH Locus Are Associated with Syndactyly and Craniosynostosis. American Journal of Human Genetics, 2011
- BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics, 2010
- De novo copy number variants identify new genes and loci in isolated sporadic tetralogy of Fallot. Nature Genetics, 2009
- Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proceedings of the National Academy of Sciences, 2009
- Large recurrent microdeletions associated with schizophrenia. Nature, 2008
- Recurrent Reciprocal Genomic Rearrangements of 17q12 Are Associated with Renal Disease, Diabetes, and Epilepsy. American Journal of Human Genetics, 2007
- Mendelian Inheritance in Man and Its Online Version, OMIM. American Journal of Human Genetics, 2007
- Human Gene Mutation Database (HGMD®): 2003 update. Human Mutation, 2003