Assessing and managing risk when sharing aggregate genetic variant data
- 16 September 2011
- journal article
- research article
- Published by Springer Nature in Nature Reviews Genetics
- Vol. 12 (10) , 730-736
- https://doi.org/10.1038/nrg3067
Abstract
Meta-analyses that use data from several genome-wide association studies are revealing more disease-associated genetic variants, but how can the privacy of study participants be protected when data are shared? This article considers ways to evaluate the risk of loss of privacy. Access to genetic data across studies is an important aspect of identifying new genetic associations through genome-wide association studies (GWASs). Meta-analysis across multiple GWASs with combined cohort sizes of tens of thousands of individuals often uncovers many more genome-wide associated loci than the original individual studies; this emphasizes the importance of tools and mechanisms for data sharing. However, even sharing summary-level data, such as allele frequencies, inherently carries some degree of privacy risk to study participants. Here we discuss mechanisms and resources for sharing data from GWASs, particularly focusing on approaches for assessing and quantifying the privacy risks to participants that result from the sharing of summary-level data.Keywords
This publication has 43 references indexed in Scilit:
- Duplications of the neuropeptide receptor gene VIPR2 confer significant risk for schizophreniaNature, 2011
- A map of human genome variation from population-scale sequencingNature, 2010
- Biological, clinical and population relevance of 95 loci for blood lipidsNature, 2010
- The gene, environment association studies consortium (GENEVA): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditionsGenetic Epidemiology, 2010
- Finding the missing heritability of complex diseasesNature, 2009
- A new statistic and its power to infer membership in a genome-wide association study using genotype frequenciesNature Genetics, 2009
- The Population Reference Sample, POPRES: A Resource for Population, Disease, and Pharmacological Genetics ResearchPublished by Elsevier ,2008
- The NCBI dbGaP database of genotypes and phenotypesNature Genetics, 2007
- Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studiesNature Genetics, 2006
- The International HapMap ProjectNature, 2003