Identifying Personal Genomes by Surname Inference
- 18 January 2013
- journal article
- research article
- Published by American Association for the Advancement of Science (AAAS) in Science
- Vol. 339 (6117), 321-324
- https://doi.org/10.1126/science.1229566
Abstract
Anonymity Compromised: The balance between maintaining individual privacy and sharing genomic information for research purposes has been a topic of considerable controversy. Gymrek et al. (p. 321; see the Policy Forum by Rodriguez et al.) demonstrate that the anonymity of participants (and their families) can be compromised by analyzing Y-chromosome sequences from public genetic genealogy Web sites that contain (sometimes distant) relatives with the same surname. Short tandem repeats (STRs) on the Y chromosome of a target individual (whose sequence was freely available and identified in GenBank) were compared with information in public genealogy Web sites to determine the shortest time to the most recent common ancestor and find the most likely surname, which, when combined with age and state of residency, identified the individual. When STRs from 911 individuals were used as the starting points, the analysis projected a success rate of 12% within the U.S. male population with Caucasian ancestry. Further analysis of detailed pedigrees from one collection revealed that families of individuals whose genomes are in public repositories could be identified with high probability.
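The matching step described in the abstract (compare a target's Y-STR profile with surname-labelled genealogy records and keep the record with the shortest estimated time to the most recent common ancestor) can be illustrated with a toy sketch. This is not the authors' implementation. It swaps in a simple average-squared-distance estimate under a stepwise mutation model, where the expected squared repeat-count difference per marker is roughly 2·μ·t for two lineages that diverged t generations ago. The mutation rate, the `Record` type, the function names, the surnames, and the repeat counts are all hypothetical values chosen for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# Assumed average mutation rate per marker per generation (illustrative only).
MUTATION_RATE = 0.002


@dataclass
class Record:
    """A hypothetical genealogy-database entry: a surname plus a Y-STR profile."""
    surname: str
    profile: Dict[str, int]  # marker name -> repeat count


def estimate_tmrca(a: Dict[str, int], b: Dict[str, int],
                   mu: float = MUTATION_RATE) -> Optional[float]:
    """Rough TMRCA in generations from the average squared distance (ASD).

    Uses ASD ~= 2 * mu * t on the markers both profiles share.
    Returns None when the profiles have no marker in common.
    """
    shared = a.keys() & b.keys()
    if not shared:
        return None
    asd = sum((a[m] - b[m]) ** 2 for m in shared) / len(shared)
    return asd / (2 * mu)


def best_surname(target: Dict[str, int], records: List[Record],
                 min_shared: int = 3) -> Optional[Tuple[str, float]]:
    """Return (surname, estimated TMRCA) for the closest record, or None."""
    best: Optional[Tuple[str, float]] = None
    for rec in records:
        # Skip records that share too few markers for a meaningful comparison.
        if len(target.keys() & rec.profile.keys()) < min_shared:
            continue
        t = estimate_tmrca(target, rec.profile)
        if t is not None and (best is None or t < best[1]):
            best = (rec.surname, t)
    return best


# Made-up example data: marker names are real Y-STR loci, values are invented.
target = {"DYS19": 14, "DYS390": 24, "DYS391": 10}
records = [
    Record("Smith", {"DYS19": 14, "DYS390": 24, "DYS391": 11}),
    Record("Jones", {"DYS19": 14, "DYS390": 22, "DYS391": 10}),
]
match = best_surname(target, records)
```

In this toy data the "Smith" record differs from the target by one repeat at one marker, while "Jones" differs by two repeats, so the sketch ranks "Smith" as the closer lineage. A realistic analysis would need many more markers, marker-specific mutation rates, and a confidence threshold before reporting any surname.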
This publication has 22 references indexed in Scilit:
- lobSTR: A short tandem repeat profiler for personal genomes. Genome Research, 2012
- On Sharing Quantitative Trait GWAS Results in an Era of Multiple-omics Data and the Limits of Genomic Privacy. American Journal of Human Genetics, 2012
- Assessing and managing risk when sharing aggregate genetic variant data. Nature Reviews Genetics, 2011
- A map of human genome variation from population-scale sequencing. Nature, 2010
- A new statistic and its power to infer membership in a genome-wide association study using genotype frequencies. Nature Genetics, 2009
- Founders, Drift, and Infidelity: The Relationship between Y Chromosome Diversity and Patrilineal Surnames. Molecular Biology and Evolution, 2009
- Inferential Genotyping of Y Chromosomes in Latter-Day Saints Founders and Comparison to Utah Samples in the HapMap Project. American Journal of Human Genetics, 2009
- Resolving Individuals Contributing Trace Amounts of DNA to Highly Complex Mixtures Using High-Density SNP Genotyping Microarrays. PLoS Genetics, 2008
- The Diploid Genome Sequence of an Individual Human. PLoS Biology, 2007
- Variation of 52 new Y-STR loci in the Y Chromosome Consortium worldwide panel of 76 diverse individuals. International Journal of Legal Medicine, 2006