Bayesian method to predict individual SNP genotypes from gene expression data
- 8 April 2012
- journal article
- research article
- Published by Springer Nature in Nature Genetics
- Vol. 44 (5) , 603-608
- https://doi.org/10.1038/ng.2248
Abstract
Eric Schadt and colleagues report a Bayesian method to predict individual SNP genotypes based on RNA expression data. Using simulations and empirical data sets, they show that it is possible to infer a genotypic barcode specific to an individual, although the identification of an individual as a participant in a study is limited by factors such as the availability of large-scale expression quantitative trait loci (eQTLs) and expression data sets. RNA profiling can be used to capture the expression patterns of many genes that are associated with expression quantitative trait loci (eQTLs). Employing published putative cis eQTLs, we developed a Bayesian approach to predict SNP genotypes that is based only on RNA expression data. We show that predicted genotypes can accurately and uniquely identify individuals in large populations. When inferring genotypes from an expression data set using eQTLs of the same tissue type (but from an independent cohort), we were able to resolve 99% of the identities of individuals in the cohort at Padjusted ≤ 1 × 10−5. When eQTLs derived from one tissue were used to predict genotypes using expression data from a different tissue, the identities of 90% of the study subjects could be resolved at Padjusted ≤ 1 × 10−5. We discuss the implications of deriving genotypic information from RNA data deposited in the public domain.Keywords
This publication has 30 references indexed in Scilit:
- Hundreds of variants clustered in genomic loci and biological pathways affect human heightNature, 2010
- Liver and Adipose Expression Associated SNPs Are Enriched for Association to Type 2 DiabetesPLoS Genetics, 2010
- NCBI GEO: archive for high-throughput functional genomic dataNucleic Acids Research, 2009
- Gene Expression in Fixed Tissues and Outcome in Hepatocellular CarcinomaNew England Journal of Medicine, 2008
- Mapping the Genetic Architecture of Gene Expression in Human LiverPLoS Biology, 2008
- Genetics of gene expression and its effect on diseaseNature, 2008
- Comparison of the Agilent, ROMA/NimbleGen and Illumina platforms for classification of copy number alterations in human breast tumorsBMC Genomics, 2008
- Identification and Validation of a Novel Gene Signature Associated with the Recurrence of Human Hepatocellular CarcinomaClinical Cancer Research, 2007
- Genetics of gene expression surveyed in maize, mouse and manNature, 2003
- Gene-expression profiles predict survival of patients with lung adenocarcinomaNature Medicine, 2002