BIOFILTER: A KNOWLEDGE-INTEGRATION SYSTEM FOR THE MULTI-LOCUS ANALYSIS OF GENOME-WIDE ASSOCIATION STUDIES
Open Access
- 1 November 2008
- proceedings article
- Published by World Scientific Pub Co Pte Ltd in Pacific Symposium on Biocomputing
Abstract
Genome-wide association studies provide an unprecedented opportunity to identify combinations of genetic variants that contribute to disease susceptibility. The combinatorial problem of jointly analyzing the millions of genetic variations accessible by high-throughput genotyping technologies is a difficult challenge. One approach to reducing the search space of this variable selection problem is to assess specific combinations of genetic variations based on prior statistical and biological knowledge. In this work, we provide a systematic approach to integrate multiple public databases of gene groupings and sets of disease-related genes to produce multi-SNP models that have an established biological foundation. This approach yields a collection of models which can be tested statistically in genome-wide data, along with an ordinal quantity describing the number of data sources that support any given model. Using this knowledge-driven approach reduces the computational and statistical burden of large-scale interaction analysis while simultaneously providing a biological foundation for the relevance of any significant statistical result that is found.Keywords
This publication has 14 references indexed in Scilit:
- KEGG for linking genomes to life and the environmentNucleic Acids Research, 2007
- GATHERING THE GOLD DUST: METHODS FOR ASSESSING THE AGGREGATE IMPACT OF SMALL EFFECT GENES IN GENOMIC SCANSPacific Symposium on Biocomputing, 2007
- Prioritized Subset Analysis: Improving Power in Genome-wide Association StudiesHuman Heredity, 2007
- Reactome: a knowledge base of biologic pathways and processesGenome Biology, 2007
- The Genetic Association DatabaseNature Genetics, 2004
- Mapping complex disease loci in whole-genome association studiesNature, 2004
- Routine discovery of complex genetic models using genetic algorithmsApplied Soft Computing, 2003
- A Perspective on Epistasis: Limits of Models Displaying No Main EffectAmerican Journal of Human Genetics, 2002
- DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactionsNucleic Acids Research, 2002
- Who's afraid of epistasis?Nature Genetics, 1996