ToppGene Suite for gene list enrichment analysis and candidate gene prioritization
Top Cited Papers
Open Access
- 22 May 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (Web Server) , W305-W311
- https://doi.org/10.1093/nar/gkp427
Abstract
ToppGene Suite (http://toppgene.cchmc.org; this web site is free and open to all users and does not require a login to access) is a one-stop portal for (i) gene list functional enrichment, (ii) candidate gene prioritization using either functional annotations or network analysis and (iii) identification and prioritization of novel disease candidate genes in the interactome. Functional annotation-based disease candidate gene prioritization uses a fuzzy-based similarity measure to compute the similarity between any two genes based on semantic annotations. The similarity scores from individual features are combined into an overall score using statistical meta-analysis. A P-value of each annotation of a test gene is derived by random sampling of the whole genome. The protein–protein interaction network (PPIN)-based disease candidate gene prioritization uses social and Web networks analysis algorithms (extended versions of the PageRank and HITS algorithms, and the K-Step Markov method). We demonstrate the utility of ToppGene Suite using 20 recently reported GWAS-based gene–disease associations (including novel disease genes) representing five diseases. ToppGene ranked 19 of 20 (95%) candidate genes within the top 20%, while ToppNet ranked 12 of 16 (75%) candidate genes among the top 20%.Keywords
This publication has 41 references indexed in Scilit:
- Prioritization of Positional Candidate Genes Using Multiple Web-Based Software ToolsTwin Research and Human Genetics, 2007
- Improved human disease candidate gene prioritization using mouse phenotypeBMC Bioinformatics, 2007
- Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databasesBMC Bioinformatics, 2007
- Protein interactions and disease: computational approaches to uncover the etiology of diseasesBriefings in Bioinformatics, 2007
- Candidate Gene Identification Approach: Progress and ChallengesInternational Journal of Biological Sciences, 2007
- Computational disease gene identification: a concert of methods prioritizes type 2 diabetes and obesity candidate genesNucleic Acids Research, 2006
- Gene prioritization through genomic data fusionNature Biotechnology, 2006
- SUSPECTS: enabling fast and effective prioritization of positional candidatesBioinformatics, 2006
- A Human Protein-Protein Interaction Network: A Resource for Annotating the ProteomePublished by Elsevier ,2005
- Disruption of Abcc6 in the mouse: novel insight in the pathogenesis of pseudoxanthoma elasticumHuman Molecular Genetics, 2005