TranscriptomeBrowser: A Powerful and Flexible Toolbox to Explore Productively the Transcriptional Landscape of the Gene Expression Omnibus Database
Open Access
- 23 December 2008
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 3 (12) , e4001
- https://doi.org/10.1371/journal.pone.0004001
Abstract
As public microarray repositories are constantly growing, we are facing the challenge of designing strategies to provide productive access to the available data. We used a modified version of the Markov clustering algorithm to systematically extract clusters of co-regulated genes from hundreds of microarray datasets stored in the Gene Expression Omnibus database (n = 1,484). This approach led to the definition of 18,250 transcriptional signatures (TS) that were tested for functional enrichment using the DAVID knowledgebase. Over-representation of functional terms was found in a large proportion of these TS (84%). We developed a JAVA application, TBrowser that comes with an open plug-in architecture and whose interface implements a highly sophisticated search engine supporting several Boolean operators (http://tagc.univ-mrs.fr/tbrowser/). User can search and analyze TS containing a list of identifiers (gene symbols or AffyIDs) or associated with a set of functional terms. As proof of principle, TBrowser was used to define breast cancer cell specific genes and to detect chromosomal abnormalities in tumors. Finally, taking advantage of our large collection of transcriptional signatures, we constructed a comprehensive map that summarizes gene-gene co-regulations observed through all the experiments performed on HGU133A Affymetrix platform. We provide evidences that this map can extend our knowledge of cellular signaling pathways.Keywords
This publication has 25 references indexed in Scilit:
- DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysisBMC Bioinformatics, 2007
- [19] Gene Expression Omnibus: Microarray Data Storage, Submission, Retrieval, and AnalysisPublished by Elsevier ,2006
- Global landscape of protein complexes in the yeast Saccharomyces cerevisiaeNature, 2006
- How does gene expression clustering work?Nature Biotechnology, 2005
- GeneMCL in microarray analysisComputational Biology and Chemistry, 2005
- ArrayExpress--a public repository for microarray gene expression data at the EBINucleic Acids Research, 2004
- SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression dataNucleic Acids Research, 2003
- Microarray databases: standards and ontologiesNature Genetics, 2002
- An efficient algorithm for large-scale detection of protein familiesNucleic Acids Research, 2002
- Exploring Expression Data: Identification and Analysis of Coexpressed GenesGenome Research, 1999