The Comparative Toxicogenomics Database facilitates identification and understanding of chemical-gene-disease associations: arsenic as a case study
Open Access
- 9 October 2008
- journal article
- research article
- Published by Springer Nature in BMC Medical Genomics
- Vol. 1 (1) , 48
- https://doi.org/10.1186/1755-8794-1-48
Abstract
Background: The etiology of many chronic diseases involves interactions between environmental factors and genes that modulate physiological processes. Understanding interactions between environmental chemicals and genes/proteins may provide insights into the mechanisms of chemical actions, disease susceptibility, toxicity, and therapeutic drug interactions. The Comparative Toxicogenomics Database (CTD; http://ctd.mdibl.org) provides these insights by curating and integrating data describing relationships between chemicals, genes/proteins, and human diseases. To illustrate the scope and application of CTD, we present an analysis of curated data for the chemical arsenic. Arsenic represents a major global environmental health threat and is associated with many diseases. The mechanisms by which arsenic modulates these diseases are not well understood. Methods: Curated interactions between arsenic compounds and genes were downloaded using export and batch query tools at CTD. The list of genes was analyzed for molecular interactions, Gene Ontology (GO) terms, KEGG pathway annotations, and inferred disease relationships. Results: CTD contains curated data from the published literature describing 2,738 molecular interactions between 21 different arsenic compounds and 1,456 genes and proteins. Analysis of these genes and proteins provide insight into the biological functions and molecular networks that are affected by exposure to arsenic, including stress response, apoptosis, cell cycle, and specific protein signaling pathways. Integrating arsenic-gene data with gene-disease data yields a list of diseases that may be associated with arsenic exposure and genes that may explain this association. Conclusion: CTD data integration and curation strategies yield insight into the actions of environmental chemicals and provide a basis for developing hypotheses about the molecular mechanisms underlying the etiology of environmental diseases. While many reports describe the molecular response to arsenic, CTD integrates these data with additional curated data sets that facilitate construction of chemical-gene-disease networks and provide the groundwork for investigating the molecular basis of arsenic-associated diseases or toxicity. The analysis reported here is extensible to any environmental chemical or therapeutic drug.This publication has 28 references indexed in Scilit:
- Environmental epigenomics in human health and diseaseEnvironmental and Molecular Mutagenesis, 2008
- KEGG for linking genomes to life and the environmentNucleic Acids Research, 2007
- Arsenic in the environment: Biology and ChemistryScience of The Total Environment, 2007
- Environmental pollutants and breast cancerCancer, 2007
- Environmental Biology and Human DiseaseScience, 2007
- Progress in the epidemiological understanding of gene–environment interactions in major diseases: cancerComptes Rendus Biologies, 2007
- The Comparative Toxicogenomics Database: A Cross-Species Resource for Building Chemical-Gene Interaction NetworksToxicological Sciences, 2006
- Modeling the Probability of Arsenic in Groundwater in New England as a Tool for Exposure AssessmentEnvironmental Science & Technology, 2006
- Arsenic: In Search of an Antidote to a Global PoisonEnvironmental Health Perspectives, 2005
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000