KEGG as a glycome informatics resource
Open Access
- 1 May 2006
- journal article
- review article
- Published by Oxford University Press (OUP) in Glycobiology
- Vol. 16 (5), 63R-70R
- https://doi.org/10.1093/glycob/cwj010
Abstract
Bioinformatics approaches to carbohydrate research have recently begun using large amounts of protein and carbohydrate data. In this field, called glycome informatics, the foremost necessity is a comprehensive resource for genome-scale bioinformatics analysis of glycan data. Although accumulated experimental data may be useful as a reference for biological and biochemical information on carbohydrates, they are insufficient for bioinformatics analysis. Thus, we have developed a glycome informatics resource (http://www.genome.jp/kegg/glycan/) in KEGG (Kyoto Encyclopedia of Genes and Genomes), an integrated knowledge base of protein networks, genomic information, and chemical information. This review describes three noteworthy features: (1) GLYCAN, a database of carbohydrate structures; (2) glycan-related pathways; and (3) Composite Structure Map (CSM), a map illustrating all possible variations of carbohydrate structures within organisms. GLYCAN includes two useful tools: an intuitive drawing tool called KegDraw, and an efficient glycan search and alignment tool called KEGG Carbohydrate Matcher (KCaM). KEGG’s glycan biosynthesis and metabolism pathways, which integrate carbohydrate structures, proteins, and reactions, are also a pivotal resource. CSM is constructed as a bridge between carbohydrate functions and structures; it can display, for example, expression data of glycosyltransferases in a compact manner. In all the KEGG resources, various objects, including KEGG pathways, chemical compounds, and carbohydrate structures, are commonly represented as graphs, which are widely studied and utilized in the computer science field.
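The abstract's closing point, that carbohydrate structures are represented as graphs, can be illustrated with a minimal sketch. It is not KEGG's own data format, nor the KCaM algorithm: the class `Residue` and the helpers `composition`, `edges`, and `build_n_glycan_core` are hypothetical names chosen for illustration. The example encodes the well-known N-glycan core pentasaccharide, Man(a1-3)[Man(a1-6)]Man(b1-4)GlcNAc(b1-4)GlcNAc, as a rooted tree whose nodes are monosaccharides and whose edges carry glycosidic linkage labels.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Residue:
    """One monosaccharide node; `children` holds (linkage, child) edges.

    Hypothetical illustration only, not a KEGG data structure.
    """
    name: str
    children: list[tuple[str, "Residue"]] = field(default_factory=list)

    def add(self, linkage: str, child: "Residue") -> "Residue":
        """Attach `child` through a labelled glycosidic linkage and return it."""
        self.children.append((linkage, child))
        return child


def composition(root: Residue) -> Counter:
    """Count residues by name with an iterative depth-first traversal."""
    counts: Counter = Counter()
    stack = [root]
    while stack:
        node = stack.pop()
        counts[node.name] += 1
        stack.extend(child for _, child in node.children)
    return counts


def edges(root: Residue) -> list[tuple[str, str, str]]:
    """List (parent, linkage, child) triples, the labelled edges of the tree."""
    out = []
    stack = [root]
    while stack:
        node = stack.pop()
        for linkage, child in node.children:
            out.append((node.name, linkage, child.name))
            stack.append(child)
    return out


def build_n_glycan_core() -> Residue:
    """Build the N-glycan core, rooted at the reducing-end GlcNAc."""
    root = Residue("GlcNAc")
    second_glcnac = root.add("b1-4", Residue("GlcNAc"))
    branch_point = second_glcnac.add("b1-4", Residue("Man"))
    branch_point.add("a1-3", Residue("Man"))
    branch_point.add("a1-6", Residue("Man"))
    return root
```

Rooting the tree at the reducing end makes branching explicit: the central mannose simply has two children. Graph operations such as subtree matching could be built on top of a representation like this, though how KCaM actually does so is not described here.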