A score matrix to reveal the hidden links in glycans
- 7 December 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (8), 1457-1463
- https://doi.org/10.1093/bioinformatics/bti193
Abstract
Glycans are the third major class of biomolecules, following DNA and proteins. They are vital for the functioning of multicellular organisms. However, compared with the rapid development of sequence analysis techniques, informatics work on glycans has a long way to go. Alignment algorithms for glycan tree structures are one of the foremost concerns. In addition, the statistical analysis of these algorithms in terms of biological significance needs to be addressed. We developed a tree-structure alignment algorithm for glycans and performed a statistical analysis of the resulting alignment scores so that biologically interesting features could be captured in a score matrix for glycans. We generated our score matrix in a manner similar to BLOSUM, but with slight variations to accommodate our glycan data, including the incorporation of linkage information. We verified the effectiveness of the new glycan score matrix by illustrating how well its entries correspond with biological knowledge. We also discuss future work: because glycans are complex, using a variety of score matrices for different subclasses of glycans may bring further improvements.
Contact: mami@kuicr.kyoto-u.ac.jp
Availability: The glycan score matrix can be downloaded from http://kanehisa.kuicr.kyoto-u.ac.jp/Paper/kcam/glycanMatrix0.1.txt
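The abstract says the score matrix was generated in a manner similar to BLOSUM. As a rough illustration of that general idea only, the sketch below computes standard BLOSUM-style log-odds scores from a list of aligned label pairs. It is not the authors' procedure: the function name `log_odds_matrix`, the `scale` parameter, the toy pair counts, and the label format that joins a monosaccharide with its linkage (such as `Gal:b1-4`) are all assumptions made for this example.

```python
import math
from collections import Counter


def log_odds_matrix(aligned_pairs, scale=2.0):
    """Return BLOSUM-style log-odds scores for unordered label pairs.

    aligned_pairs: iterable of (label_a, label_b) tuples taken from
    alignments. The labels here are hypothetical; the paper's actual
    encoding of monosaccharides and linkages may differ.
    """
    # Count each unordered pair once, so (a, b) and (b, a) share a key.
    pair_counts = Counter()
    for a, b in aligned_pairs:
        pair_counts[tuple(sorted((a, b)))] += 1
    total = sum(pair_counts.values())

    # Observed frequency q_ab of each unordered pair.
    q = {pair: n / total for pair, n in pair_counts.items()}

    # Background frequency p_a: identical pairs count fully,
    # mixed pairs contribute half to each member.
    p = Counter()
    for (a, b), freq in q.items():
        if a == b:
            p[a] += freq
        else:
            p[a] += freq / 2
            p[b] += freq / 2

    # Score = scale * log2(observed / expected), rounded to an integer.
    scores = {}
    for (a, b), freq in q.items():
        expected = p[a] * p[b] if a == b else 2 * p[a] * p[b]
        scores[(a, b)] = round(scale * math.log2(freq / expected))
    return scores


# Toy input (invented counts, not data from the paper).
toy_pairs = (
    [("Gal:b1-4", "Gal:b1-4")] * 6
    + [("Man:a1-3", "Man:a1-3")] * 2
    + [("Gal:b1-4", "Man:a1-3")] * 2
)
example_scores = log_odds_matrix(toy_pairs)
```

In this scheme, pairs that align more often than their background frequencies predict receive positive scores, and pairs that align less often receive negative ones. Treating each monosaccharide together with its linkage as a single label is only one possible way to bring linkage information into such a matrix.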