SAGA: a subgraph matching tool for biological graphs
Open Access
- 16 November 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (2) , 232-239
- https://doi.org/10.1093/bioinformatics/btl571
Abstract
Motivation: With the rapid increase in the availability of biological graph datasets, there is a growing need for effective and efficient graph querying methods. Due to the noisy and incomplete characteristics of these datasets, exact graph matching methods have limited use and approximate graph matching methods are required. Unfortunately, existing graph matching methods are too restrictive as they only allow exact or near exact graph matching. This paper presents a novel approximate graph matching technique called SAGA. This technique employs a flexible model for computing graph similarity, which allows for node gaps, node mismatches and graph structural differences. SAGA employs an indexing technique that allows it to efficiently evaluate queries even against large graph datasets. Results: SAGA has been used to query biological pathways and literature datasets, which has revealed interesting similarities between distinct pathways that cannot be found by existing methods. These matches associate seemingly unrelated biological processes, connect studies in different sub-areas of biomedical research and thus pose hypotheses for new discoveries. SAGA is also orders of magnitude faster than existing methods. Availability: SAGA can be accessed freely via the web at . Binaries are also freely available at this website. Contact:jignesh@eecs.umich.edu Supplementary material: Supplementary material is available at .Keywords
This publication has 14 references indexed in Scilit:
- From genomics to chemical genomics: new developments in KEGGNucleic Acids Research, 2006
- Conserved patterns of protein interaction in multiple speciesProceedings of the National Academy of Sciences, 2005
- Reactome: a knowledgebase of biological pathwaysNucleic Acids Research, 2004
- A high-throughput assay for Tn5 Tnp-induced DNA cleavageNucleic Acids Research, 2004
- PathAlignerApplied Bioinformatics, 2004
- Wnts and Hedgehogs: lipid-modified proteins and similarities in signaling mechanisms at the cell surfaceDevelopment, 2003
- P18(Ink4c) Collaborates With Other Cdk–Inhibitory Proteins in the Regenerating LiverHepatology, 2003
- Similarities between the Hedgehog and Wnt signaling pathwaysTrends in Cell Biology, 2002
- Algorithm 457: finding all cliques of an undirected graphCommunications of the ACM, 1973
- The Biochemistry of Affective DisordersThe British Journal of Psychiatry, 1967