A Life Scientist's Gateway to Distributed Data Management and Computing: The PathPort/ToolBus Framework
- 1 January 2003
- journal article
- research article
- Published by Mary Ann Liebert Inc in OMICS: A Journal of Integrative Biology
- Vol. 7 (1) , 79-88
- https://doi.org/10.1089/153623103322006661
Abstract
The emergent needs of the bioinformatics community challenge current information systems. The pace of biological data generation far outstrips Moore's Law. Therefore, a gap continues to widen between the capabilities to produce biological (molecular and cell) data sets and the capability to manage and analyze these data sets. As a result, Federal investments in large data set generation produces diminishing returns in terms of the community's capabilities of understanding biology and leveraging that understanding to make scientific and technological advances that improve society. We are building an open framework to address various data management issues including data and tool interoperability, nomenclature and data communication standardization, and database integration. PathPort, short for Pathogen Portal, employs a generic, web-services based framework to deal with some of the problems identified by the bioinformatics community. The motivating research goal of a scalable system to provide data management and analysis for key pathosystems, especially relating to molecular data, has resulted in a generic framework using two major components. On the server-side, we employ web-services. On the client-side, a Java application called ToolBus acts as a client-side "bus" for contacting data and tools and viewing results through a single, consistent user interface.Keywords
This publication has 4 references indexed in Scilit:
- Creating the Gene Ontology Resource: Design and ImplementationGenome Research, 2001
- The InterPro database, an integrated documentation resource for protein families, domains and functional sitesNucleic Acids Research, 2001
- The COG database: new developments in phylogenetic classification of proteins from complete genomesNucleic Acids Research, 2001
- A spiral model of software development and enhancementComputer, 1988