Integrating life sciences data-with a little Garlic
- 8 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Vast amounts of life sciences data today reside in specialized data sources, with specialized query processing capabilities. Data from one source must often be combined with data from other sources to give users the information they desire. Database middleware systems such as Garlic allow users to combine data from multiple sources in a single query. Garlic provides the user with a virtual database to which they can pose arbitrarily complex queries, though the actual data needed to answer the query may be stored in several different sources, and those sources may not even possess all the functionality needed to answer such a query themselves. The Garlic technology, as incorporated in IBM's DB2 product, forms the basis of the DiscoveryLink service offering for the life sciences industry. We describe the DiscoveryLink offering, focusing on two key contributions of Garlic, the wrapper architecture and the query optimizer, and illustrate how it can be used to integrate life sciences data from heterogeneous data sources.Keywords
This publication has 17 references indexed in Scilit:
- Object exchange across heterogeneous information sourcesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000Nucleic Acids Research, 2000
- Query caching and optimization in distributed mediator systemsACM SIGMOD Record, 1996
- Optimizing queries over multimedia repositoriesACM SIGMOD Record, 1996
- Object-oriented extensions in SQL3Published by Association for Computing Machinery (ACM) ,1994
- Query evaluation techniques for large databasesACM Computing Surveys, 1993
- Description of several chemical structure file formats used by computer programs developed at Molecular Design LimitedJournal of Chemical Information and Computer Sciences, 1992
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990
- SMILES. 2. Algorithm for generation of unique SMILES notationJournal of Chemical Information and Computer Sciences, 1989
- Access path selection in a relational database management systemPublished by Association for Computing Machinery (ACM) ,1979