A schema-based approach to building a bioinformatics database federation
- 8 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Developments in our ability to integrate and analyse the data held in existing heterogeneous data resources can lead to an increase in our understanding of biological function at all levels. However, supporting ad-hoc queries across multiple data resources and correlating the data retrieved from these is still difficult. To address this, we are building a mediator based on the functional data model database, P/FDM, which integrates access to heterogeneous, distributed biological databases, while making use of existing search engines and indexes, without infringing on the autonomy of the underlying databases. Central to our design philosophy is the use of schemas. We have adopted a federated architecture with a five-level schema, arising from the use of the ANSI-SPARC three-level schema to describe both the existing autonomous data resources and the mediator itself. We describe the use of mapping functions and list comprehensions in query splitting, producing execution plans, code generation and result fusion. We give an example of cross-database querying involving data held locally in P/FDM systems and external data in the Sequence Retrieval System (SRS).Keywords
This publication has 13 references indexed in Scilit:
- SFINKS: Secure Focused Information, News, and Knowledge SharingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- A Graph-Oriented Model for Articulation of Ontology InterdependenciesPublished by Springer Nature ,2000
- Query processing in the TAMBIS bioinformatics source integration systemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1999
- The Evolving Role of Constraints in the Functional Data ModelJournal of Intelligent Information Systems, 1999
- Bioinformatics: Essential Infrastructure for Global Biology1Journal of Computational Biology, 1996
- Efficient access to FDM objects stored in a relational databasePublished by Springer Nature ,1994
- SRS—an indexing and retrieval tool for flat file data librariesBioinformatics, 1993
- Mediators in the architecture of future information systemsComputer, 1992
- Federated database systems for managing distributed, heterogeneous, and autonomous databasesACM Computing Surveys, 1990
- The functional data model and the data languages DAPLEXACM Transactions on Database Systems, 1981