MONDRIAN: Annotating and Querying Databases through Colors and Blocks
- 1 January 2006
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Annotations play a central role in the curation of scientific databases. Despite their importance, data formats and schemas are not designed to manage the increasing variety of annotations. Moreover, DBMS’s often lack support for storing and querying annotations. Furthermore, annotations and data are only loosely coupled. This paper introduces an annotation-oriented data model for the manipulation and querying of both data and annotations. In particular, the model allows for the specification of annotations on sets of values and for effectively querying the information on their association. We use the concept of block to represent an annotated set of values. Different colors applied to the blocks represent different annotations. We introduce a color query language for our model and prove it to be both complete (it can express all possible queries over the class of annotated databases), and minimal (all the algebra operators are primitive). We present MONDRIAN, a prototype implementation of our annotation mechanism, and we conduct experiments that investigate the set of parameters which influence the evaluation cost for color queries.Keywords
This publication has 14 references indexed in Scilit:
- MONDRIAN: Annotating and Querying Databases through Colors and BlocksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- DBNotesPublished by Association for Computing Machinery (ACM) ,2005
- Colorful XMLPublished by Association for Computing Machinery (ACM) ,2004
- Containment of Relational Queries with Annotation PropagationPublished by Springer Nature ,2004
- The Gene Ontology (GO) database and informatics resourceNucleic Acids Research, 2004
- The UCSC Genome Browser DatabaseNucleic Acids Research, 2003
- Annotea: an open RDF infrastructure for shared Web annotationsComputer Networks, 2002
- On propagation of deletions and annotations through viewsPublished by Association for Computing Machinery (ACM) ,2002
- Tracing the lineage of view data in a warehousing environmentACM Transactions on Database Systems, 2000
- Challenges in Integrating Biological Data SourcesJournal of Computational Biology, 1995