Optimizing top-k selection queries over multimedia repositories
- 2 August 2004
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering
- Vol. 16 (8) , 992-1009
- https://doi.org/10.1109/tkde.2004.30
Abstract
Repositories of multimedia objects having multiple types of attributes (e.g., image, text) are becoming increasingly common. A query on these attributes will typically, request not just a set of objects, as in the traditional relational query model (filtering), but also a grade of match associated with each object, which indicates how well the object matches the selection condition (ranking). Furthermore, unlike in the relational model, users may just want the k top-ranked objects for their selection queries for a relatively small k. In addition to the differences in the query model, another peculiarity of multimedia repositories is that they may allow access to the attributes of each object only through indexes. We investigate how to optimize the processing of top-k selection queries over multimedia repositories. The access characteristics of the repositories and the above query model lead to novel issues in query optimization. In particular, the choice of the indexes used to search the repository strongly influences the cost of processing the filtering condition. We define an execution space that is search-minimal, i.e., the set of indexes searched is minimal. Although the general problem of picking an optimal plan in the search-minimal execution space is NP-hard, we present an efficient algorithm that solves the problem optimally with respect to our cost model and execution space when the predicates in the query are independent. We also show that the problem of optimizing top-k selection queries can be viewed, in many cases, as that of evaluating more traditional selection conditions. Thus, both problems can be viewed together as an extended filtering problem to which techniques of query processing and optimization may be adapted.Keywords
This publication has 27 references indexed in Scilit:
- Single table access using multiple indexes: Optimization, execution, and concurrency control techniquesPublished by Springer Nature ,2005
- Content-based image retrieval at the end of the early yearsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- GlOSSACM Transactions on Database Systems, 1999
- Using Fagin's algorithm for merging ranked results in multimedia middlewarePublished by Institute of Electrical and Electronics Engineers (IEEE) ,1999
- Optimization techniques for queries with expensive methodsACM Transactions on Database Systems, 1998
- Supporting ranked Boolean similarity queries in MARSIEEE Transactions on Knowledge and Data Engineering, 1998
- Managing multimedia information in database systemsCommunications of the ACM, 1997
- Optimizing disjunctive queries with expensive predicatesACM SIGMOD Record, 1994
- QBIC project: querying images by content, using color, texture, and shapePublished by SPIE-Intl Soc Optical Eng ,1993
- On the optimal nesting order for computing N -relational joinsACM Transactions on Database Systems, 1984