Top-k selection queries over relational databases

1 June 2002

journal article
Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems

Vol. 27 (2) , 153-187
https://doi.org/10.1145/568518.568519

Abstract

In many applications, users specify target values for certain attributes, without requiring exact matches to these values in return. Instead, the result to such queries is typically a rank of the "top k" tuples that best match the given attribute values. In this paper, we study the advantages and limitations of processing a top-k query by translating it into a single range query that a traditional relational database management system (RDBMS) can process efficiently. In particular, we study how to determine a range query to evaluate a top-k query by exploiting the statistics available to an RDBMS, and the impact of the quality of these statistics on the retrieval efficiency of the resulting scheme. We also report the first experimental evaluation of the mapping strategies over a real RDBMS, namely over Microsoft's SQL Server 7.0. The experiments show that our new techniques are robust and significantly more efficient than previously known strategies requiring at least one sequential scan of the data sets.
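The general idea of answering a top-k query through a range query can be illustrated with a minimal sketch. This is not the paper's algorithm: the table name `items`, its columns `x` and `y`, the function `top_k_via_range`, the Euclidean scoring function, and the doubling restart policy are all assumptions made for illustration. In particular, the starting `radius` is passed in by hand here, whereas the abstract describes choosing the range from the statistics kept by the RDBMS.

```python
import math
import sqlite3


def top_k_via_range(conn, target, k, radius, growth=2.0, max_restarts=20):
    """Return the k (distance, id) pairs of `items` closest to `target`.

    Hypothetical helper: `radius` must be positive and stands in for an
    estimate that a real system would derive from its statistics.
    """
    qx, qy = target
    for _ in range(max_restarts):
        # Range query: an axis-aligned box that encloses every tuple
        # within Euclidean distance `radius` of the target values.
        rows = conn.execute(
            "SELECT id, x, y FROM items "
            "WHERE x BETWEEN ? AND ? AND y BETWEEN ? AND ?",
            (qx - radius, qx + radius, qy - radius, qy + radius),
        ).fetchall()
        scored = sorted(
            (math.hypot(x - qx, y - qy), row_id) for row_id, x, y in rows
        )
        # Only tuples within `radius` are guaranteed to be ranked correctly;
        # box corners may hide closer tuples that lie outside the box.
        within = [(d, row_id) for d, row_id in scored if d <= radius]
        if len(within) >= k:
            return within[:k]
        radius *= growth  # too few matches: restart with a wider range
    raise RuntimeError("fewer than k tuples found within the restart limit")


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, x REAL, y REAL)")
    conn.executemany(
        "INSERT INTO items VALUES (?, ?, ?)",
        [(1, 0.0, 0.0), (2, 1.0, 0.0), (3, 5.0, 5.0), (4, 10.0, 10.0)],
    )
    print(top_k_via_range(conn, target=(0.0, 0.0), k=2, radius=0.5))
```

A range that is too small forces a restart with a wider range, while a range that is too large retrieves more tuples than needed, which is why the quality of the estimate behind the initial range matters for retrieval efficiency.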

This publication has 20 references indexed in Scilit:

STHoles
Published by Association for Computing Machinery (ACM), 2001
Self-tuning histograms
Published by Association for Computing Machinery (ACM), 1999
Multidimensional access methods
ACM Computing Surveys, 1998
CONTROL
Published by Association for Computing Machinery (ACM), 1998
Relaxing the Uniformity and Independence Assumptions Using the Concept of Fractal Dimension
Journal of Computer and System Sciences, 1997
On saying “Enough already!” in SQL
Published by Association for Computing Machinery (ACM), 1997
Optimizing queries over multimedia repositories
Published by Association for Computing Machinery (ACM), 1996
The hB-tree: a multiattribute indexing method with good guaranteed performance
ACM Transactions on Database Systems, 1990
VAGUE: a user interface to relational databases that permits vague queries
ACM Transactions on Information Systems, 1988
The Grid File
ACM Transactions on Database Systems, 1984