Accurate estimation of the cost of spatial selections
- 7 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 123-134
- https://doi.org/10.1109/icde.2000.839399
Abstract
Optimizing queries that involve operations on spatial data requires estimating the selectivity and cost of these operations. In this paper, we focus on estimating the cost of spatial selections, or window queries, where the query windows and data objects are general polygons. Cost estimation techniques previously proposed in the literature only handle rectangular query windows over rectangular data objects, thus ignoring the very significant cost of exact geometry comparison (the refinement step in a “filter and refine” query processing strategy). The cost of the exact geometry comparison depends on the selectivity of the filtering step and the average number of vertices in the candidate objects identified by this step. In this paper, we introduce a new type of histogram for spatial data that captures the complexity and size of the spatial objects as well as their location. Capturing these attributes makes this type of histogram useful for accurate estimation, as we experimentally demonstrate. We also investigate sampling-based estimation approaches. Sampling can yield better selectivity estimates than histograms for polygon data, but at the high cost of performing exact geometry comparisons for all the sampled objectsKeywords
This publication has 18 references indexed in Scilit:
- Building a scaleable geo-spatial DBMSACM SIGMOD Record, 1997
- A model for the prediction of R-tree performancePublished by Association for Computing Machinery (ACM) ,1996
- Improved histograms for selectivity estimation of range predicatesACM SIGMOD Record, 1996
- Partition based spatial-merge joinACM SIGMOD Record, 1996
- Multi-step processing of spatial joinsPublished by Association for Computing Machinery (ACM) ,1994
- Beyond uniformity and independencePublished by Association for Computing Machinery (ACM) ,1994
- On packing R-treesPublished by Association for Computing Machinery (ACM) ,1993
- Equi-depth multidimensional histogramsACM SIGMOD Record, 1988
- The Quadtree and Related Hierarchical Data StructuresACM Computing Surveys, 1984
- R-treesPublished by Association for Computing Machinery (ACM) ,1984