The pyramid-technique
- 1 June 1998
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMOD Record
- Vol. 27 (2) , 142-153
- https://doi.org/10.1145/276305.276318
Abstract
In this paper, we propose the Pyramid-Technique, a new indexing method for high-dimensional data spaces. The Pyramid-Technique is highly adapted to range query processing using the maximum metric L max . In contrast to all other index structures, the performance of the Pyramid-Technique does not deteriorate when processing range queries on data of higher dimensionality. The Pyramid-Technique is based on a special partitioning strategy which is optimized for high-dimensional data. The basic idea is to divide the data space first into 2d pyramids sharing the center point of the space as a top. In a second step, the single pyramids are cut into slices parallel to the basis of the pyramid. These slices from the data pages. Furthermore, we show that this partition provides a mapping from the given d-dimensional space to a 1-dimensional space. Therefore, we are able to use a B+-tree to manage the transformed data. As an analytical evaluation of our technique for hypercube range queries and uniform data distribution shows, the Pyramid-Technique clearly outperforms index structures using other partitioning strategies. To demonstrate the practical relevance of our technique, we experimentally compared the Pyramid-Technique with the X-tree, the Hilbert R-tree, and the Linear Scan. The results of our experiments using both, synthetic and real data, demonstrate that the Pyramid-Technique outperforms the X-tree and the Hilbert R-tree by a factor of up to 14 (number of page accesses) and up to 2500 (total elapsed time) for range queries.Keywords
This publication has 11 references indexed in Scilit:
- A cost model for nearest neighbor search in high-dimensional data spacePublished by Association for Computing Machinery (ACM) ,1997
- The SR-treePublished by Association for Computing Machinery (ACM) ,1997
- Data warehousing and OLAP for decision supportPublished by Association for Computing Machinery (ACM) ,1997
- Efficient and effective Querying by Image ContentJournal of Intelligent Information Systems, 1994
- A retrieval technique for similar shapesPublished by Association for Computing Machinery (ACM) ,1991
- The R*-tree: an efficient and robust access method for points and rectanglesPublished by Association for Computing Machinery (ACM) ,1990
- The K-D-B-treePublished by Association for Computing Machinery (ACM) ,1981
- Ubiquitous B-TreeACM Computing Surveys, 1979
- An Algorithm for Finding Best Matches in Logarithmic Expected TimeACM Transactions on Mathematical Software, 1977
- Organization and maintenance of large ordered indexesActa Informatica, 1972