Keyword proximity search on XML graphs
- 13 May 2004
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
XKeyword provides efficient keyword proximity queries on large XML graph databases. A query is simply a list of keywords and does not require any schema or query language knowledge for its formulation. XKeyword is built on a relational database and, hence, can accommodate very large graphs. Query evaluation is optimized by using the graph's schema. In particular, XKeyword consists of two stages. In the preprocessing stage, a set of keyword indices is built along with indexed path relations that describe particular patterns of paths in the graph. In the query processing stage, plans are developed that use a near-optimal set of path relations to efficiently locate the keyword query results. The results are presented graphically using the novel idea of interactive result graphs, which are populated on demand according to the user's navigation and allow efficient information discovery. We provide theoretical and experimental points for the selection of the appropriate set of precomputed path relations. We also propose and experimentally evaluate algorithms to minimize the number of queries sent to the database to output the top-K results.
This publication has 8 references indexed in Scilit:
- Storing and querying XML data using denormalized relational databases. The VLDB Journal, 2005
- DBXplorer: a system for keyword-based search over relational databases. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- From XML schema to relations: a cost-based approach to XML storage. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- Optimizing regular path expressions using graph schemas. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Keyword searching and browsing in databases using BANKS. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Flexible queries over semistructured data. Published by Association for Computing Machinery (ACM), 2001
- Integrating keyword search into XML query processing. Computer Networks, 2000
- Storing semistructured data with STORED. Published by Association for Computing Machinery (ACM), 1999