Topic-sensitive pagerank: A context-sensitive ranking algorithm for web search
Top Cited Papers
- 9 July 2003
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering
- Vol. 15 (4) , 784-796
- https://doi.org/10.1109/tkde.2003.1208999
Abstract
The original PageRank algorithm for improving the ranking of search-query results computes a single vector, using the link structure of the Web, to capture the relative "importance" of Web pages, independent of any particular search query. To yield more accurate search results, we propose computing a set of PageRank vectors, biased using a set of representative topics, to capture more accurately the notion of importance with respect to a particular topic. For ordinary keyword search queries, we compute the topic-sensitive PageRank scores for pages satisfying the query using the topic of the query keywords. For searches done in context (e.g., when the search query is performed by highlighting words in a Web page), we compute the topic-sensitive PageRank scores using the topic of the context in which the query appeared. By using linear combinations of these (precomputed) biased PageRank vectors to generate context-specific importance scores for pages at query time, we show that we can generate more accurate rankings than with a single, generic PageRank vector. We describe techniques for efficiently implementing a large-scale search system based on the topic-sensitive PageRank scheme.Keywords
This publication has 16 references indexed in Scilit:
- Scaling personalized web searchPublished by Association for Computing Machinery (ACM) ,2003
- Extrapolation methods for accelerating PageRank computationsPublished by Association for Computing Machinery (ACM) ,2003
- Winners don't take all: Characterizing the competition for links on the webProceedings of the National Academy of Sciences, 2002
- When experts agreePublished by Association for Computing Machinery (ACM) ,2001
- What is this page known for? Computing Web page reputationsComputer Networks, 2000
- Authoritative sources in a hyperlinked environmentJournal of the ACM, 1999
- QuantizationIEEE Transactions on Information Theory, 1998
- The anatomy of a large-scale hypertextual Web search engineComputer Networks and ISDN Systems, 1998
- Automatic resource compilation by analyzing hyperlink structure and associated textComputer Networks and ISDN Systems, 1998
- Randomized AlgorithmsPublished by Cambridge University Press (CUP) ,1995