skip to main content
10.1145/511446.511513acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
Article

Topic-sensitive PageRank

Published:07 May 2002Publication History

ABSTRACT

In the original PageRank algorithm for improving the ranking of search-query results, a single PageRank vector is computed, using the link structure of the Web, to capture the relative "importance" of Web pages, independent of any particular search query. To yield more accurate search results, we propose computing a set of PageRank vectors, biased using a set of representative topics, to capture more accurately the notion of importance with respect to a particular topic. By using these (precomputed) biased PageRank vectors to generate query-specific importance scores for pages at query time, we show that we can generate more accurate rankings than with a single, generic PageRank vector. For ordinary keyword search queries, we compute the topic-sensitive PageRank scores for pages satisfying the query using the topic of the query keywords. For searches done in context (e.g., when the search query is performed by highlighting words in a Web page), we compute the topic-sensitive PageRank scores using the topic of the context in which the query appeared.

References

  1. The Google Search Engine: Commercial search engine founded by the originators of PageRank. http://www.google.com/.Google ScholarGoogle Scholar
  2. The Open Directory Project: Web directory for over 2.5 million URLs. http://www.dmoz.org/.Google ScholarGoogle Scholar
  3. 'More Evil Than Dr. Evil?' http://searchenginewatch.com/sereport/99/11-google.html.Google ScholarGoogle Scholar
  4. Krishna Bharat and Monika R. Henzinger. Improved algorithms for topic distillation in a hyperlinked environment. In Proceedings of the ACM-SIGIR, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Krishna Bharat and George~A. Mihaila. When experts agree: Using non-affiliated experts to rank popular topics. In Proceedings of the Tenth International World Wide Web Conference, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Sergey Brin, Rajeev Motwani, Larry Page, and Terry Winograd. What can you do with a web in your pocket. In Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 1998.Google ScholarGoogle Scholar
  7. Sergey Brin and Larry Page. The anatomy of a large-scale hypertextual web search engine. In Proceedings of the Seventh International World Wide Web Conference, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Chakrabarti, B. Dom, D. Gibson, J. Kleinberg, P. Raghavan, and S. Rajagopalan. Automatic resource compilation by analyzing hyperlink structure and associated text. In Proceedings of the Seventh International World Wide Web Conference, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Cynthia Dwork, Ravi Kumar, Moni Naor, and D. Sivakumar. Rank aggregation methods for the web. In Proceedings of the Tenth International World Wide Web Conference, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Lev Finkelstein, Evgeniy Gabrilovich, Yossi Matias, Ehud Rivlin, Zach Solan, Gadi Wolfman, and Eytan Ruppin. Placing search in context: the concept revisited. In Proceedings of the Tenth International World Wide Web Conference, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Taher H. Haveliwala. Efficient computation of PageRank. Stanford University Technical Report, 1999.Google ScholarGoogle Scholar
  12. J. Hirai, S. Raghavan, H. Garcia-Molina, and A. Paepcke. Webbase: A repository of web pages. In Proceedings of the Ninth International World Wide Web Conference, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Glen Jeh and Jennifer Widom. Scaling personalized web search. Stanford University Technical Report, 2002.Google ScholarGoogle Scholar
  14. Jon Kleinberg. Authoritative sources in a hyperlinked environment. In Proceedings of the ACM-SIAM Symposium on Discrete Algorithms, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Rajeev Motwani and Prabhakar Raghavan. Randomized Algorithms. Cambridge University Press, United Kingdom, 1995. Google ScholarGoogle ScholarCross RefCross Ref
  16. Larry Page. PageRank: Bringing order to the web. Stanford Digital Libraries Working Paper, 1997.Google ScholarGoogle Scholar
  17. Davood Rafiei and Alberto O. Mendelzon. What is this page known for? Computing web page reputations. In Proceedings of the Ninth International World Wide Web Conference, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Matthew Richardson and Pedro Domingos. The Intelligent Surfer: Probabilistic Combination of Link and Content Information in PageRank, volume 14. MIT Press, Cambridge, MA, 2002 (To appear).Google ScholarGoogle Scholar

Index Terms

  1. Topic-sensitive PageRank

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                WWW '02: Proceedings of the 11th international conference on World Wide Web
                May 2002
                754 pages
                ISBN:1581134495
                DOI:10.1145/511446

                Copyright © 2002 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 7 May 2002

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • Article

                Acceptance Rates

                Overall Acceptance Rate1,899of8,196submissions,23%

                Upcoming Conference

                WWW '24
                The ACM Web Conference 2024
                May 13 - 17, 2024
                Singapore , Singapore

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader