skip to main content
10.1145/1183614.1183658acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Summarizing local context to personalize global web search

Published:06 November 2006Publication History

ABSTRACT

The PC Desktop is a very rich repository of personal information, efficiently capturing user's interests. In this paper we propose a new approach towards an automatic personalization of web search in which the user specific information is extracted from such local desktops, thus allowing for an increased quality of user profiling, while sharing less private information with the search engine. More specifically, we investigate the opportunities to select personalized query expansion terms for web search using three different desktop oriented approaches: summarizing the entire desktop data, summarizing only the desktop documents relevant to each user query, and applying natural language processing techniques to extract dispersive lexical compounds from relevant desktop resources. Our experiments with the Google API showed at least the latter two techniques to produce a very strong improvement over current web search.

References

  1. P. G. Anick and S. Tipirneni. The paraphrase search assistant: Terminological feedback for iterative information seeking. In Proc. of the 22nd Annual Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. ACM Press / Addison-Wesley, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Budzik and K. Hammond. Watson: Anticipating and contextualizing information needs. In Proceedings of the Sixty-second Annual Meeting of the American Society for Information Science, 1999.Google ScholarGoogle Scholar
  4. J. Carbonell and J. Goldstein. The use of mmr, diversity-based reranking for reordering documents and producing summaries. In Proc. of the 21st Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. A. Chirita, C. S. Firan, and W. Nejdl. Pushing task relevant web links down to the desktop. In Proc. of the 8th ACM Intl. Workshop on Web Information and Data Management held at the 15th Intl. ACM CIKM Conference on Information and Knowledge Management, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P.-A. Chirita, W. Nejdl, R. Paiu, and C. Kohlschütter. Using odp metadata to personalize search. In Proc. of the 28th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P.-A. Chirita, D. Olmedilla, and W. Nejdl. Pros: A personalized ranking platform for web search. In Proc. of the 3rd Intl. Conf. on Adaptive Hypermedia and Adaptive Web-Based Systems, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  8. D. R. Cutting, D. R. Karger, and J. O. Pedersen. Constant interaction-time scatter/gather browsing of very large document collections. In SIGIR, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. R. Cutting, J. O. Pedersen, D. R. Karger, and J. W. Tukey. Scatter/gather: A cluster-based approach to browsing large document collections. In SIGIR, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Dumais, E. Cutrell, R. Sarin, and E. Horvitz. Implicit queries (iq) for contextualized search. In Proc. of the 27th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. Erkan and D. R. Radev. Lexrank: Graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. (JAIR), 22:457--479, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Gauch, J. Chaffee, and A. Pretschner. Ontology-based personalized search and browsing. Web Intelli. and Agent Sys., 1(3-4):219--234, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Goldstein, M. Kantrowitz, V. Mittal, and J. Carbonell. Summarizing text documents: Sentence selection and evaluation metrics. In Proc. of the 22nd Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. T. Haveliwala. Topic-sensitive pagerank. In In Proceedings of the Eleventh International World Wide Web Conference, Honolulu, Hawaii, May 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. G. Jeh and J. Widom. Scaling personalized web search. In Proc. of the 12th Intl. World Wide Web Conference, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting clickthrough data as implicit feedback. In Proc. of the 28th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. K. S. Jones, S. Walker, and S. Robertson. Probabilistic model of information retrieval: Development and status. Technical report, Cambridge University, 1998.Google ScholarGoogle Scholar
  18. S. Katz. Distribution of content words and phrases in text and language modelling. Natural Language Engineering, 2(1):15--59, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. M. Lam-Adesina and G. J. F. Jones. Applying summarization techniques for term selection in relevance feedback. In Proc. of the 24th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Lawrie and W. Croft. Generating hierarchical summaries for web searches. In Proc. of the 26th Intl. ACM SIGIR Conf. on Research and Development in Information Retr., 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D. Lawrie, W. B. Croft, and A. L. Rosenberg. Finding topic words for hierarchical summarization. In Proc. of the 24th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. F. Liu, C. Yu, and W. Meng. Personalized web search for improving retrieval effectiveness. IEEE Trans. on Knowledge and Data Eng., 16(1):28--40, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. H. Luhn. Automatic creation of literature abstracts. IBM Journ. of Research and Development, 2(2):159--165, 1958.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. G. Miller. Wordnet: An electronic lexical database. Communications of the ACM, 38(11):39--41, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. Mitra, A. Singhal, and C. Buckley. Improving automatic query expansion. In Proc. of the 21st Intl. ACM SIGIR Conf. on Research and Development in Information Retr., 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. T. Nomoto and Y. Matsumoto. A new approach to unsupervised text summarization. In Proc. of the 24th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.Google ScholarGoogle Scholar
  28. D. R. Radev, H. Jing, M. Stys, and D. Tam. Centroid-based summarization of multiple documents. Inf. Process. and Management, 40(6):919--938, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. B. J. Rhodes and P. Maes. Just-in-time information retrieval agents. IBM Syst. J., 39(3-4):685--704, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. S. E. Robertson and S. Walker. Okapi/keenbow at trec-8. In TREC, 1999.Google ScholarGoogle Scholar
  31. J. Rocchio. Relevance feedback in information retrieval. The Smart Retrieval System: Experiments in Automatic Document Processing, pages 313--323, 1971.Google ScholarGoogle Scholar
  32. D. Rose, R. Mander, T. Oren, D. Ponceleon, G. Salomon, and Y. Wong. Content awareness in a file system interface: Implementing the 'pile' metaphor for organizing information. In Proc. of the 16th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. M. Sanderson and W. B. Croft. Deriving concept hierarchies from text. In Proc. of the 22nd Intl. ACM SIGIR Conf. on Research and Development in Information Retr., 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. K. Sugiyama, K. Hatano, and M. Yoshikawa. Adaptive web search based on user profile constructed without any effort from users. In Proc. of the 13th Intl. WWW Conf., 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. D. Sullivan. The older you are, the more you want personalized search, 2004. http://searchenginewatch.com/searchday/article.php/3385131.Google ScholarGoogle Scholar
  36. J. Teevan, S. Dumais, and E. Horvitz. Personalizing search via automated analysis of interests and activities. In Proc. of the 28th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. A. Tombros and M. Sanderson. Advantages of query biased summaries in information retrieval. In Proc. of the 21st Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. E. Volokh. Personalization and privacy. Commun. ACM, 43(8), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. P. Willett. Recent trends in hierarchic document clustering: a critical review. Inf. Process. and Management, 24(5), 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. O. Zamir and O. Etzioni. Grouper: a dynamic clustering interface to web search results. Comput. Networks, 31(11-16), 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. H.-J. Zeng, Q.-C. He, Z. Chen, W.-Y. Ma, and J. Ma. Learning to cluster web search results. In Proc. of the 27th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Summarizing local context to personalize global web search

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management
            November 2006
            916 pages
            ISBN:1595934332
            DOI:10.1145/1183614

            Copyright © 2006 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 6 November 2006

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            Overall Acceptance Rate1,861of8,427submissions,22%

            Upcoming Conference

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader