skip to main content
10.1145/3121050.3121059acmconferencesArticle/Chapter ViewAbstractPublication PagesictirConference Proceedingsconference-collections
research-article

On the Additivity and Weak Baselines for Search Result Diversification Research

Published:01 October 2017Publication History

ABSTRACT

A recent study on the topic of additivity addresses the task of search result diversification and concludes that while weaker baselines are almost always significantly improved by the evaluated diversification methods, for stronger baselines, just the opposite happens, i.e., no significant improvement can be observed. Due to the importance of the issue in shaping future research directions and evaluation strategies in search results diversification, in this work, we first aim to reproduce the findings reported in the previous study, and then investigate its possible limitations. Our extensive experiments first reveal that under the same experimental setting with that previous study, we can reach similar results. Next, we hypothesize that for stronger baselines, tuning the parameters of some methods (i.e., the trade-off parameter between the relevance and diversity of the results in this particular scenario) should be done in a more fine-grained manner. With trade-off parameters that are specifically determined for each baseline run, we show that the percentage of significant improvements even over the strong baselines can be doubled. As a further issue, we discuss the possible impact of using the same strong baseline retrieval function for the diversity computations of the methods. Our takeaway message is that in the case of a strong baseline, it is more crucial to tune the parameters of the diversification methods to be evaluated; but once this is done, additivity is achievable.

References

  1. Rakesh Agrawal, Sreenivas Gollapudi, Alan Halverson, and Samuel Ieong 2009. Diversifying Search Results. In Proceedings of WSDM. 5--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Timothy Armstrong, Alistair Moffat, William Webber, and Justin Zobel 2009. Improvements that don't add up: ad-hoc retrieval results since 1998 Proceedings of CIKM. 601--610. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Van Dang and W. Bruce Croft 2012. Diversity by proportionality: an election-based approach to search result diversification. Proceedings of SIGIR. 65--74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Elena Demidova, Peter Fankhauser, Xuan Zhou, and Wolfgang Nejdl 2010. DivQ: diversification for keyword search over structured databases Proceedings of SIGIR. 331--338. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Joseph A. Fox and Edward Shaw 1994. Combination of multiple sources: The TREC-2 interactive track matrix experiment Proceedings of SIGIR.Google ScholarGoogle Scholar
  6. Sreenivas Gollapudi and Aneesh Sharma 2009. An Axiomatic Approach for Result Diversification. Proceedings of WWW. 381--390. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Sadegh Kharazmi, Falk Scholer, David Vallet, and Mark Sanderson 2016. Examining Additivity and Weak Baselines. Transactions on Information Systems Vol. 34, 4 (2016), 23. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Sadegh Kharazmi, Falk Scholer, David Vallet, and Mark Sanderson 2017. Personal communication. (May 2017).Google ScholarGoogle Scholar
  9. Joon Ho Lee. 1997. Analyses of multiple evidence combination. In Proceedings of SIGIR. 267--276. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Enrico Minack, Wolf Siberski, and Wolfgang Nejdl. 2011. Incremental diversification for very large sets: a streaming-based approach Proceedings of SIGIR. 585--594. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kaweh Djafari Naini, Ismail Sengor Altingovde, and Wolf Siberski 2016. Scalable and Efficient Web Search Result Diversification. Transactions on the Web Vol. 10, 3 (2016), 15:1--15:30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Kezban Dilek Onal, Ismail Sengor Altingovde, and Pinar Karagoz 2015. Utilizing Word Embeddings for Result Diversification in Tweet Search Proceedings of AIRS. 366--378.Google ScholarGoogle Scholar
  13. Ahmet Murat Ozdemiray and Ismail Sengor Altingovde 2014. Query Performance Prediction for Aspect Weighting in Search Result Diversification Proceedings of CIKM. 1871--1874. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ahmet Murat Ozdemiray and Ismail Sengor Altingovde 2015. Explicit search result diversification using score and rank aggregation methods. JASIST, Vol. 66, 6 (2015), 1212--1228.Google ScholarGoogle Scholar
  15. Makbule Gulcin Ozsoy, Kezban Dilek Onal, and Ismail Sengor Altingovde 2014. Result Diversification for Tweet Search. In Proceedings of WISE. 78--89.Google ScholarGoogle ScholarCross RefCross Ref
  16. Monica Lestari Paramita, Jiayu Tang, and Mark Sanderson. 2009. Generic and Spatial Approaches to Image Search Results Diversification Proceedings of ECIR. 603--610. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Stephen Robertson and Hugo Zaragoza 2009. The Probabilistic Relevance Framework: BM25 and Beyond. Foundations and Trends in Information Retrieval, Vol. 3, 4 (2009), 333--389. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. On the Additivity and Weak Baselines for Search Result Diversification Research

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ICTIR '17: Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval
        October 2017
        348 pages
        ISBN:9781450344906
        DOI:10.1145/3121050

        Copyright © 2017 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 October 2017

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        ICTIR '17 Paper Acceptance Rate27of54submissions,50%Overall Acceptance Rate209of482submissions,43%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader