research-article

On the Additivity and Weak Baselines for Search Result Diversification Research

Authors:
Mehmet Akcay

Middle East Technical University & ASELSAN, Ankara, Turkey

Middle East Technical University & ASELSAN, Ankara, Turkey
View Profile

,
Ismail Sengor Altingovde

Middle East Technical University, Ankara, Turkey

Middle East Technical University, Ankara, Turkey
View Profile

,
Craig Macdonald

University of Glasgow, Glasgow, Scotland Uk

University of Glasgow, Glasgow, Scotland Uk
View Profile

,
Iadh Ounis

University of Glasgow, Glasgow, Scotland Uk

University of Glasgow, Glasgow, Scotland Uk
View Profile

ICTIR '17: Proceedings of the ACM SIGIR International Conference on Theory of Information RetrievalOctober 2017Pages 109–116https://doi.org/10.1145/3121050.3121059

Published:01 October 2017Publication History

ICTIR '17: Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval

Pages 109–116

ABSTRACT

A recent study on the topic of additivity addresses the task of search result diversification and concludes that while weaker baselines are almost always significantly improved by the evaluated diversification methods, for stronger baselines, just the opposite happens, i.e., no significant improvement can be observed. Due to the importance of the issue in shaping future research directions and evaluation strategies in search results diversification, in this work, we first aim to reproduce the findings reported in the previous study, and then investigate its possible limitations. Our extensive experiments first reveal that under the same experimental setting with that previous study, we can reach similar results. Next, we hypothesize that for stronger baselines, tuning the parameters of some methods (i.e., the trade-off parameter between the relevance and diversity of the results in this particular scenario) should be done in a more fine-grained manner. With trade-off parameters that are specifically determined for each baseline run, we show that the percentage of significant improvements even over the strong baselines can be doubled. As a further issue, we discuss the possible impact of using the same strong baseline retrieval function for the diversity computations of the methods. Our takeaway message is that in the case of a strong baseline, it is more crucial to tune the parameters of the diversification methods to be evaluated; but once this is done, additivity is achievable.

References

Rakesh Agrawal, Sreenivas Gollapudi, Alan Halverson, and Samuel Ieong 2009. Diversifying Search Results. In Proceedings of WSDM. 5--14. Google ScholarDigital Library
Timothy Armstrong, Alistair Moffat, William Webber, and Justin Zobel 2009. Improvements that don't add up: ad-hoc retrieval results since 1998 Proceedings of CIKM. 601--610. Google ScholarDigital Library
Van Dang and W. Bruce Croft 2012. Diversity by proportionality: an election-based approach to search result diversification. Proceedings of SIGIR. 65--74. Google ScholarDigital Library
Elena Demidova, Peter Fankhauser, Xuan Zhou, and Wolfgang Nejdl 2010. DivQ: diversification for keyword search over structured databases Proceedings of SIGIR. 331--338. Google ScholarDigital Library
Joseph A. Fox and Edward Shaw 1994. Combination of multiple sources: The TREC-2 interactive track matrix experiment Proceedings of SIGIR.Google Scholar
Sreenivas Gollapudi and Aneesh Sharma 2009. An Axiomatic Approach for Result Diversification. Proceedings of WWW. 381--390. Google ScholarDigital Library
Sadegh Kharazmi, Falk Scholer, David Vallet, and Mark Sanderson 2016. Examining Additivity and Weak Baselines. Transactions on Information Systems Vol. 34, 4 (2016), 23. Google ScholarDigital Library
Sadegh Kharazmi, Falk Scholer, David Vallet, and Mark Sanderson 2017. Personal communication. (May 2017).Google Scholar
Joon Ho Lee. 1997. Analyses of multiple evidence combination. In Proceedings of SIGIR. 267--276. Google ScholarDigital Library
Enrico Minack, Wolf Siberski, and Wolfgang Nejdl. 2011. Incremental diversification for very large sets: a streaming-based approach Proceedings of SIGIR. 585--594. Google ScholarDigital Library
Kaweh Djafari Naini, Ismail Sengor Altingovde, and Wolf Siberski 2016. Scalable and Efficient Web Search Result Diversification. Transactions on the Web Vol. 10, 3 (2016), 15:1--15:30. Google ScholarDigital Library
Kezban Dilek Onal, Ismail Sengor Altingovde, and Pinar Karagoz 2015. Utilizing Word Embeddings for Result Diversification in Tweet Search Proceedings of AIRS. 366--378.Google Scholar
Ahmet Murat Ozdemiray and Ismail Sengor Altingovde 2014. Query Performance Prediction for Aspect Weighting in Search Result Diversification Proceedings of CIKM. 1871--1874. Google ScholarDigital Library
Ahmet Murat Ozdemiray and Ismail Sengor Altingovde 2015. Explicit search result diversification using score and rank aggregation methods. JASIST, Vol. 66, 6 (2015), 1212--1228.Google Scholar
Makbule Gulcin Ozsoy, Kezban Dilek Onal, and Ismail Sengor Altingovde 2014. Result Diversification for Tweet Search. In Proceedings of WISE. 78--89.Google ScholarCross Ref
Monica Lestari Paramita, Jiayu Tang, and Mark Sanderson. 2009. Generic and Spatial Approaches to Image Search Results Diversification Proceedings of ECIR. 603--610. Google ScholarDigital Library
Stephen Robertson and Hugo Zaragoza 2009. The Probabilistic Relevance Framework: BM25 and Beyond. Foundations and Trends in Information Retrieval, Vol. 3, 4 (2009), 333--389. Google ScholarDigital Library

Index Terms

On the Additivity and Weak Baselines for Search Result Diversification Research
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing
      1. Query intent
    2. Retrieval models and ranking
      1. Novelty in information retrieval

Recommendations

Examining Additivity and Weak Baselines

We present a study of which baseline to use when testing a new retrieval technique. In contrast to past work, we show that measuring a statistically significant improvement over a weak baseline is not a good predictor of whether a similar improvement ...
Read More
Explicit diversification of image search
ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval

Search result diversification can increase user satisfaction in answering a particular information need. There are many ways of diversify search results. In some cases the user has a clear idea of how they would like to see their results diversified. ...
Read More
Intent-aware search result diversification
SIGIR '11: Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval

Search result diversification has gained momentum as a way to tackle ambiguous queries. An effective approach to this problem is to explicitly model the possible aspects underlying a query, in order to maximise the estimated relevance of the retrieved ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ICTIR '17: Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval
October 2017
348 pages
ISBN:9781450344906
DOI:10.1145/3121050
General Chairs:
Jaap Kamps
University of Amsterdam, The Netherlands
,
Evangelos Kanoulas
University of Amsterdam, The Netherlands
,
Maarten de Rijke
University of Amsterdam, The Netherlands
,
Program Chairs:
Hui Fang
University of Delaware, USA
,
Emine Yilmaz
University College London, UK
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 October 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
additivity
result diversification
statistical significance
Qualifiers
- research-article
Conference

Acceptance Rates
ICTIR '17 Paper Acceptance Rate27of54submissions,50%Overall Acceptance Rate209of482submissions,43%
More
Upcoming Conference
ICTIR '24

Sponsor:

sigir

The 2024 ACM SIGIR International Conference on the Theory of Information Retrieval

July 13, 2024

Washington DC , DC , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 5
  Total Citations
  View Citations
- 101
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

On the Additivity and Weak Baselines for Search Result Diversification Research

ICTIR '17: Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Examining Additivity and Weak Baselines

Explicit diversification of image search

Intent-aware search result diversification