skip to main content
column

Report on the Evaluation-as-a-Service (EaaS) Expert Workshop

Published:23 June 2015Publication History
Skip Abstract Section

Abstract

In this report, we summarize the outcome of the "Evaluation-as-a-Service" workshop that was held on the 5th and 6th March 2015 in Sierre, Switzerland. The objective of the meeting was to bring together initiatives that use cloud infrastructures, virtual machines, APIs (Application Programming Interface) and related projects that provide evaluation of information retrieval or machine learning tools as a service.

References

  1. Georgios Balikas, Anastasia Krithara, Ioannis Partalas, and Georgios Paliouras. BioASQ: A challenge on large-scale biomedical semantic indexing and questionanswering. In MRMD'15: Proceedings of the Multimodal Retrieval in the Medical Domain Workshop, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Georgios Balikas, Ioannis Partalas, Axel-Cyrille Ngonga Ngomo, Anastasia Krithara, and Georgios Paliouras. Results of the BioASQ track of the question answering lab at CLEF 2014. In CLEF'14: Proceedings of the 5th International Conference of the CLEF Initiative, pages 1181--1193. Springer, 2014.Google ScholarGoogle Scholar
  3. Krisztian Balog, Liadh Kelly, and Anne Schuth. Head first: Living labs for ad-hoc search evaluation. In CIKM'14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, pages 1815--1818, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Torben Brodt and Frank Hopfgartner. Shedding Light on a Living Lab: The CLEF NEWSREEL Open Recommendation Platform. In IIiX'14: Proceedings of Information Interaction in Context Conference, pages 223--226. ACM, 08 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Tim Gollub, Benno Stein, and Steven Burrows. Ousting Ivory Tower Research: Towards a Web Framework for Providing Experiments as a Service. In SIGIR'12: Proceedings of the 35th International ACM Conference on Research and Development in Information Retrieval, pages 1125--1126. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Allan Hanbury, Henning Müller, Georg Langs, Marc André Weber, Bjoern H. Menze, and Tomas Salas Fernandez. Bringing the algorithms to the data: cloud-based benchmarking for medical image analysis. In CLEF'12: Proceedings of the 3rd International Conference of the CLEF Initiative, pages 24--29. Springer Verlag, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Frank Hopfgartner, Benjamin Kille, Andreas Lommatzsch, Torben Brodt, and Tobias Heintz. Benchmarking News Recommendations in a Living Lab. In CLEF'14: Proceedings of the 5th International Conference of the CLEF Initiative, pages 250--267. Springer Verlag, 09 2014.Google ScholarGoogle ScholarCross RefCross Ref
  8. Benjamin Kille, Frank Hopfgartner, Torben Brodt, and Tobias Heintz. The plista dataset. In NRS'13: Proceedings of the International Workshop and Challenge on News Recommender Systems, pages 14--21. ACM, 10 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Georg Langs, Henning Müller, Bjoern H. Menze, and Allan Hanbury. Visceral: Towards large data in medical imaging --- challenges and directions. In MCBR-CDS'12: Proceedings of the Third MICCAI International Workshop, pages 92--98. Springer, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Jimmy Lin and Miles Efron. Overview of the TREC-2013 Microblog Track. In TREC'13: Proceedings of the 22nd Text REtrieval Conference, Gaithersburg, Maryland, 2013.Google ScholarGoogle Scholar
  11. Richard McCreadie, Ian Soboroff, Jimmy Lin, Craig Macdonald, Iadh Ounis, and Dean McCullough. On building a reusable twitter corpus. In SIGIR'12: Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1113--1114, Portland, Oregon, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Iadh Ounis, Craig Macdonald, Jimmy Lin, and Ian Soboroff. Overview of the TREC- 2011 Microblog Track. In TREC'11: Proceedings of the 20th Text REtrieval Conference, Gaithersburg, Maryland, 2011.Google ScholarGoogle Scholar
  13. Martin Potthast, Tim Gollub, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, and Benno Stein. Improving the Reproducibility of PAN's Shared Tasks: Plagiarism Detection, Author Identification, and Author Profiling. In CLEF'14: Proceedings of the 5th Int. Conference of the CLEF Initiative, pages 268--299. Springer Verlag, 2014.Google ScholarGoogle Scholar
  14. Jinfeng Rao, Jimmy Lin, and Miles Efron. Reproducible experiments on lexical and temporal feedback for tweet search. In ECIR'15: Proceedings of the 37th European Conference on Information Retrieval, pages 755--767, Vienna, Austria, 2015.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Report on the Evaluation-as-a-Service (EaaS) Expert Workshop

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM SIGIR Forum
      ACM SIGIR Forum  Volume 49, Issue 1
      June 2015
      69 pages
      ISSN:0163-5840
      DOI:10.1145/2795403
      Issue’s Table of Contents

      Copyright © 2015 Authors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 23 June 2015

      Check for updates

      Qualifiers

      • column

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader