skip to main content
research-article

Oracle in Image Search: A Content-Based Approach to Performance Prediction

Published:01 May 2012Publication History
Skip Abstract Section

Abstract

This article studies a novel problem in image search. Given a text query and the image ranking list returned by an image search system, we propose an approach to automatically predict the search performance. We demonstrate that, in order to estimate the mathematical expectations of Average Precision (AP) and Normalized Discounted Cumulative Gain (NDCG), we only need to predict the relevance probability of each image. We accomplish the task with a query-adaptive graph-based learning based on the images’ ranking order and visual content. We validate our approach with a large-scale dataset that contains the image search results of 1,165 queries from 4 popular image search engines. Empirical studies demonstrate that our approach is able to generate predictions that are highly correlated with the real search performance. Based on the proposed image search performance prediction scheme, we introduce three applications: image metasearch, multilingual image search, and Boolean image search. Comprehensive experiments are conducted to validate our approach.

References

  1. Abusalah, M., Tait, J., and Oakes, M. P. 2005. Literature review of cross language information retrieval. In Proceedings of World Academy of Science, Engineering and Technology Conference.Google ScholarGoogle Scholar
  2. Ahonen, T., Hadid, A., and Pietikainen, M. 2004. Face recognition with local binary patterns. In Proceedings of the European Conference on Computer Vision.Google ScholarGoogle Scholar
  3. Amati, G., Carpineto, C., Romano, G., and Bordoni, F. U. 2004. Query difficulty, robustness and selective application of query expansion. In Proceedings of the European Conference on IR Research.Google ScholarGoogle Scholar
  4. Balasubramanian, N., Kumaran, G., and Carvalho, V. R. 2010. Predicting query performance on the web. In Proceedings of the ACM International SIGIR Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Banerjee, S. and Pedersen, T. 2003. Extended gloss overlaps as a measure of semantic relatedness. In Proceedings of the Joint Conference on Artificial Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Benitez, A., Beigi, M., Beigi, I., and Chang, S. F. 1998. A content-based image meta-search engine using relevance feedback. IEEE Internet Comput. 2, 59--69. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Carmel, D., Yom-Tov, E., Darlow, A., and Pelleg, D. 2006. What makes a query difficult? In Proceedings of the ACM International SIGIR Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Cronen-Townsend, S., Zhou, Y., and Croft, W. B. 2002. Predicting query performance. In Proceedings of the ACM International SIGIR Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Hauff, C., Hiemstra, D., and de Jong, F. 2008a. A survey of pre-retrieval query performance predictors. In Proceedings of the Conference on Information and Knowledge Management. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Hauff, C., Murdock, V., and Baeza-Yates, R. 2008b. Improved query difficulty prediction for the web. In Proceedings of the Conference on Information and Knowledge Management. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Hauff, C., Azzopardi, L., and Hiemstra, D. 2009. The combination and evaluation of query performance prediction methods. In Proceedings of the European Conference on IR Research. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. He, B. and Ounis, I. 2004. Inferring query performance using pre-retrieval predictors. In Proceedings of the Symposium on String Processing and Information Retrieval.Google ScholarGoogle Scholar
  13. Hsu, W., Kennedy, L., and Chang, S. F. 2007. Video search reranking through random walk over document-level context graph. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Jing, Y. and Baluja, S. 2008. Visualrank: Applying pagerank to large-scale image search. IEEE Trans. Pattern. Anal. Mach. Intell. 30, 11, 1877--1890. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Lew, M. S. 2006. Content-Based multimedia information retrieval: State of the art and challenges. ACM Trans. Multimedia Comput. Comm. Appl. 2, 1. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Li, H., Wang, M., and Hua, X. S. 2009. Msra-mm 2.0: A large-scale web multimedia dataset. In Proceedings of the IEEE ICDM International Conference on Data Mining Workshop. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Li, J. and Wang, J. Z. 2003. Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans. Pattern. Anal. Mach. Intell. 25, 9, 1075--1088. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Li, J., Allinson, N., Tao, D., and Li, X. 2006. Multi-Training support vector machine for image retrieval. IEEE Trans. Image Process. 15, 11, 3597--3601. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Li, Y., Luo, Y., Tao, D., and Xu, C. 2011. Query difficulty guided image retrieval system. In Proceedings of the International Conference on Advances in Multimedia Modeling. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Liu, Y. and Mei, T. 2011. Optimizing visual search reranking via pairwise learning. IEEE Trans. Multimedia 13, 2, 280--291. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Manning, C. D., Raghavan, P., and Schtze, H. 2008. Introduction to Information Retrieval. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Manoj, M. and Elizabeth, J. 2008. Information retrieval on internet using meta-search engines: A review. J. Sci. Industr. Res. 67, 739--746.Google ScholarGoogle Scholar
  23. Monay, F. and Gatica-Perez, D. 2003. On image auto-annotation with latent space models. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Mothe, J. and Tanguy, L. 2005. Linguistic features to predict query difficulty. In Proceedings of the ACM International SIGIR Workshop.Google ScholarGoogle Scholar
  25. Natsev, A., Naphade, M. R., and TešiĆ, J. 2005. Learning the semantics of multimedia queries and concepts from a small number of examples. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Patwardhan, S. 2006. Using wordnet-based context vectors to estimate the semantic relatedness of concepts. In Proceedings of the European Association of Chinese Linguistics.Google ScholarGoogle Scholar
  27. Popescu, A. 2007. Image retrieval using a multilingual ontology. In Large Scale Semantic Access to Content (Text, Image, Video, and Sound). Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Powell, A. L., French, J. C., Callan, J., Connell, M., and Viles, C. L. 2000. The impact of database selection on distributed searching. In Proceedings of the ACM International SIGIR Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Quénot, G., Tan, T. P., Le, V. B., Ayache, S., Besacier, L., and Mulhem, P. 2010. Content-based search in multilingual audiovisual documents using the international phonetic alphabet. Multimedia Tools Appl. 48, 1, 123--140. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Selberg, E. and Etzioni, O. 1995. Multi-Service search and comparison using the metacrawler. In Proceedings of the International World Wide Web Conference.Google ScholarGoogle Scholar
  31. Si, L. and Callan, J. 2005. Modeling search engine effectiveness for federated search. In Proceedings of the ACM International SIGIR Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Smeulders, A., Worring, M., Santini, S., Gupta, A., and Jain, R. 2000. Content-based image retrieval at the end of the early years. IEEE Trans. Pattern. Anal. Mach. Intell. 22, 12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Smith, L. S. and Hurson, A. R. 2003. A search engine selection methodology. In Proceedings of the International Conference on Information Technology: Computers and Communications. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Tian, X., Yang, L., Wang, J., Yang, Y., Wu, X., and Hua, X. S. 2008. Bayesian video search reranking. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Wang, M., Hua, X. S., Hong, R., Tang, J., Qi, G. J., and Song, Y. 2009a. Unified video annotation via multi-graph learning. IEEE Trans. Circ. Syst. Video Technol. 19, 5. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Wang, M., Hua, X. S., Tang, J., and Hong, R. 2009b. Beyond distance measurement: constructing neighborhood similarity for video annotation. IEEE Trans. Multimedia 11, 3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Wang, M., Yang, K., Hua, X. S., and Zhang, H. J. 2010. Towards a relevant and diverse search of social images. IEEE Trans. Multimedia 12, 8, 829--842. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Xing, X., Zhang, Y., and Han, M. 2010. Query difficulty prediction for contextual image retrieval. In Proceedings of the European Conference on IR Research. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Yan, R., Hauptmann, A., and Jin, R. 2003. Multimedia search with pseudo-relevance feedback. In Proceedings of the International Conference on Image and Video Retrieval. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Yang, L. and Hanjalic, A. 2010. Supervised reranking for web image search. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Zhao, Y., Scholer, F., and Tsegay, Y. 2008. Effective pre-retrieval query performance prediction using similarity and variability evidence. In Proceedings of the European Conference on IR Research. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Zhou, Y. and Croft, W. B. 2007. Query performance prediction in web search environments. In Proceedings of the ACM International SIGIR Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Oracle in Image Search: A Content-Based Approach to Performance Prediction

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Information Systems
      ACM Transactions on Information Systems  Volume 30, Issue 2
      May 2012
      245 pages
      ISSN:1046-8188
      EISSN:1558-2868
      DOI:10.1145/2180868
      Issue’s Table of Contents

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 May 2012
      • Accepted: 1 February 2012
      • Revised: 1 December 2011
      • Received: 1 July 2011
      Published in tois Volume 30, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader