skip to main content
10.1145/2502081.2502093acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval

Authors Info & Claims
Published:21 October 2013Publication History

ABSTRACT

This paper presents a novel Attribute-augmented Semantic Hierarchy (A2 SH) and demonstrates its effectiveness in bridging both the semantic and intention gaps in Content-based Image Retrieval (CBIR). A2 SH organizes the semantic concepts into multiple semantic levels and augments each concept with a set of related attributes, which describe the multiple facets of the concept and act as the intermediate bridge connecting the concept and low-level visual content. A hierarchical semantic similarity function is learnt to characterize the semantic similarities among images for retrieval. To better capture user search intent, a hybrid feedback mechanism is developed, which collects hybrid feedbacks on attributes and images. These feedbacks are then used to refine the search results based on A2 SH. We develop a content-based image retrieval system based on the proposed A2 SH. We conduct extensive experiments on a large-scale data set of over one million Web images. Experimental results show that the proposed A2 SH can characterize the semantic affinities among images accurately and can shape user search intent precisely and quickly, leading to more accurate search results as compared to state-of-the-art CBIR solutions.

References

  1. M. Crucianu, M. Ferecatu, and N. Boujemaa. Relevance feedback for image retrieval: a short survey. DELOS2 Report, 2004.Google ScholarGoogle Scholar
  2. R. Datta, D. Joshi, J. Li, and J. Wang. Image retrieval: Ideas, influences, and trends of the new age. ACM Computing Surveys, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Deng, A. C. Berg, and L. Fei-Fei. Hierarchical semantic indexing for large scale image retrieval. In CVPR, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  5. T. Deselaers and V. Ferrari. Visual and semantic similarity in imagenet. In CVPR, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Douze, A. Ramisa, and C. Schmid. Combining attributes and fisher vectors for efficient image retrieval. In CVPR, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Fan, Y. Gao, and H. Luo. Integrating concept ontology and multitask learning to achieve more effective classifier training for multilevel image annotation. TIP, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Farhadi, I. Endres, D. Hoiem, and D. Forsyth. Describing objects by their attributes. In CVPR, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  9. X. Felix, R. Ji, M. Tsai, G. Ye, and S. Chang. Weak attributes for large-scale image retrieval. In CVPR, 2012.Google ScholarGoogle Scholar
  10. C. Fellbaum. Wordnet. Theory and Applications of Ontology: Computer Applications, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Hanjalic, C. Kofler, and M. Larson. Intent and its discontents: the user at the wheel of the online video search engine. In MM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Jaimes and S. fu Chang. A conceptual framework for indexing visual information at multiple levels. In SPIE Internet Imaging, 2000.Google ScholarGoogle Scholar
  13. A. Kovashka, D. Parikh, and K. Grauman. Whittlesearch: Image search with relative attribute feedback. In CVPR, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. T. Leung and J. Malik. Representing and recognizing the visual appearance of materials using three-dimensional textons. IJCV, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Lew, N. Sebe, C. Djeraba, and R. Jain. Content-based multimedia information retrieval: State of the art and challenges. TOMCCAP, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Z. Ma, Y. Yang, Z. Xu, S. Yan, N. Sebe, and A. G. Hauptmann. Complex event detection via multi-source video attributes. In CVPR, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Marszalek and C. Schmid. Semantic hierarchies for visual object recognition. In CVPR, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  19. F. Monay and D. Gatica-Perez. On image auto-annotation with latent space models. In MM, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. M. Naphade, J. R. Smith, J. Tesic, S.-F. Chang, W. Hsu, L. Kennedy, A. Hauptmann, and J. Curtis. Large-scale concept ontology for multimedia. Multimedia, IEEE, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, B. Shaw, W. Kraaij, A. F. Smeaton, and G. Quéenot. Trecvid 2012 -- an overview of the goals, tasks, data, evaluation mechanisms and metrics. In TRECVID, 2012.Google ScholarGoogle Scholar
  22. D. Parikh and K. Grauman. Interactively building a discriminative vocabulary of nameable attributes. In CVPR, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Y. Rui, T. S. Huang, and S.-F. Chang. Image retrieval: Current techniques, promising directions, and open issues. JVCIR, 1999.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Y. Rui, T. S. Huang, M. Ortega, and S. Mehrotra. Relevance feedback: a power tool for interactive content-based image retrieval. TCSVT, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. O. Russakovsky and L. Fei-Fei. Attribute learning in large-scale datasets. In ECCV, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. W. J. Scheirer, N. Kumar, P. N. Belhumeur, and T. E. Boult. Multi-attribute spaces: Calibration for attribute fusion and similarity search. In CVPR, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  27. N. Sebe, M. S. Lew, X. Zhou, T. S. Huang, and E. M. Bakker. The state of the art in image and video retrieval. In Image and Video Retrieval. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. TPAMI, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. J. R. Smith and S.-F. Chang. Visualseek: a fully automated content-based image query system. In MM, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. C. G. Snoek, B. Huurnink, L. Hollink, M. De Rijke, G. Schreiber, and M. Worring. Adding semantics to detectors for video retrieval. TMM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. C. G. Snoek and M. Worring. Concept-based video retrieval. FTIR, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Y. Song, M. Zhao, J. Yagnik, and X. Wu. Taxonomic classification for web-based videos. In CVPR, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  33. D. Tao, X. Tang, X. Li, and X. Wu. Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval. TPAMI, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. S. Tong and E. Chang. Support vector machine active learning for image retrieval. In MM, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. N. Verma, D. Mahajan, S. Sellamanickam, and V. Nair. Learning hierarchical similarity metrics. In CVPR, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  36. J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong. Locality-constrained linear coding for image classification. In CVPR, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  37. K. Q. Weinberger, J. Blitzer, and L. K. Saul. Distance metric learning for large margin nearest neighbor classification. In NIPS, 2006.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Z.-J. Zha, L. Yang, T. Mei, M. Wang, and Z. Wang. Visual query suggestion. In MM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. H. Zhang, Z.-J. Zha, S. Yan, J. Bian, and T.-S. Chua. Attribute feedback. In MM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. K. Zhang, I. W. Tsang, and J. T. Kwok. Maximum margin clustering made practical. TNN, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '13: Proceedings of the 21st ACM international conference on Multimedia
      October 2013
      1166 pages
      ISBN:9781450324045
      DOI:10.1145/2502081

      Copyright © 2013 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 21 October 2013

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      MM '13 Paper Acceptance Rate47of235submissions,20%Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader