research-article

Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval

Authors:
Hanwang Zhang
School of Computing, National University of Singapore, Singapore, Singapore

Zheng-Jun Zha
Institute of Intelligent Machines, Chinese Academy of Sciences, He Fei, China

Yang Yang
School of Computing, National University of Singapore, Singapore, Singapore

Shuicheng Yan
Electrical Computer Engineering, Chinese Academy of Sciences, Singapore, Singapore

Yue Gao
School of Computing, National University of Singapore, Singapore, Singapore

Tat-Seng Chua
School of Computing, National University of Singapore, Singapore, Singapore

MM '13: Proceedings of the 21st ACM international conference on Multimedia, October 2013, Pages 33–42
https://doi.org/10.1145/2502081.2502093

Published: 21 October 2013

ABSTRACT

This paper presents a novel Attribute-augmented Semantic Hierarchy (A²SH) and demonstrates its effectiveness in bridging both the semantic and intention gaps in Content-based Image Retrieval (CBIR). A²SH organizes semantic concepts into multiple semantic levels and augments each concept with a set of related attributes, which describe the multiple facets of the concept and act as an intermediate bridge connecting the concept to low-level visual content. A hierarchical semantic similarity function is learnt to characterize the semantic similarities among images for retrieval. To better capture user search intent, a hybrid feedback mechanism is developed, which collects feedback on both attributes and images. This feedback is then used to refine the search results based on A²SH. We build a CBIR system on the proposed A²SH and conduct extensive experiments on a large-scale data set of over one million Web images. Experimental results show that the proposed A²SH can characterize the semantic affinities among images accurately and can shape user search intent precisely and quickly, leading to more accurate search results than state-of-the-art CBIR solutions.
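To make the structure described in the abstract concrete, here is a minimal Python sketch of a concept hierarchy in which each concept carries its own attributes, with a toy similarity that accumulates attribute agreement over the concepts two images share from the root down. This is an illustration only and not the paper's learnt similarity function: the `Concept` class, the functions `attribute_similarity` and `hierarchical_similarity`, the attribute names, and the scores are all hypothetical, and the per-level agreement is a simple hand-written measure rather than anything learnt from data.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Concept:
    """A node in a toy semantic hierarchy (hypothetical, for illustration)."""
    name: str
    attributes: List[str]
    parent: Optional["Concept"] = None

    def path_from_root(self) -> List["Concept"]:
        """Return the chain of concepts from the root down to this node."""
        node, path = self, []
        while node is not None:
            path.append(node)
            node = node.parent
        return path[::-1]


def attribute_similarity(a: Dict[str, float], b: Dict[str, float],
                         attrs: List[str]) -> float:
    """Mean agreement of two images' attribute scores (in [0, 1]) over attrs."""
    if not attrs:
        return 0.0
    return sum(1.0 - abs(a.get(k, 0.0) - b.get(k, 0.0)) for k in attrs) / len(attrs)


def hierarchical_similarity(concept_a: Concept, scores_a: Dict[str, float],
                            concept_b: Concept, scores_b: Dict[str, float]) -> float:
    """Sum attribute agreement over the concepts shared by both root paths."""
    total = 0.0
    for node_a, node_b in zip(concept_a.path_from_root(), concept_b.path_from_root()):
        if node_a is not node_b:
            break  # the two paths diverge; deeper levels are not shared
        total += attribute_similarity(scores_a, scores_b, node_a.attributes)
    return total


# Hypothetical two-level hierarchy and attribute scores (assumed values).
animal = Concept("animal", ["furry", "has_legs"])
dog = Concept("dog", ["snout"], parent=animal)
cat = Concept("cat", ["whiskers"], parent=animal)

img1 = {"furry": 1.0, "has_legs": 1.0, "snout": 1.0}
img2 = {"furry": 1.0, "has_legs": 1.0, "snout": 0.5}
img3 = {"furry": 0.5, "has_legs": 1.0, "whiskers": 1.0}

same_leaf = hierarchical_similarity(dog, img1, dog, img2)
sibling_leaf = hierarchical_similarity(dog, img1, cat, img3)
```

In this sketch, two images under the same leaf concept are compared at every level of the hierarchy, while images under sibling concepts are compared only on the attributes of their common ancestors, so `same_leaf` comes out larger than `sibling_leaf`.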


Index Terms

Information systems → Information retrieval → Retrieval models and ranking


Published in
MM '13: Proceedings of the 21st ACM international conference on Multimedia
October 2013
1166 pages
ISBN: 9781450324045
DOI: 10.1145/2502081
General Chairs: Alejandro (Alex) Jaimes (Yahoo!, Spain), Nicu Sebe (University of Trento, Italy), Nozha Boujemaa (INRIA, France)
Program Chairs: Daniel Gatica-Perez (IDIAP & EPFL, Switzerland), David A. Shamma (Yahoo!, USA), Marcel Worring (University of Amsterdam, The Netherlands), Roger Zimmermann (National University of Singapore, Singapore)
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
attribute
image retrieval
semantic hierarchy
Acceptance Rates
MM '13 Paper Acceptance Rate: 47 of 235 submissions, 20%
Overall Acceptance Rate: 995 of 4,171 submissions, 24%