ABSTRACT
One of the fundamental problems in image search is to rank image documents according to a given textual query. We address two limitations of the existing image search engines in this paper. First, there is no straightforward way of comparing textual keywords with visual image content. Image search engines therefore highly depend on the surrounding texts, which are often noisy or too few to accurately describe the image content. Second, ranking functions are trained on query-image pairs labeled by human labelers, making the annotation intellectually expensive and thus cannot be scaled~up.
We demonstrate that the above two fundamental challenges can be mitigated by jointly exploring the subspace learning and the use of click-through data. The former aims to create a latent subspace with the ability in comparing information from the original incomparable views (i.e., textual and visual views), while the latter explores the largely available and freely accessible click-through data (i.e., "crowdsourced" human intelligence) for understanding query. Specifically, we investigate a series of click-through-based subspace learning techniques (CSL) for image search. We conduct experiments on MSR-Bing Grand Challenge and the final evaluation performance achieves DCG@25=0.47225. Moreover, the feature dimension is significantly reduced by several orders of magnitude (e.g., from thousands to tens).
- J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell. Decaf: A deep convolutional activation feature for generic visual recognition. arXiv preprint arXiv:1310.1531, 2013.Google Scholar
- Y. Gong, Q. Ke, M. Isard, and S. Lazebnik. A multi-view embedding space for modeling internet images, tags, and their semantics. IJCV, pages 210--233, 2014. Google ScholarDigital Library
- D. Grangier and S. Bengio. A discriminative kernel-based model to rank images from text queries. IEEE Trans. on PAMI, 30(8):1371--1384, 2008. Google ScholarDigital Library
- X.-S. Hua, L. Yang, J. Wang, J. Wang, M. Ye, K. Wang, Y. Rui, and J. Li. Clickage: Towards bridging semantic and intent gaps via mining click logs of search engines. In ACM MM, 2013. Google ScholarDigital Library
- T. Joachims. Optimizing search engines using clickthrough data. In KDD, 2002. Google ScholarDigital Library
- Y. Pan, T. Yao, T. Mei, H. Li, C.-W. Ngo, and Y. Rui. Click-through-based cross-view learning for image search. In SIGIR, 2014. Google ScholarDigital Library
- Y. Pan, T. Yao, K. Yang, H. Li, C.-W. Ngo, J. Wang, and T. Mei. Image search by graph-based label propagation with image representation from dnn. In ACM MM, 2013. Google ScholarDigital Library
- Y. Rui, T. Huang, and S. Chang. Image retrieval: current techniques, promising directions, and open issues. VCIR, 10(1):39--62, 1999.Google ScholarDigital Library
- C. Xu, D. Tao, and C. Xu. A survey on multi-view learning. CoRR abs/1304.5634, 2013.Google Scholar
- T. Yao, T. Mei, C.-W. Ngo, and S. Li. Annotation for free: Video tagging by mining user search behavior. In ACM MM, 2013. Google ScholarDigital Library
Index Terms
- Click-through-based Subspace Learning for Image Search
Recommendations
Click-through-based cross-view learning for image search
SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrievalOne of the fundamental problems in image search is to rank image documents according to a given textual query. Existing search engines highly depend on surrounding texts for ranking images, or leverage the query-image pairs annotated by human labelers ...
Click-boosting multi-modality graph-based reranking for image search
Image reranking is an effective way for improving the retrieval performance of keyword-based image search engines. A fundamental issue underlying the success of existing image reranking approaches is the ability in identifying potentially useful ...
Optimizing web search using web click-through data
CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge managementThe performance of web search engines may often deteriorate due to the diversity and noisy information contained within web pages. User click-through data can be used to introduce more accurate description (metadata) for web pages, and to improve the ...
Comments