skip to main content
10.1145/3343031.3350900acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval

Authors Info & Claims
Published:15 October 2019Publication History

ABSTRACT

Sketch has been employed as an effective communication tool to express the abstract and intuitive meaning of object. While content-based sketch recognition has been studied for several decades, the instance-level Sketch Based Image Retrieval (iSBIR) task has attracted significant research attention recently. In many previous iSBIR works -- TripletSN, and DSSA, edge maps were employed as intermediate representations in bridging the cross-domain discrepancy between photos and sketches. However, it is nontrivial to efficiently train and effectively use the edge maps in an iSBIR system. Particularly, we find that such an edge map based iSBIR system has several major limitations. First, the system has to be pre-trained on a significant amount of edge maps, either from large-scale sketch datasets, e.g., TU-Berlin~\citeeitz2012hdhso, or converted from other large-scale image datasets, e.g., ImageNet-1K\citedeng2009imagenet dataset. Second, the performance of such an iSBIR system is very sensitive to the quality of edge maps. Third and empirically, the multi-cropping strategy is essentially very important in improving the performance of previous iSBIR systems. To address these limitations, this paper advocates an end-to-end iSBIR system without using the edge maps. Specifically, we present a Triplet Classification Network (TC-Net) for iSBIR which is composed of two major components: triplet Siamese network, and auxiliary classification loss. Our TC-Net can break the limitations existed in previous works. Extensive experiments on several datasets validate the efficacy of the proposed network and system.

References

  1. John Canny. 1986. A computational approach to edge detection. IEEE TPAMI 6 (1986), 679--698.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Xiaochun Cao, Hua Zhang, Si Liu, Xiaojie Guo, and Liang Lin. 2013. Sym-fish: A symmetry-aware flip invariant sketch histogram shape descriptor. In ICCV .Google ScholarGoogle Scholar
  3. Yang Cao, Changhu Wang, Liqing Zhang, and Lei Zhang. 2011. Edgel index for large-scale sketch-based image search. In CVPR .Google ScholarGoogle Scholar
  4. De Cheng, Yihong Gong, Sanping Zhou, JinjunWang, and Nanning Zheng. 2016a. Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function. In CVPR .Google ScholarGoogle Scholar
  5. De Cheng, Yihong Gong, Sanping Zhou, Jinjun Wang, and Nanning Zheng. 2016b. Person re-identification by multi-channel parts-based cnn with improved triplet loss function. In CVPR .Google ScholarGoogle Scholar
  6. Jia Deng, Wei Dong, R. Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In CVPR . https://doi.org/10.1109/CVPR.2009.5206848Google ScholarGoogle Scholar
  7. Jiankang Deng, Yuxiang Zhou, and Stefanos Zafeiriou. 2017. Marginal loss for deep face recognition. In CVPR, Faces in-the-wild Workshop/Challenge .Google ScholarGoogle Scholar
  8. Mathias Eitz, James Hays, and Marc Alexa. 2012. How Do Humans Sketch Objects? ACM SIGGRAPH , Vol. 31, 4 (2012), 44:1--44:10.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Mathias Eitz, Kristian Hildebrand, Tamy Boubekeur, and Marc Alexa. 2010. An evaluation of descriptors for large-scale image retrieval from sketched feature lines. Computers & Graphics , Vol. 34, 5 (2010), 482--498.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Mathias Eitz, Kristian Hildebrand, Tamy Boubekeur, and Marc Alexa. 2011. Sketch-based image retrieval: Benchmark and bag-of-features descriptors. TVCG .Google ScholarGoogle Scholar
  11. Wang F., Kang L., and Li Y. 2015. Sketch-based 3d shape retrieval using convolutional neural networks. In CVPR .Google ScholarGoogle Scholar
  12. Yunchao Gong, Qifa Ke, Michael Isard, and Svetlana Lazebnik. 2013. A Multi-View Embedding Space for Modeling Internet Images, Tags, and their Semantics. IJCV (2013).Google ScholarGoogle Scholar
  13. Alexander Hermans, Lucas Beyer, and Bastian Leibe. 2017. In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737 (2017).Google ScholarGoogle Scholar
  14. R. Hu and J. Collomosse. 2013. A performance evaluation of gradient field HOG descriptor for sketch based image retrieval. CVIU (2013).Google ScholarGoogle Scholar
  15. R. Hu, T. Wang, and J. Collomosse. 2011. A bag-of-regions approach to sketch based image retrieval. In ICIP .Google ScholarGoogle Scholar
  16. Gao Huang, Zhuang Liu, Kilian Q Weinberger, and Laurens van der Maaten. 2017. Densely connected convolutional networks. In CVPR .Google ScholarGoogle Scholar
  17. J. Huang, R. S. Feris, Q. Chen, and S. Yan. 2015. Cross-domain image retrieval with a dual attribute-aware ranking network. In ICCV .Google ScholarGoogle Scholar
  18. Yu-Gang Jiang, Minjun Li, Xi Wang, Wei Liu, and Xian-Sheng Hua. 2018. DeepProduct: Mobile product search with portable deep features. ACM TOMM , Vol. 14, 2 (2018), 50.Google ScholarGoogle Scholar
  19. T. Kato, T. Kurita, N. Otsu, and K. Hirata. 1992. A sketch retrieval method for full color image database-query by visual example. In IAPR .Google ScholarGoogle Scholar
  20. Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In NIPS .Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Ke Li, Kaiyue Pang, Jifei Song, Yi-Zhe Song, Tao Xiang, Timothy M., Hospedales, and Honggang Zhang. 2018. Universal Sketch Perceptual Grouping. In arxiv .Google ScholarGoogle Scholar
  22. Jiawei Liu, Zheng-Jun Zha, QI Tian, Dong Liu, Ting Yao, Qiang Ling, and Tao Mei. 2016. Multi-Scale Triplet CNN for Person Re-Identification. In ACM Multimedia .Google ScholarGoogle Scholar
  23. Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, and Ling Shao. 2017a. Deep sketch hashing: Fast free-hand sketch-based image retrieval. In CVPR .Google ScholarGoogle Scholar
  24. Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, and Le Song. 2017b. Sphereface: Deep hypersphere embedding for face recognition. In CVPR .Google ScholarGoogle Scholar
  25. E. Mathias, H. Kristian, B. Tamy, and A. Marc. 2011. Sketch-based image retrieval: Benchmark and bag-of-features descriptors. TVCG (2011).Google ScholarGoogle Scholar
  26. Umar Riaz Muhammad, Yongxin Yang, Yi-Zhe Song, Tao Xiang, Timothy M Hospedales, et almbox. 2018. Learning Deep Sketch Abstraction. arXiv preprint arXiv:1804.04804 (2018).Google ScholarGoogle Scholar
  27. Filip Radenovi´c, Giorgos Tolias, and Ondrej Chum. 2018. Deep Shape Matching. In arxiv .Google ScholarGoogle Scholar
  28. Jose M Saavedra, Juan Manuel Barrios, and S Orand. 2015. Sketch based Image Retrieval using Learned KeyShapes (LKS).. In BMVC .Google ScholarGoogle Scholar
  29. Patsorn Sangkloy, Nathan Burnell, Cusuh Ham, and James Hays. 2016. The Sketchy Database: Learning to Retrieve Badly Drawn Bunnies. ACM SIGGRAPH (2016).Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. Facenet: A unified embedding for face recognition and clustering. In CVPR .Google ScholarGoogle Scholar
  31. Karen Simonyan and Andrew Zisserman. 2015. VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION. In ICLR .Google ScholarGoogle Scholar
  32. Jifei Song, Yu Qian, Yi-Zhe Song, Tao Xiang, and Timothy Hospedales. 2017. Deep spatial-semantic attention for fine-grained sketch-based image retrieval. In ICCV .Google ScholarGoogle Scholar
  33. Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, and Lior Wolf. 2014. Deepface: Closing the gap to human-level performance in face verification. In CVPR .Google ScholarGoogle Scholar
  34. Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Zhifeng Li, Dihong Gong, Jingchao Zhou, and Wei Liu. 2018. CosFace: Large margin cosine loss for deep face recognition. arXiv preprint arXiv:1801.09414 (2018).Google ScholarGoogle Scholar
  35. Yandong Wen, Kaipeng Zhang, Zhifeng Li, and Yu Qiao. 2016. A discriminative feature learning approach for deep face recognition. In ECCV .Google ScholarGoogle Scholar
  36. Holger Winnemöller, Jan Eric Kyprianidis, and Sven C Olsen. 2012. XDoG: an extended difference-of-Gaussians compendium including advanced image stylization. Computers & Graphics , Vol. 36, 6 (2012), 740--753.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Peng Xu, Yongye Huang, Tongtong Yuan, Kaiyue Pang, Yi-Zhe Song, Tao Xiang, Timothy M Hospedales, Zhanyu Ma, and Jun Guo. 2018. Sketchmate: Deep hashing for million-scale human sketch retrieval. In CVPR .Google ScholarGoogle Scholar
  38. Peng Xu, Qiyue Yin, Yonggang Qi, Yi-Zhe Song, Zhanyu Ma, Liang Wang, and Jun Guo. 2016. Instance-level coupled subspace learning for fine-grained sketch-based image retrieval. In European Conference on Computer Vision. Springer, 19--34.Google ScholarGoogle ScholarCross RefCross Ref
  39. Weidong Yin, Yanwei Fu, Yiqiang Ma, Yu-Gang Jiang, Tao Xiang, and Xiangyang Xue. 2017. Learning to Generate and Edit Hairstyles. In ACM Multimedia .Google ScholarGoogle Scholar
  40. Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, and Chen Change Loy. 2016. Sketch Me That Shoe. In CVPR .Google ScholarGoogle Scholar
  41. Qian Yu, Yongxin Yang, Feng Liu, Yi-Zhe Song, Tao Xiang, and Timothy M. Hospedales. 2017. Sketch-a-Net: a Deep Neural Network that Beats Humans. IJCV (2017).Google ScholarGoogle Scholar
  42. Xiao Zhang, Zhiyuan Fang, Yandong Wen, Zhifeng Li, and Yu Qiao. 2017. Range loss for deep face recognition with long-tailed training data. In CVPR .Google ScholarGoogle Scholar
  43. Han Zhu, Mingsheng Long, Jianmin Wang, and Yue Cao. 2016. Deep Hashing Network for Efficient Similarity Retrieval.. In AAAI .Google ScholarGoogle Scholar

Index Terms

  1. TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '19: Proceedings of the 27th ACM International Conference on Multimedia
      October 2019
      2794 pages
      ISBN:9781450368896
      DOI:10.1145/3343031

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 15 October 2019

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      MM '19 Paper Acceptance Rate252of936submissions,27%Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader