skip to main content
10.1145/2578726.2578734acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
tutorial

Sparse Kernel Learning for Image Annotation

Authors Info & Claims
Published:01 April 2014Publication History

ABSTRACT

In this paper we introduce a sparse kernel learning framework for the Continuous Relevance Model (CRM). State-of-the-art image annotation models linearly combine evidence from several different feature types to improve image annotation accuracy. While previous authors have focused on learning the linear combination weights for these features, there has been no work examining the optimal combination of kernels. We address this gap by formulating a sparse kernel learning framework for the CRM, dubbed the SKL-CRM, that greedily selects an optimal combination of kernels. Our kernel learning framework rapidly converges to an annotation accuracy that substantially outperforms a host of state-of-the-art annotation models. We make two surprising conclusions: firstly, if the kernels are chosen correctly, only a very small number of features are required so to achieve superior performance over models that utilise a full suite of feature types; and secondly, the standard default selection of kernels commonly used in the literature is sub-optimal, and it is much better to adapt the kernel choice based on the feature type and image dataset.

References

  1. K. Barnard, P. Duygulu, N. de Freitas, D. Forsyth, D. Blei, and M. Jordan. Matching words and pictures. In JMLR'03. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. G. Carneiro, A. B. Chan, P. J. Moreno, and N. Vasconcelos. Supervised learning of semantic classes for image annotation and retrieval. In PAMI'07.Google ScholarGoogle Scholar
  3. W. S. Cooper. Some inconsistencies and misidentified modeling assumptions in probabilistic information retrieval. In TOIS'95.Google ScholarGoogle Scholar
  4. P. Duygulu, K. Barnard, N. de Freitas, and D. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In ECCV '02. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Feng, R. Manmatha, and V. Lavrenko. Multiple bernoulli relevance models for image and video annotation. In CVPR'04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. Fu, Q. Zhang, and G. Qiu. Random forest for image annotation. In ECCV'12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. Grangier and S. Bengio. A discriminative kernel-based approach to rank images from text queries. In PAMI'08.Google ScholarGoogle Scholar
  8. M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation. In ICCV'09.Google ScholarGoogle Scholar
  9. C. Hentschel, S. Stober, A. Nürnberger, and M. Detyniecki. Automatic image annotation using a visual dictionary based on reliable image segmentation. In AMR'08.Google ScholarGoogle Scholar
  10. V. Lavrenko, S. Feng, and R. Manmatha. Statistical models for automatic video annotation and retrieval. In ICASSP'04.Google ScholarGoogle Scholar
  11. V. Lavrenko, R. Manmatha, and J. Jeon. A model for learning the semantics of pictures. In NIPS'03.Google ScholarGoogle Scholar
  12. J. Liu, M. Li, Q. Liu, H. Lu, and S. Ma. Image annotation via graph learning. In JPR'09.Google ScholarGoogle Scholar
  13. A. Makadia, V. Pavlovic, and S. Kumar. A new baseline for image annotation. In ECCV '08. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Markkula and E. Sormunen. End-user searching challenges indexing practices in the digital newspaper photo archive. In IR'00.Google ScholarGoogle Scholar
  15. D. Metzler and R. Manmatha. An inference network approach to image retrieval. In CIVR'04.Google ScholarGoogle Scholar
  16. S. Moran and V. Lavrenko. Optimal tag sets for automatic image annotation. In BMVC'11.Google ScholarGoogle Scholar
  17. H. Nakayama. Linear distance metric Learning for large-scale generic image recognition. PhD thesis, The University of Tokyo, Japan, 2011.Google ScholarGoogle Scholar
  18. P. Richtárik and M. Takác. Distributed coordinate descent method for learning with big data. In CoRR'13.Google ScholarGoogle Scholar
  19. Y. Verma and C. V. Jawahar. Exploring svm for image annotation in presence of confusing labels. In BMVC'13.Google ScholarGoogle Scholar
  20. Y. Verma and C. V. Jawahar. Image annotation using metric learning in semantic neighbourhoods. In ECCV'12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. K. Q. Weinberger and L. K. Saul. Distance metric learning for large margin nearest neighbor classification. In JMLR'09. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Y. Xiang, X. Zhou, T.-S. Chua, and C.-W. Ngo. A revisit of generative model for automatic image annotation using markov random fields. In CVPR'09.Google ScholarGoogle Scholar
  23. O. Yakhnenko and V. Honavar. Annotating images and image objects using a hierarchical dirichlet process model. In MDM '08. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. A. Yavlinsky, E. Schofield, and S. Rüger. Automated image annotation using global features and robust nonparametric density estimation. In CIVR'05. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Zhang, J. Huang, Y. Huang, Y. Yu, H. Li, and D. N. Metaxas. Automatic image annotation using group sparsity. In CVPR'10.Google ScholarGoogle Scholar

Index Terms

  1. Sparse Kernel Learning for Image Annotation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICMR '14: Proceedings of International Conference on Multimedia Retrieval
        April 2014
        564 pages
        ISBN:9781450327824
        DOI:10.1145/2578726

        Copyright © 2014 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 April 2014

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • tutorial
        • Research
        • Refereed limited

        Acceptance Rates

        ICMR '14 Paper Acceptance Rate21of111submissions,19%Overall Acceptance Rate254of830submissions,31%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader