short-paper

Semi-supervised topic modeling for image annotation

Authors:
Yuanlong Shao

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
View Profile

,
Yuan Zhou

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
View Profile

,
Xiaofei He

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
View Profile

,
Deng Cai

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
View Profile

,
Hujun Bao

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China

State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
View Profile

MM '09: Proceedings of the 17th ACM international conference on MultimediaOctober 2009Pages 521–524https://doi.org/10.1145/1631272.1631346

Published:19 October 2009Publication History

MM '09: Proceedings of the 17th ACM international conference on Multimedia

Pages 521–524

ABSTRACT

We propose a novel technique for semi-supervised image annotation which introduces a harmonic regularizer based on the graph Laplacian of the data into the probabilistic semantic model for learning latent topics of the images. By using a probabilistic semantic model, we connect visual features and textual annotations of images by their latent topics. Meanwhile, we incorporate the manifold assumption into the model to say that the probabilities of latent topics of images are drawn from a manifold, so that for images sharing similar visual features or the same annotations, their probability distribution of latent topics should also be similar. We create a nearest neighbor graph to model the manifold and propose a regularized EM algorithm to simultaneously learn a generative model and assign probability density of latent topics to images discriminatively. In this way, databases with very few labeled images can be annotated better than previous works.

References

K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3:1107--1135, 2003. Google ScholarDigital Library
M. Belkin, P. Niyogi, and V. Sindhwani. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research, 7:2399--2434, 2006. Google ScholarDigital Library
D. M. Blei and M. I. Jordan. Modeling annotated data. In Proc. ACM Int. Conf. on Research and Development in Informaion Retrieval(ACM SIGIR), pages 127--134, 2003. Google ScholarDigital Library
D. Cai, Q. Mei, J. Han, and C. Zhai. Modeling hidden topics on document manifold. In Proc. ACM Conf. on Information and knowledge management(CIKM'08), pages 911--920, 2008. Google ScholarDigital Library
G. Csurka, C. R. Dance, L. Fan, J. Willamowski, and C. Bray. Visual categorization with bags of keypoints. In Workshop on Statistical Learning in Computer Vision, ECCV, pages 1--22, 2004.Google Scholar
X. He, D. Cai, Y. Shao, H. Bao, and J. Han. Laplacian regularized gaussian mixture model for data clustering. Preprint.Google Scholar
Q. Mei, D. Cai, D. Zhang, and C. Zhai. Topic modeling with network regularization. In Proc. ACM Int. Conf. on World Wide Web (WWW'08), pages 101--110, 2008. Google ScholarDigital Library
F. Monay and D. Gatica-Perez. On image auto-annotation with latent space models. In Proc. ACM Int. Conf. on Multimedia (SIGMM'03), pages 275--278, 2003. Google ScholarDigital Library
F. Monay and D. Gatica-Perez. Plsa-based image auto-annotation: constraining the latent space. In Proc. ACM Int. Conf. on Multimedia (SIGMM'04), pages 348--351, 2004. Google ScholarDigital Library
R. M. Neal and G. E. Hinton. A view of the em algorithm that justifies incremental, sparse, and other variants. In Learning in graphical models, pages 355--368. 1999. Google ScholarDigital Library
R. Zhang, Z. M. Zhang, M. Li, W.-Y. Ma, and H.-J. Zhang. A probabilistic semantic model for image annotation and multi-modal image retrieval. In Proc. IEEE Int. Conf. on Computer Vision (ICCV'05), pages 846--851, 2005. Google ScholarDigital Library
X. Zhu, J. Lafferty, and Z. Ghahramani. Semi-supervised learning using gaussian fields and harmonic functions. In Proc. Int. Conf. Machine Learning(ICML'05), 2005.Google Scholar

Index Terms

Semi-supervised topic modeling for image annotation
1. Information systems
  1. Information retrieval
    1. Document representation
    2. Search engine architectures and scalability
      1. Search engine indexing

Recommendations

Opinion integration through semi-supervised topic modeling
WWW '08: Proceedings of the 17th international conference on World Wide Web

Web 2.0 technology has enabled more and more people to freely express their opinions on the Web, making the Web an extremely valuable source for mining user opinions about all kinds of topics. In this paper we study how to automatically integrate ...
Read More
Automatic image annotation using semi-supervised generative modeling

Image annotation approaches need an annotated dataset to learn a model for the relation between images and words. Unfortunately, preparing a labeled dataset is highly time consuming and expensive. In this work, we describe the development of an ...
Read More
A Novel Region-based Image Annotation Using Multi-instance Learning
WKDD '09: Proceedings of the 2009 Second International Workshop on Knowledge Discovery and Data Mining

In this paper, we formulate image annotation as a semi-supervised learning problem under multi-instance learning framework. A novel graph based semi-supervised learning approach to image annotation using multiple instances is presented, which extends ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '09: Proceedings of the 17th ACM international conference on Multimedia
October 2009
1202 pages
ISBN:9781605586083
DOI:10.1145/1631272
General Chairs:
Wen Gao
Peking University, China
,
Yong Rui
Microsoft, China
,
Alan Hanjalic
Delft University of Technology, The Netherlands
,
Program Chairs:
Changsheng Xu
Institute of Automation, Chinese Academy of Sciences, China
,
Eckehard Steinbach
Technical University of Munich, Germany
,
Abdulmotaleb El Saddik
University of Ottawa, Canada
,
Michelle Zhou
IBM T. J. Watson Research Center, USA
Copyright © 2009 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 19 October 2009
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
automatic image annotation
laplacian regularization
semantic indexing
semi-supervised learning
Qualifiers
- short-paper
Conference

Acceptance Rates
Overall Acceptance Rate995of4,171submissions,24%
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 11
  Total Citations
  View Citations
- 413
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Semi-supervised topic modeling for image annotation

MM '09: Proceedings of the 17th ACM international conference on Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Opinion integration through semi-supervised topic modeling

Automatic image annotation using semi-supervised generative modeling

A Novel Region-based Image Annotation Using Multi-instance Learning