Article

Scalable search-based image annotation of personal images

Authors:
Changhu Wang

University of Science and Technology of China, Hefei, China

University of Science and Technology of China, Hefei, China
View Profile

,
Feng Jing

Microsoft Research Asia, Beijing, China

Microsoft Research Asia, Beijing, China
View Profile

,
Lei Zhang

Microsoft Research Asia, Beijing, China

Microsoft Research Asia, Beijing, China
View Profile

,
Hong-Jiang Zhang

Microsoft Research Asia, Beijing, China

Microsoft Research Asia, Beijing, China
View Profile

MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrievalOctober 2006Pages 269–278https://doi.org/10.1145/1178677.1178714

Published:26 October 2006Publication History

MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval

Pages 269–278

ABSTRACT

With the prevalence of digital cameras, more and more people have considerable digital images on their personal devices. As a result, there are increasing needs to effectively search these personal images. Automatic image annotation may serve the goal, for the annotated keywords could facilitate the search processes. Although many image annotation methods have been proposed in recent years, their effectiveness on arbitrary personal images is constrained by their limited scalability, i.e. limited lexicon of small-scale training set. To be scalable, we propose a search-based image annotation (SBIA) algorithm that is analogous to Web page search. First, content-based image retrieval (CBIR) technology is used to retrieve a set of visually similar images from a large-scale Web image set. Then, a text-based keyword search (TBKS) technique is used to obtain a ranked list of candidate annotations for each retrieved image. Finally, a fusion algorithm is used to combine the ranked lists into the final annotation list. The application of both efficient search technologies and Web-scale image set guarantees the scalability of the proposed algorithm. Experimental results on U. Washington dataset show not only the effectiveness and efficiency of the proposed algorithm but also the advantage of image retrieval using annotation results over that using visual features.

References

http://www.cs.washington.edu/research/imagedatabase/groundtruth/Google Scholar
http://www.photosig.comGoogle Scholar
Baeza-Yates, R. A. and Ribeiro-Neto, B. 1999 Modern Information Retrieval. Addison-Wesley Longman Publishing Co., Inc. Google ScholarDigital Library
Blei, D. M. and Jordan, M. I. 2003. Modeling annotated data. In Proceedings of the 26th Annual international ACM SIGIR Conference on Research and Development in informaion Retrieval. New York, NY, 127--134. Google ScholarDigital Library
Brown, P. F., deSouza, P. V., Mercer, R. L., Pietra, V. J., and Lai, J. C. 1992. Class-based n-gram models of natural language. Comput. Linguist. 18, 4 (Dec. 1992), 467--479. Google ScholarDigital Library
Chang, E., Kingshy, G., Sychay, G., and Wu, G. CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines. IEEE Trans. on CSVT, 13(1):26--38, Jan. 2003. Google ScholarDigital Library
Cusano, C., Ciocca, G., and Schettini, R. Image annotation using SVM. In Proc. Of Internet imaging IV, Vol. SPIE, 2004.Google Scholar
Duygulu, P. and Barnard, K. Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Seventh European Conference on Computer Vision, 4:97--112, 2002. Google ScholarDigital Library
Fan, X., Xie, X., Li, Z., Li, M., and Ma, W. Y. Photo-to- Search: Using Multimodal Queries to Search the Web from Mobile Devices. 7th ACM SIGMM Workshop on MIR, 2005. Google ScholarDigital Library
Feng, S. L., Manmatha, R., and Lavrenko, V. Multiple bernoulli relevance models for image and video annotation. In The International Conference on Computer Vision and Pattern Recognition, Washington, DC, June, 2004. Google ScholarDigital Library
Ferhatosmanoglu, H., Tuncel, E., Agrawal, D., and Abbadi, A. E. Approximate nearest neighbor searching in multimedia databases. In Proceedings of the 17th IEEE Int'l. Conference on Data Engineering, Heidelberg, Germany, April, 2001, 2-6, pp. 503--511. Google ScholarDigital Library
Jeon, J., Lavrenko, V., and Manmatha, R. Automatic Image Annotation and Retrieval Using Cross-media Relevance Models. In Proc. of ACM SIGIR conference on Research and development in information retrieval, pp. 119--126, July, 2003. Google ScholarDigital Library
Lavrenko, V., Manmatha, R., and Jeon, J. A Model for Learning the Semantics of Pictures. In Proc. of the 17th Annual Conf. on Neural Information Processing Systems, 2003.Google Scholar
Li, J. and Wang, J. Z. Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans. on PAMI, 25(10), Oct. 2003. Google ScholarDigital Library
Miller, G. A. 1995. WordNet: a lexical database for English. Commun. ACM 38, 11 (Nov. 1995), 39--41. Google ScholarDigital Library
Mori, Y., Takahashi, H., and Oka, R. Image-to-word transformation based on dividing and vector quantizing images with words. In MISRM'99 First International Workshop on Multimedia Intelligent Storage and Retrieval Management, 1999.Google Scholar
Page, L., Brin, S., Motwani, R., and Winograd, T. The Pagerank Citation Ranking: Bringing Order to the web, technical report, Stanford University, Stanford, CA, 1998.Google Scholar
Robertson, S. E. and Walker, S. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 345--354. Springer-Verlag, 1994. Google ScholarDigital Library
Smeulders, A. W. M., Worring, M., Santini, S., Gupta, A., and Jain, R. Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell., 22(12):1349--1380, 2000. Google ScholarDigital Library
Wang, X. J., Zhang, L., Jing, F., and Ma, W. Y. AnnoSearch: Image Auto-Annotation by Search. In The International Conference on Computer Vision and Pattern Recognition, New York, June, 2006. Google ScholarDigital Library
Yanai, K. and Barnard, K. 2005. Image region entropy: a measure of "visualness" of web images associated with one concept. In Proceedings of the 13th Annual ACM international Conference on Multimedia. New York, NY, 419--422. Google ScholarDigital Library
Yeh, T., Tollmar, K., and Darrell, T. Searching the Web with Mobile Images for Location Recognition. In The International Conference on Computer Vision and Pattern Recognition, 2004, pp. 76--81. Google ScholarDigital Library
Zeng, H., He, Q., Chen, Z., Ma, W., and Ma, J. 2004. Learning to cluster web search results. SIGIR, 2004. New York, NY, 210--217. Google ScholarDigital Library
Zhang, L., Hu, Y., Li, M., Ma, W., and Zhang, H. 2004. Efficient propagation for face annotation in family albums. In Proceedings of the 12th Annual ACM international Conference on Multimedia. New York, NY, 716--723. Google ScholarDigital Library

Index Terms

Scalable search-based image annotation of personal images
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking

Recommendations

ConceptRank for search-based image annotation

Multimedia information is becoming an ubiquitous part of our lives, which brings an equally ubiquitous need for efficient multimedia retrieval. One of the possible solutions to this problem is to attach text descriptions to multimedia data objects, thus ...
Read More
Image annotation by large-scale content-based image retrieval
MM '06: Proceedings of the 14th ACM international conference on Multimedia

Image annotation has been an active research topic in recent years due to its potentially large impact on both image understanding and Web image search. In this paper, we target at solving the automatic image annotation problem in a novel search and ...
Read More
Learning to reduce the semantic gap in web image retrieval and annotation
SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval

We study in this paper the problem of bridging the semantic gap between low-level image features and high-level semantic concepts, which is the key hindrance in content-based image retrieval. Piloted by the rich textual information of Web images, the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval
October 2006
344 pages
ISBN:1595934952
DOI:10.1145/1178677
General Chairs:
James Z. Wang
The Pennsylvania State University
,
Nozha Boujemaa
INRIA Rocquencourt, France
,
Yixin Chen
The University of Mississippi
Copyright © 2006 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 26 October 2006
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
automatic image annotation
content-based image retrieval
query by keyword
search-based image annotation
text-based keyword search
text-based web search
Qualifiers
- Article
Conference
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 35
  Total Citations
  View Citations
- 726
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Scalable search-based image annotation of personal images

MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

ConceptRank for search-based image annotation

Image annotation by large-scale content-based image retrieval

Learning to reduce the semantic gap in web image retrieval and annotation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Scalable search-based image annotation of personal images

MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

ConceptRank for search-based image annotation

Image annotation by large-scale content-based image retrieval

Learning to reduce the semantic gap in web image retrieval and annotation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media