skip to main content
10.1145/1178677.1178714acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Scalable search-based image annotation of personal images

Published:26 October 2006Publication History

ABSTRACT

With the prevalence of digital cameras, more and more people have considerable digital images on their personal devices. As a result, there are increasing needs to effectively search these personal images. Automatic image annotation may serve the goal, for the annotated keywords could facilitate the search processes. Although many image annotation methods have been proposed in recent years, their effectiveness on arbitrary personal images is constrained by their limited scalability, i.e. limited lexicon of small-scale training set. To be scalable, we propose a search-based image annotation (SBIA) algorithm that is analogous to Web page search. First, content-based image retrieval (CBIR) technology is used to retrieve a set of visually similar images from a large-scale Web image set. Then, a text-based keyword search (TBKS) technique is used to obtain a ranked list of candidate annotations for each retrieved image. Finally, a fusion algorithm is used to combine the ranked lists into the final annotation list. The application of both efficient search technologies and Web-scale image set guarantees the scalability of the proposed algorithm. Experimental results on U. Washington dataset show not only the effectiveness and efficiency of the proposed algorithm but also the advantage of image retrieval using annotation results over that using visual features.

References

  1. http://www.cs.washington.edu/research/imagedatabase/groundtruth/Google ScholarGoogle Scholar
  2. http://www.photosig.comGoogle ScholarGoogle Scholar
  3. Baeza-Yates, R. A. and Ribeiro-Neto, B. 1999 Modern Information Retrieval. Addison-Wesley Longman Publishing Co., Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Blei, D. M. and Jordan, M. I. 2003. Modeling annotated data. In Proceedings of the 26th Annual international ACM SIGIR Conference on Research and Development in informaion Retrieval. New York, NY, 127--134. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Brown, P. F., deSouza, P. V., Mercer, R. L., Pietra, V. J., and Lai, J. C. 1992. Class-based n-gram models of natural language. Comput. Linguist. 18, 4 (Dec. 1992), 467--479. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Chang, E., Kingshy, G., Sychay, G., and Wu, G. CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines. IEEE Trans. on CSVT, 13(1):26--38, Jan. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Cusano, C., Ciocca, G., and Schettini, R. Image annotation using SVM. In Proc. Of Internet imaging IV, Vol. SPIE, 2004.Google ScholarGoogle Scholar
  8. Duygulu, P. and Barnard, K. Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Seventh European Conference on Computer Vision, 4:97--112, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Fan, X., Xie, X., Li, Z., Li, M., and Ma, W. Y. Photo-to- Search: Using Multimodal Queries to Search the Web from Mobile Devices. 7th ACM SIGMM Workshop on MIR, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Feng, S. L., Manmatha, R., and Lavrenko, V. Multiple bernoulli relevance models for image and video annotation. In The International Conference on Computer Vision and Pattern Recognition, Washington, DC, June, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Ferhatosmanoglu, H., Tuncel, E., Agrawal, D., and Abbadi, A. E. Approximate nearest neighbor searching in multimedia databases. In Proceedings of the 17th IEEE Int'l. Conference on Data Engineering, Heidelberg, Germany, April, 2001, 2-6, pp. 503--511. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Jeon, J., Lavrenko, V., and Manmatha, R. Automatic Image Annotation and Retrieval Using Cross-media Relevance Models. In Proc. of ACM SIGIR conference on Research and development in information retrieval, pp. 119--126, July, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Lavrenko, V., Manmatha, R., and Jeon, J. A Model for Learning the Semantics of Pictures. In Proc. of the 17th Annual Conf. on Neural Information Processing Systems, 2003.Google ScholarGoogle Scholar
  14. Li, J. and Wang, J. Z. Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans. on PAMI, 25(10), Oct. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Miller, G. A. 1995. WordNet: a lexical database for English. Commun. ACM 38, 11 (Nov. 1995), 39--41. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Mori, Y., Takahashi, H., and Oka, R. Image-to-word transformation based on dividing and vector quantizing images with words. In MISRM'99 First International Workshop on Multimedia Intelligent Storage and Retrieval Management, 1999.Google ScholarGoogle Scholar
  17. Page, L., Brin, S., Motwani, R., and Winograd, T. The Pagerank Citation Ranking: Bringing Order to the web, technical report, Stanford University, Stanford, CA, 1998.Google ScholarGoogle Scholar
  18. Robertson, S. E. and Walker, S. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 345--354. Springer-Verlag, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Smeulders, A. W. M., Worring, M., Santini, S., Gupta, A., and Jain, R. Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell., 22(12):1349--1380, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Wang, X. J., Zhang, L., Jing, F., and Ma, W. Y. AnnoSearch: Image Auto-Annotation by Search. In The International Conference on Computer Vision and Pattern Recognition, New York, June, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Yanai, K. and Barnard, K. 2005. Image region entropy: a measure of "visualness" of web images associated with one concept. In Proceedings of the 13th Annual ACM international Conference on Multimedia. New York, NY, 419--422. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Yeh, T., Tollmar, K., and Darrell, T. Searching the Web with Mobile Images for Location Recognition. In The International Conference on Computer Vision and Pattern Recognition, 2004, pp. 76--81. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Zeng, H., He, Q., Chen, Z., Ma, W., and Ma, J. 2004. Learning to cluster web search results. SIGIR, 2004. New York, NY, 210--217. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Zhang, L., Hu, Y., Li, M., Ma, W., and Zhang, H. 2004. Efficient propagation for face annotation in family albums. In Proceedings of the 12th Annual ACM international Conference on Multimedia. New York, NY, 716--723. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Scalable search-based image annotation of personal images

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader