Article

Image retrieval on large-scale image databases

Authors:
Eva Hörster

University of Augsburg, Augsburg, Germany

University of Augsburg, Augsburg, Germany
View Profile

,
Rainer Lienhart

University of Augsburg, Augsburg, Germany

University of Augsburg, Augsburg, Germany
View Profile

,
Malcolm Slaney

Yahoo! Research, Santa Clara, CA

Yahoo! Research, Santa Clara, CA
View Profile

CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrievalJuly 2007Pages 17–24https://doi.org/10.1145/1282280.1282283

Published:09 July 2007Publication History

CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval

Pages 17–24

ABSTRACT

Online image repositories such as Flickr contain hundreds of millions of images and are growing quickly. Along with that the needs for supporting indexing, searching and browsing is becoming more and more pressing. In this work we will employ the image content as a source of information to retrieve images. We study the representation of images by Latent Dirichlet Allocation (LDA) models for content-based image retrieval. Image representations are learned in an unsupervised fashion, and each image is modeled as the mixture of topics/object parts depicted in the image. This allows us to put images into subspaces for higher-level reasoning which in turn can be used to find similar images. Different similarity measures based on the described image representation are studied. The presented approach is evaluated on a real world image database consisting of more than 246,000 images and compared to image models based on probabilistic Latent Semantic Analysis (pLSA). Results show the suitability of the approach for large-scale databases. Finally we incorporate active learning with user relevance feedback in our framework, which further boosts the retrieval performance.

References

R. A. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 1999. Google ScholarDigital Library
K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. J. Mach. Learn. Res., 3:1107--1135, 2003. Google ScholarDigital Library
D. M. Blei and M. I. Jordan. Modeling annotated data. In SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, pages 127--134, New York, NY, USA, 2003. ACM Press. Google ScholarDigital Library
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993--1022, 2003. Google ScholarCross Ref
A. Bosch, A. Zisserman, and X. Munoz. Scene classification via pLSA. In Proceedings of the European Conference on Computer Vision, 2006. Google ScholarDigital Library
R. Fergus, L. Fei-Fei, P. Perona, and A. Zisserman. Learning object categories from google's image search. In ICCV '05: Proceedings of the Tenth IEEE International Conference on Computer Vision, pages 1816--1823, Washington, DC, USA, 2005. IEEE Computer Society. Google ScholarDigital Library
T. Hofmann. Unsupervised learning by probabilistic latent semantic analysis. Mach. Learn., 42(1--2):177--196, 2001. Google ScholarDigital Library
F.-F. Li and P. Perona. A Bayesian hierarchical model for learning natural scene categories. In CVPR '05: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2, pages 524--531, Washington, DC, USA, 2005. IEEE Computer Society. Google ScholarDigital Library
R. Lienhart and M. Slaney. pLSA on large scale image databases. In IEEE International Conference on Acoustics, Speech and Signal Processing, 2007.Google ScholarCross Ref
D. G. Lowe. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision, 60(2):91--110, 2004. Google ScholarDigital Library
G. Pass, R. Zabih, and J. Miller. Comparing images using color coherence vectors. In MULTIMEDIA '96: Proceedings of the fourth ACM international conference on Multimedia, pages 65--73, New York, NY, USA, 1996. ACM Press. Google ScholarDigital Library
P. Quelhas, F. Monay, J.-M. Odobez, D. Gatica-Perez, T. Tuytelaars, and L. V. Gool. Modeling scenes with local descriptors and latent aspects. In ICCV '05: Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pages 883--890, Washington, DC, USA, 2005. IEEE Computer Society. Google ScholarDigital Library
J. Sivic, B. C. Russell, A. A. Efros, A. Zisserman, and W. T. Freeman. Discovering objects and their location in images. In International Conference on Computer Vision (ICCV 2005), 2005. Google ScholarDigital Library
E. B. Sudderth, A. B. Torralba, W. T. Freeman, and A. S. Willsky. Describing visual scenes using transformed dirichlet processes. In NIPS, 2005.Google Scholar
S. Tong and E. Chang. Support vector machine active learning for image retrieval. In MULTIMEDIA '01: Proceedings of the ninth ACM international conference on Multimedia, pages 107--118, New York, NY, USA, 2001. ACM Press. Google ScholarDigital Library
G. Wang, Y. Zhang, and L. Fei-Fei. Using dependent regions for object categorization in a generative framework. In CVPR '06: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pages 1597--1604, Washington, DC, USA, 2006. IEEE Computer Society. Google ScholarDigital Library
X. Wei and W. B. Croft. LDA-based document models for ad-hoc retrieval. In SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 178--185, New York, NY, USA, 2006. ACM Press. Google ScholarDigital Library
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pages 334--342, New York, NY, USA, 2001. ACM Press. Google ScholarDigital Library

Index Terms

Image retrieval on large-scale image databases
1. Information systems
  1. Information retrieval
    1. Document representation
    2. Search engine architectures and scalability
      1. Search engine indexing

Recommendations

Semantic-Aware Co-Indexing for Image Retrieval
In content-based image retrieval, inverted indexes allow fast access to database images and summarize all knowledge about the database. Indexing multiple clues of image contents allows retrieval algorithms search for relevant images from different ...
Read More
Query Specific Rank Fusion for Image Retrieval
Recently two lines of image retrieval algorithms demonstrate excellent scalability: 1) local features indexed by a vocabulary tree, and 2) holistic features indexed by compact hashing codes. Although both of them are able to search visually similar images ...
Read More
Content based medical image retrieval using topic and location model
Graphical abstract

Display Omitted
Highlights
- Medical image retrieval based on Topic and Location probabilities of visual words.
Abstract Background and objective
Retrieval of medical images from an anatomically diverse dataset is a challenging task. Objective of our present study is to analyse the automated medical image retrieval system incorporating topic ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval
July 2007
655 pages
ISBN:9781595937339
DOI:10.1145/1282280
General Chairs:
Nicu Sebe
Univ. of Amsterdam, The Netherlands
,
Marcel Worring
Univ. of Amsterdam, The Netherlands
Copyright © 2007 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 9 July 2007
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
large-scale image retrieval
latent Dirichlet allocation
Qualifiers
- Article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 61
  Total Citations
  View Citations
- 1,914
  Total Downloads
- Downloads (Last 12 months)21
- Downloads (Last 6 weeks)4
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Image retrieval on large-scale image databases

CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Semantic-Aware Co-Indexing for Image Retrieval

Query Specific Rank Fusion for Image Retrieval

Content based medical image retrieval using topic and location model

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Image retrieval on large-scale image databases

CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Semantic-Aware Co-Indexing for Image Retrieval

Query Specific Rank Fusion for Image Retrieval

Content based medical image retrieval using topic and location model

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media