skip to main content
article

Clustering and searching WWW images using link and page layout analysis

Published:01 May 2007Publication History
Skip Abstract Section

Abstract

Due to the rapid growth of the number of digital images on the Web, there is an increasing demand for an effective and efficient method for organizing and retrieving the available images. This article describes iFind, a system for clustering and searching WWW images. By using a vision-based page segmentation algorithm, a Web page is partitioned into blocks, and the textual and link information of an image can be accurately extracted from the block containing that image. The textual information is used for image indexing. By extracting the page-to-block, block-to-image, block-to-page relationships through link structure and page layout analysis, we construct an image graph. Our method is less sensitive to noisy links than previous methods like PageRank, HITS, and PicASHOW, and hence the image graph can better reflect the semantic relationship between images. Using the notion of Markov Chain, we can compute the limiting probability distributions of the images, ImageRanks, which characterize the importance of the images. The ImageRanks are combined with the relevance scores to produce the final ranking for image search. With the graph models, we can also use techniques from spectral graph theory for image clustering and embedding, or 2-D visualization. Some experimental results on 11.6 million images downloaded from the Web are provided in the article.

References

  1. Belkin, M. and Niyogi, P. 2001. Laplacian eigenmaps and spectral techniques for embedding and clustering. In Advances in Neural Information Processing Systems 14. Vancouver, Canada.Google ScholarGoogle Scholar
  2. Brew, C. and Wade, S. 2002. Spectral clustering for German verbs. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Philadelphia, PA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Brin, S. and Page, L. 1998. The anatomy of a large-scale hypertextual (Web) search engine. In Proceedings of the 7th ACM Conference on the World Wide Web. Brisbane, Australia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Cai, D., He, X., Ma, W.-Y., Wen, J.-R., and Zhang, H.-J. 2004a. Organizing WWW images based on the analysis of page layout and Web link structure. In IEEE International Conference on Multimedia and Expo. Xi'an, China.Google ScholarGoogle Scholar
  5. Cai, D., He, X., Wen, J.-R., and Ma, W.-Y. 2004b. Block-level link analysis. In Proceedings of the ACM SIGIR Conference on Information Retrieval. Sheffield, UK. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cai, D., Yu, S., Wen, J.-R., and Ma, W.-Y. 2003a. Extracting content structure for Web pages based on visual representation. In Proceedings of the 5th Asia Pacific Web Conference. Xi'an, China. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Cai, D., Yu, S., Wen, J.-R., and Ma, W.-Y. 2003b. Vips: A vision-based page segmentation algorithm. Microsoft Tech. Rep., MSR-TR-2003-79.Google ScholarGoogle Scholar
  8. Cai, D., Yu, S., Wen, J.-R., and Ma, W.-Y. 2004c. Block-based Web search. In Proceedings of the ACM SIGIR Conference on Information Retrieval. Sheffield, UK. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Chung, F. R. K. 1997. Spectral Graph Theory. Regional Conference Series in Mathematics, vol. 92.Google ScholarGoogle Scholar
  10. Frankel, C., Swain, M., and Athitsos, V. 1996. Webseer: An image search engine for the World Wide Web. Tech. Rep., TR-96-14, Department of Computer Science, University of Chicago. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Google. http://www.google.com/press/zeitgeist.html. Google zeitgeist---search patterns, trends, and surprises according to google.Google ScholarGoogle Scholar
  12. Guattery, S. and Miller, G. L. 2000. Graph embeddings and Laplacian eigenvalues. SIAM J. Matrix Anal. Appl. 21, 3, 703--723. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. He, X., Yan, S., Hu, Y., Niyogi, P., and Zhang, H.-J. 2005. Face recognition using Laplacian-faces. IEEE Trans. Pattern Anal. Mach. Intell. 27, 3, 328--340. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Kleinberg, J. 1999. Authoritative sources in a hyperlinked environment. J. ACM 46, 5, 604--622. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Lempel, R. and Soffer, A. 2001. Picashow: Pictorial authority search by hyperlinks on the Web. In Proceedings of the 10th ACM Conference on World Wide Web. Hong Kong, China, 438--448. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Ma, W.-Y. and Manjunath, B. S. 1996. Texture features and learning similarity. In IEEE Conference on Computer Vision and Pattern Recognition. San Francisco, CA, 425--430. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Ma, W.-Y. and Manjunath, B. S. 1999. Netra: A toolbox for navigating large image databases. Multimedia Syst. 7, 3, 184--189. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Mohar, B. 1997. Some applications of Laplace eigenvalues of graphs. In Graph Symmetry: Algebraic Methods and Applications, G. Hahn and G. Sabidussi, Eds.Google ScholarGoogle Scholar
  19. Ng, A. Y., Jordan, M., and Weiss, Y. 2001. On spectral clustering: Analysis and an algorithm. In Advances in Neural Information Processing Systems 14. Vancouver, Canada.Google ScholarGoogle Scholar
  20. Robertson, S. E. and Walker, S. 1999. Okapi/keenbow at trec-8. In Eighth Text Retrieval Conference (TREC-8). 151--162.Google ScholarGoogle Scholar
  21. Rui, Y., Huang, T. S., Ortega, M., and Mehrotra, S. 1998. Relevance feedback: A power tool for interactive content-based image retrieval. IEEE Trans. Circ. Syst. Video Tech. 8, 5, 644--655. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Sclaroff, S., Taycher, L., and Cascia, M. L. 1994. Imagerover: A content-based image browser for the World Wide Web. In IEEE Workshop on Content-Based Access of Image and Video Libraries. San Juan, Puerto Rico. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Shi, J. and Malik, J. 2000. Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22, 8, 888--905. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Smith, J. and Chang, S.-F. 1996. Visualseek: A fully automated content-based image query system. In Proceedings of the ACM Conference on Multimedia. New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Smith, J. and Chang, S.-F. 1997. Webseek, a content-based image and video search and catalog tool for the Web. IEEE Multimedia.Google ScholarGoogle Scholar
  26. Song, R., Liu, H., Wen, J.-R., and Ma, W.-Y. 2004. Learning block importance models for Web pages. In Proceedings of the 13th ACM Conference on World Wide Web. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Wen, J.-R., Song, R., Cai, D., Zhu, K., Yu, S., Ye, S., and Ma, W.-Y. 2003. Microsoft Research asia at the Web track of TREC 2003. In Twelfth Text Retrieval Conference (TREC-12).Google ScholarGoogle Scholar

Index Terms

  1. Clustering and searching WWW images using link and page layout analysis

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in

              Full Access

              • Published in

                cover image ACM Transactions on Multimedia Computing, Communications, and Applications
                ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 3, Issue 2
                May 2007
                147 pages
                ISSN:1551-6857
                EISSN:1551-6865
                DOI:10.1145/1230812
                Issue’s Table of Contents

                Copyright © 2007 ACM

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 1 May 2007
                Published in tomm Volume 3, Issue 2

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • article

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader