Saliency, Scale and Image Description

Kadir, Timor; Brady, Michael

doi:10.1023/A:1012460413855

Saliency, Scale and Image Description

Published: November 2001

Volume 45, pages 83–105, (2001)
Cite this article

International Journal of Computer Vision Aims and scope Submit manuscript

Timor Kadir¹ &
Michael Brady¹

3271 Accesses
840 Citations
9 Altmetric
Explore all metrics

Abstract

Many computer vision problems can be considered to consist of two main tasks: the extraction of image content descriptions and their subsequent matching. The appropriate choice of type and level of description is of course task dependent, yet it is generally accepted that the low-level or so called early vision layers in the Human Visual System are context independent.

This paper concentrates on the use of low-level approaches for solving computer vision problems and discusses three inter-related aspects of this: saliency; scale selection and content description. In contrast to many previous approaches which separate these tasks, we argue that these three aspects are intrinsically related. Based on this observation, a multiscale algorithm for the selection of salient regions of an image is introduced and its application to matching type problems such as tracking, object recognition and image retrieval is demonstrated.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Alvarez, L., Lions, P., and Morel, J. 1992. Image selective smoothing and edge detection by nonlinear diffusion. II. SIAM Journal on Numerical Analysis, 29(3):845–866.
Google Scholar
Bergholm, F. 1986. Edge focusing. In Proc.Int.Conf.on Pattern Recognition, Paris, France, pp. 597–600.
Blake, A. and Isard, M. 1997. The CONDENSATION algorithm-conditional density propagation and applications to visual tracking. In Advances in Neural Information Processing Systems,Vol. 9, M.C. Mozer, M.I. Jordan, and T. Petsche (Eds.). MIT Press: Cambridge, MA.
Google Scholar
Burt, P.J. and Adelson, E.H. 1983. The Laplacian pyramid as a compact image code. IEEE Trans.Communication, 31(4):532–540.
Google Scholar
Chomat, O., deVerdiere, V.C., Hall, D., and Crowley, J.L. 2000. Local scale selection for Gaussian based description techniques. In Proc.European Conf.Computer Vision, pp. 117–133.
Coifman, R. and Wickerhauser, M. 1992. Entropy-based algorithms for best basis selection. IEEE Trans.on Information Theory, 38(2):713–718.
Google Scholar
Deriche, R. and Blaszka, T. 1993. Recovering and characterizing image features using an efficient model based approach. In Proc.Int.Conf.on Computer Vision and Pattern Recognition, pp. 530–535.
Dufournaud, Y., Schmid, C., and Horaud, R. 2000. Matching images with different resolutions. In Proc.Int.Conf.on Computer Vision and Pattern Recognition, pp. 612–618.
Gilles, S. 1998. Robust description and matching of images. Ph.D. Thesis, University of Oxford.
Harris, C. and Stephens, M. 1988. A combined corner and edge detector. In Proc.4th Alvey Vision Conf,Manchester, pp. 189–192.
Jägersand, M. 1995. Saliency maps and attention selection in scale and spatial coordinates: An information theoretic approach. In Proc.Int.Conf.on Computer Vision. MIT Press: Cambridge, MA, pp. 195–202.
Google Scholar
Julesz, B. 1995. Dialogues on Perception. MIT Press: Cambridge, MA.
Google Scholar
Koenderink, J.J. 1984. The structure of images. Biological Cybernetics, 50:363–370.
Google Scholar
Koenderink, J.J. and van Doorn, A.J. 1987. Representation of local geometry in the visual system. Biological Cybernetics, 63: 291–297.
Google Scholar
Leclerc, Y.G. 1989. Constructing simple stable descriptions for image partitioning. Int.Journal of Computer Vision, 3:73–102.
Google Scholar
Lindeberg, T. 1993. On scale selection for differential operators. In Proc.8th Scandinavian Conf.on Image Analysis, Tromso, Norway, pp. 857–866.
Lindeberg, T. 1994. Junction detection with automatic selection of detection scales and localization scales. In Proc.Int.Conf.on Image Processing, pp. 924–928.
Lindeberg, T. and ter Haar Romeny, B.M. 1994. Linear Scale-Space: I.Basic Theory, II.Early Visual Operations. Kluwer Academic Publishers: Dordrecht, The Netherlands, pp. 1–77.
Google Scholar
Mallat, S. 1998. A Wavelet Tour of Signal Processing. Academic Press: San Diego.
Google Scholar
Marr, D. 1982. Vision: A Computational Investigation into the Human Representation ond Processing of Visual Information. W.H. Freeman: San Francisco.
Google Scholar
Marr, D. and Hildreth, E. 1979. Theory of edge detection. In Proceedings Royal Society of London Bulletin, 204:301–328.
Google Scholar
Milanese, R. 1993. Detecting salient regions in an image: From biological evidence to computer implementation. Ph.D. Thesis, University of Geneva.
Mokhtarian, F. and Suomela, R. 1998. Robust image corner detection through curvature scale space. IEEE Trans.Pattern Analysis and Machine Intelligence, 20(12):1376–1381.
Google Scholar
Neisser, U. 1964.Visual search. Scientific American, 210(6):94–102.
Google Scholar
Nene, S., Nayar, S., and Murase, H. 1996. Columbia image object library. Technical Report, Department of Computer Science, Columbia University.
Perona, P. and Malik, J. 1988. Scale space and edge detection using anisotropic diffusion. In Proc.Int.Conf.on Computer Vision and Pattern Recognition, pp. 16–22.
Schiele, B. 1997. Object recognition using multidimensional receptive field histograms. Ph.D. Thesis, I.N.P. de Grenoble.
Schmid, C. and Mohr, R. 1997. Local greyvalue invariants for image retrieval. IEEE Trans.Pattern Analysis and Machine Intelligence, 19(5):530–535.
Google Scholar
Schmid, C., Mohr, R., and Bauckhage, C. 1998. Comparing and evaluating interest points. In Proc.Int.Conf.on Computer Vision, pp. 230–235.
Sha'ashua, A. and Ullman, S. 1988. Structural saliency: The detection of globally salient structures using a locally connected network. In Proc.Int.Conf.on Computer Vision, Tampa, FL, pp. 321–327.
Starck, J. and Murtagh, F. 1999. Multiscale entropy filtering. EURASIP Signal Processing, pp. 147–165.
Swain, M.J. 1990. Color indexing. Ph.D. Thesis, University of Rochester.
ter Haar Romeny, B.M. 1996. Introduction to scale-space theory: Multiscale geometric image analysis. Ph.D. Thesis, Utrecht University.
Treisman, A. 1985. Preattentive processing in vision. Computer Vision, Graphics, and Image Processing, 31(2):156–177.
Google Scholar
Walker, K.N., Cootes, T.F., and Taylor, C.J. 1998a. Locating salient facial features using image invariants. In Int.Conf.on Automatic Face and Gesture Recognition, Nara, Japan.
Walker, K.N., Cootes, T.F., and Taylor, C.J. 1998b. Locating salient object features. In Proc.British Machine Vision Conference, Southampton, UK, pp. 557–566.
Weickert, J. 1997. A review of nonlinear diffusion filtering. Lecture Notes in Computer Science, 1252:3–28.
Google Scholar
Winter, A., Maitre, H., Cambou, N., and Legrand, E. 1997. Entropy and multiscale analysis: An original feature extraction algorithm for aerial and satellite images. In Proc.Int.Conf.on Acoustics, Speech, and Signal Processing. IEEE.
Witkin, A. 1983. Scale-space filtering. In Proc.Int.Joint Conf.on Artificial Intelligence, Karlsruhe, Germany.
Zheng, B.Y., Qian, W., and Clarke, L.P. 1996. Digital mammography-Mixed feature neural network with spectral entropy decision for detection of microcalcifications. IEEE Trans.on Medical Imaging, 15(5):589–597.
Google Scholar
Zhu, S.C. and Yuille, A. 1996. Region competition: Unifying snakes, region growing, and bayes/mdl for multiband image segmentation. IEEE Pattern Analysis and Machine Intelligence, 18(9):884–900.
Google Scholar

Download references

Author information

Authors and Affiliations

Robotics Research Laboratory, Department of Engineering Science, University of Oxford, Parks Road, Oxford, OX1 3PJ, UK
Timor Kadir & Michael Brady

Authors

Timor Kadir
View author publications
You can also search for this author in PubMed Google Scholar
Michael Brady
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kadir, T., Brady, M. Saliency, Scale and Image Description. International Journal of Computer Vision 45, 83–105 (2001). https://doi.org/10.1023/A:1012460413855

Download citation

Issue Date: November 2001
DOI: https://doi.org/10.1023/A:1012460413855

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Saliency, Scale and Image Description

Abstract

Access this article

Similar content being viewed by others

A Global-Local Approach to Saliency Detection

Visual Features—From Early Concepts to Modern Computer Vision

Dense, Scale-Less Descriptors

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Saliency, Scale and Image Description

Abstract

Access this article

Similar content being viewed by others

A Global-Local Approach to Saliency Detection

Visual Features—From Early Concepts to Modern Computer Vision

Dense, Scale-Less Descriptors

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation