Abstract
Many computer vision problems can be considered to consist of two main tasks: the extraction of image content descriptions and their subsequent matching. The appropriate choice of type and level of description is of course task dependent, yet it is generally accepted that the low-level or so called early vision layers in the Human Visual System are context independent.
This paper concentrates on the use of low-level approaches for solving computer vision problems and discusses three inter-related aspects of this: saliency; scale selection and content description. In contrast to many previous approaches which separate these tasks, we argue that these three aspects are intrinsically related. Based on this observation, a multiscale algorithm for the selection of salient regions of an image is introduced and its application to matching type problems such as tracking, object recognition and image retrieval is demonstrated.
Similar content being viewed by others
References
Alvarez, L., Lions, P., and Morel, J. 1992. Image selective smoothing and edge detection by nonlinear diffusion. II. SIAM Journal on Numerical Analysis, 29(3):845–866.
Bergholm, F. 1986. Edge focusing. In Proc.Int.Conf.on Pattern Recognition, Paris, France, pp. 597–600.
Blake, A. and Isard, M. 1997. The CONDENSATION algorithm-conditional density propagation and applications to visual tracking. In Advances in Neural Information Processing Systems,Vol. 9, M.C. Mozer, M.I. Jordan, and T. Petsche (Eds.). MIT Press: Cambridge, MA.
Burt, P.J. and Adelson, E.H. 1983. The Laplacian pyramid as a compact image code. IEEE Trans.Communication, 31(4):532–540.
Chomat, O., deVerdiere, V.C., Hall, D., and Crowley, J.L. 2000. Local scale selection for Gaussian based description techniques. In Proc.European Conf.Computer Vision, pp. 117–133.
Coifman, R. and Wickerhauser, M. 1992. Entropy-based algorithms for best basis selection. IEEE Trans.on Information Theory, 38(2):713–718.
Deriche, R. and Blaszka, T. 1993. Recovering and characterizing image features using an efficient model based approach. In Proc.Int.Conf.on Computer Vision and Pattern Recognition, pp. 530–535.
Dufournaud, Y., Schmid, C., and Horaud, R. 2000. Matching images with different resolutions. In Proc.Int.Conf.on Computer Vision and Pattern Recognition, pp. 612–618.
Gilles, S. 1998. Robust description and matching of images. Ph.D. Thesis, University of Oxford.
Harris, C. and Stephens, M. 1988. A combined corner and edge detector. In Proc.4th Alvey Vision Conf,Manchester, pp. 189–192.
Jägersand, M. 1995. Saliency maps and attention selection in scale and spatial coordinates: An information theoretic approach. In Proc.Int.Conf.on Computer Vision. MIT Press: Cambridge, MA, pp. 195–202.
Julesz, B. 1995. Dialogues on Perception. MIT Press: Cambridge, MA.
Koenderink, J.J. 1984. The structure of images. Biological Cybernetics, 50:363–370.
Koenderink, J.J. and van Doorn, A.J. 1987. Representation of local geometry in the visual system. Biological Cybernetics, 63: 291–297.
Leclerc, Y.G. 1989. Constructing simple stable descriptions for image partitioning. Int.Journal of Computer Vision, 3:73–102.
Lindeberg, T. 1993. On scale selection for differential operators. In Proc.8th Scandinavian Conf.on Image Analysis, Tromso, Norway, pp. 857–866.
Lindeberg, T. 1994. Junction detection with automatic selection of detection scales and localization scales. In Proc.Int.Conf.on Image Processing, pp. 924–928.
Lindeberg, T. and ter Haar Romeny, B.M. 1994. Linear Scale-Space: I.Basic Theory, II.Early Visual Operations. Kluwer Academic Publishers: Dordrecht, The Netherlands, pp. 1–77.
Mallat, S. 1998. A Wavelet Tour of Signal Processing. Academic Press: San Diego.
Marr, D. 1982. Vision: A Computational Investigation into the Human Representation ond Processing of Visual Information. W.H. Freeman: San Francisco.
Marr, D. and Hildreth, E. 1979. Theory of edge detection. In Proceedings Royal Society of London Bulletin, 204:301–328.
Milanese, R. 1993. Detecting salient regions in an image: From biological evidence to computer implementation. Ph.D. Thesis, University of Geneva.
Mokhtarian, F. and Suomela, R. 1998. Robust image corner detection through curvature scale space. IEEE Trans.Pattern Analysis and Machine Intelligence, 20(12):1376–1381.
Neisser, U. 1964.Visual search. Scientific American, 210(6):94–102.
Nene, S., Nayar, S., and Murase, H. 1996. Columbia image object library. Technical Report, Department of Computer Science, Columbia University.
Perona, P. and Malik, J. 1988. Scale space and edge detection using anisotropic diffusion. In Proc.Int.Conf.on Computer Vision and Pattern Recognition, pp. 16–22.
Schiele, B. 1997. Object recognition using multidimensional receptive field histograms. Ph.D. Thesis, I.N.P. de Grenoble.
Schmid, C. and Mohr, R. 1997. Local greyvalue invariants for image retrieval. IEEE Trans.Pattern Analysis and Machine Intelligence, 19(5):530–535.
Schmid, C., Mohr, R., and Bauckhage, C. 1998. Comparing and evaluating interest points. In Proc.Int.Conf.on Computer Vision, pp. 230–235.
Sha'ashua, A. and Ullman, S. 1988. Structural saliency: The detection of globally salient structures using a locally connected network. In Proc.Int.Conf.on Computer Vision, Tampa, FL, pp. 321–327.
Starck, J. and Murtagh, F. 1999. Multiscale entropy filtering. EURASIP Signal Processing, pp. 147–165.
Swain, M.J. 1990. Color indexing. Ph.D. Thesis, University of Rochester.
ter Haar Romeny, B.M. 1996. Introduction to scale-space theory: Multiscale geometric image analysis. Ph.D. Thesis, Utrecht University.
Treisman, A. 1985. Preattentive processing in vision. Computer Vision, Graphics, and Image Processing, 31(2):156–177.
Walker, K.N., Cootes, T.F., and Taylor, C.J. 1998a. Locating salient facial features using image invariants. In Int.Conf.on Automatic Face and Gesture Recognition, Nara, Japan.
Walker, K.N., Cootes, T.F., and Taylor, C.J. 1998b. Locating salient object features. In Proc.British Machine Vision Conference, Southampton, UK, pp. 557–566.
Weickert, J. 1997. A review of nonlinear diffusion filtering. Lecture Notes in Computer Science, 1252:3–28.
Winter, A., Maitre, H., Cambou, N., and Legrand, E. 1997. Entropy and multiscale analysis: An original feature extraction algorithm for aerial and satellite images. In Proc.Int.Conf.on Acoustics, Speech, and Signal Processing. IEEE.
Witkin, A. 1983. Scale-space filtering. In Proc.Int.Joint Conf.on Artificial Intelligence, Karlsruhe, Germany.
Zheng, B.Y., Qian, W., and Clarke, L.P. 1996. Digital mammography-Mixed feature neural network with spectral entropy decision for detection of microcalcifications. IEEE Trans.on Medical Imaging, 15(5):589–597.
Zhu, S.C. and Yuille, A. 1996. Region competition: Unifying snakes, region growing, and bayes/mdl for multiband image segmentation. IEEE Pattern Analysis and Machine Intelligence, 18(9):884–900.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Kadir, T., Brady, M. Saliency, Scale and Image Description. International Journal of Computer Vision 45, 83–105 (2001). https://doi.org/10.1023/A:1012460413855
Issue Date:
DOI: https://doi.org/10.1023/A:1012460413855