Skip to main content
Log in

Active fixation for scene exploration

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

It is well-known that active selection of fixation points in humans is highly context and task dependent. It is therefore likely that successful computational processes for fixation in active vision should be so too. We are considering active fixation in the context of recognition of man-made objects characterized by their shapes. In this situation the qualitative shape and type of observed junctions play an important role. The fixations are driven by a grouping strategy, which forms sets of connected junctions separated from the surrounding at depth discontinuities. We have furthermore developed a methodology for rapid active detection and classification of junctions by selection of fixation points. The approach is based on direct computations from image data and allows integration of stereo and accommodation cues with luminance information. This work form a part of an effort to perform active recognition of generic objects, in the spirit of Malik and Biederman, but on real imagery rather than on line-drawings.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Aloimonos, Y., Weiss, I., and Bandyopadhyay, A. 1987. Active vision. In Proc. 1st International Conference on Computer Vision, pp. 35–54.

  • Anderson, B.D.O. and Moore, J.B. 1979. Optimal Filtering, Information and system science. Prentice-Hall, Inc.

  • Bajcsy, R. 1985. Active perception vs. passive perception. In Proceedings Third IEEE Workshop on Computer Vision, Bellair. IEEE, pp. 55–59.

  • Bajcsy R. and Campos M. 1992. Active and exploratory perception. CVGIP: Image Understanding, 56(1):31–40.

    Google Scholar 

  • Ballard D. 1991. Animate vision. J. of Artificial Intelligence, 48:57–86.

    Google Scholar 

  • Bengtsson A. and Eklundh J.-O. 1991. Shape representation by multiscale contour approximation. IEEE Trans. Pattern Analysis and Machine Intell., 13:85–94.

    Google Scholar 

  • Biederman, I. 1985. Human image understanding: Recent research and a theory. In Human and Machine Vision II, Academic Press: pp. 13–57.

  • Biederman I. 1987. Recognition-by-components: A theory of human image understanding. Psychological Review, 94:115–147.

    Google Scholar 

  • Brunnström, K. 1993. Active Exploration of Static Scenes, Ph.D. thesis, Dept. of Numerical Analysis and Computing Science, Royal Institute of Technology, ISRN KTH/NA/P-93/295E.

  • Brunnström K., Eklundh J.-O., and Lindeberg T. 1990. On scale and resolution in active analysis of local image structure. Image and Vision Computing, 8:289–296.

    Google Scholar 

  • Brunnström, K., Lindeberg, T., and Eklundh, J.-O. 1992. Active detection and classification of junctions by foveation with a headeye system guided by the scale-space primal sketch. In G. Sandini (ed.), Proc. 2nd European Conference on Computer Vision, volume 588 of Lecture Notes in Computer Science, Springer-Verlag. (Santa Margherita Ligure, Italy), pp. 701–709.

  • Crowley, J. and Christensen, H. (eds.) 1995. Vision-as-Process, Basic Research on Computer Vision Systems. Basic Research Series. Springer-Verlag.

  • Culhane, S. and Tsotsos, J. 1992. An attentional prototype for early vision. In Proc. 2nd European Conference on Computer Vision, Santa Margherita Ligure, Italy, pp. 551–562.

  • Dickinson S.J., Pentland A.P., and Rosenfeld A. 1992a. 3-d shape recovery using distributed aspect matching. IEEE Trans. Pattern Analysis and Machine Intell., 14(2): 174–198.

    Google Scholar 

  • Dickinson S.J., Pentland A.P., and Rosenfeld A. 1992b. From volumes to views: An approach to 3-d object recognition. Computer Vision, Graphics, and Image Processing: Image Understanding, Special Issue on CAD-Based Vision, 55(2):130–154.

    Google Scholar 

  • Dreschler L. and Nagel H.-H. 1982. Volumetric model and 3d-trajectory of a moving car derived from monocular tv-frame sequences of a street scene. Computer Graphics and Image Processing, 20(3):199–228.

    Google Scholar 

  • Fischler M. and Bolles R. 1981. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM, 24:381–395.

    Google Scholar 

  • Förstner, W. and Gülch, E. 1987. Detection and precise location of distinct points, corners and centres of circular features. In Proc. Intercommission Workshop of the Int. Soc. for Photogrammetry and Remote Sensing, Interlaken, Switzerland.

  • Kitchen L. and Rosenfeld R. 1982. Gray-level corner detection. Pattern Recognition Letters, 1(2):95–102.

    Google Scholar 

  • Koenderink J. and van Doorn A. 1990. Receptive field families. Biological Cybernetics, 63:291–375.

    Google Scholar 

  • Lindeberg T. 1993. Detecting salient blob-like image structures and their scales with a scale-space primal sketch: A method for focus-of-attention. International Journal of Computer Vision, 11(3):283–318.

    Google Scholar 

  • Lindeberg, T. 1994. Scale-Space Theory in Computer Vision. The Kluwer International Series in Engineering and Computer Science. Kluwer Academic Publishers.

  • Malik J. 1987. Interpreting line drawings of curved objects. International Journal of Computer Vision, 1(1):73–104.

    Google Scholar 

  • Moravec, H.P. 1977. Obstacle avoidance and navigation in the real world by a seeing robot rover. Stanford AIM-340.

  • Pahlavan, K. 1993. Active Robot Vision and Primary Ocular Processes. Ph.D. thesis, Dept. of Numerical Analysis and Computing Science, Royal Institute of Technology, ISRN KTH/NA/P-93/165E.

  • Pahlavan, K., Uhlin, T., and Eklundh, J.-O. 1992. Integrating primary ocular processes. In Proc. 2nd European Conference on Computer Vision, Santa Margherita Ligure, Italy, pp. 526–541.

  • Rimey, R. and Brown, C. 1992. Where to look next using a bayes net: Incorporating geometric relations. In Proc. 2nd European Conference on Computer Vision, Santa Margherita Ligure, Italy, pp. 542–550.

  • Tenenbaum, J.M. 1970. Accomodation in Computer Vision. Ph.D. thesis, Stanford University.

  • Ullman S. 1987. Visual routines. In W. Richards and S. Ullman (eds.), Image Understanding 1985–86. AbIex, Norwood, N.J.

    Google Scholar 

  • Waltz D. 1975. Understanding line drawings of scenes with shadows. In W. P.H. (ed.), The Psychology of Computer Vision, McGraw-Hill, New York.

    Google Scholar 

  • Westelius, C.-J., Knutsson, H., and Granlund, G.H. 1991. Focus of attention control. In Proceedings of the 7th Scandinavian Conf. on Image Analysis, Aalborg, Denmark, pp. 667–674.

  • Wilkes, D. and Tsotsos, J.K. 1992. Active object recognition. In Proceedings 1992 IEEE Comp. Soc. Conf. on Computer Vision and Pattern Recognition, Champaign, IL, pp. 136–141.

  • Wilkes, D. and Tsotsos, J.K. 1993a. Behaviours for active object recognition. In Proceedings 1993 SPIE, Boston, MA, pp. 225–239.

  • Wilkes, D. and Tsotsos, J.K. 1993b. Efficient serial associative memory. In Proceedings 1993 IEEE Comp. Soc. Conf. on Computer Vision and Pattern Recognition, New York, NY, pp. 701–702.

  • Zhang, W. and Bergholm, F. 1993. An extension of Marr's signature based edge classification and other methods for determination of diffuseness and height of edges, as well as line width. In H.-H.N., et. al. (ed.), Proc. 4th International Conference on Computer Vision, Berlin, Germany, pp. 183–191.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Brunnström, K., Eklundh, JO. & Uhlin, T. Active fixation for scene exploration. Int J Comput Vision 17, 137–162 (1996). https://doi.org/10.1007/BF00058749

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00058749

Keywords

Navigation