Abstract
For applications in human identification, activity recognition, 3D reconstruction, entertainment and sports, it is often desirable to capture a set of synchronized video sequences of a person from multiple camera viewpoints (see Figure 8-1). One way to achieve this is to set up a ring of cameras all statically aimed at a single point in space, and to have an actor perform at this fixation point while the video footage is shot. This is the method used to create spectacular special effects in the movie The Matrix, where playing back frames from a single time step, across all cameras, yields the appearance of freezing the action in time while a virtual camera flies around the scene. However, in surveillance or sports applications it is not possible to predict beforehand the precise location where an interesting activity will occur, and therefore it is necessary to dynamically adjust the fixation point of multiple camera views. We have developed a system that tracks a person in real-time and adjusts the pan, tilt, zoom and focus of each camera to acquire synchronized multi-view video of a person moving through the scene.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Collins, R., Lipton, A., Fujiyoshi, H. and Kanade, T., “Algorithms for cooperative multi-sensor surveillance,” Proceedings of the IEEE, Vol. 89(10), pp. 1456–1477, October 2001.
Elgammal, A., Harwood, D. and Davis, L., “Non-parametric model for background subtraction,” European Conference on Computer Vision, 2000.
Stauffer, C. and Grimson, W.E.L., “Adaptive background mixture models for real-time tracking,” IEEE Computer Vision and Pattern Recognition, 1999, pp.246–252.
Toyama, K., Krumm, J., Brumitt, B. and Meyers, B., “Wallflower: Principles and practice of background maintenance,” International Conference on Computer Vision, 1999, pp.255–261.
Dellaert, F. and Collins, R., “Fast image-based tracking by selective pixel integration,” ICCV Workshop on Frame-Rate Vision, September 1999.
Bradski, G.R., “Computer vision face tracking for use in a perceptual user interface,” IEEE Workshop on Applications of Computer Vision, 1998, pp. 214–219.
Comaniciu, D., Ramesh, V. and Meer, P., “Real-time tracking of non-rigid objects using mean shift,” IEEE Computer Vision and Pattern Recognition, 2000, pp. 142–149.
Collins, R. and Tsin, Y., “Calibration of an outdoor active camera system” IEEE Computer Vision and Pattern Recognition, June 1999, pp. 528–534.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer Science+Business Media New York
About this chapter
Cite this chapter
Collins, R.T., Amidi, O., Kanade, T. (2003). Acquiring Multi-View Video with an Active Camera System. In: Foresti, G.L., Regazzoni, C.S., Varshney, P.K. (eds) Multisensor Surveillance Systems. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0371-2_8
Download citation
DOI: https://doi.org/10.1007/978-1-4615-0371-2_8
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5043-9
Online ISBN: 978-1-4615-0371-2
eBook Packages: Springer Book Archive