Abstract
Video surveillance has been widely employed in our society in the past years. In this context, humans play an important role and are the major players since they are responsible for changing the state of the scene through actions and activities. Therefore, the design of automatic methods to understand human behavior and recognize activities are important to determine which subjects are involved in an activity of interest. The computer vision research area has contributed vastly for the development of methods related to detection, tracking and recognition of humans. However, there is still a lack of methods able to recognize higher level activities (e.g., interaction among people that might be involved in an illegal activity). The first step to be successful in this enterprise is to detect and locate groups of people in the scene, which is essential to make inferences regarding interactions among persons. Aiming at such direction, this paper presents a group detection approach that combines motion and spatial information with low-level descriptors to be robust to situations such as partial occlusions. The experimental results obtained using the PETS 2009 and the BEHAVE datasets demonstrate that the proposed combination indeed achieves higher accuracies, indicating a promising direction for future research.
Chapter PDF
References
Aggarwal, J., Ryoo, M.: Human Activity Analysis: A Review. ACM Comput. Surv. 43(3), 1–43 (2011)
Blunsden, S., Fisher, R.B.: The behave video dataset: ground truthed video for multi-person behavior classification. Annals of the BMVA 4 (2010)
Saxena, S., Brémond, F., Thonnat, M., Ma, R.: Crowd behavior recognition for video surveillance. In: Blanc-Talon, J., Bourennane, S., Philips, W., Popescu, D., Scheunders, P. (eds.) ACIVS 2008. LNCS, vol. 5259, pp. 970–981. Springer, Heidelberg (2008)
Zhan, B., Monekosso, D.N., Remagnino, P., Velastin, S.A., Xu, L.-Q.: Crowd analysis: A survey. Mach. Vision Appl. 19, 345–357 (2008)
Vishwakarma, S., Agrawal, A.: A survey on activity recognition and behavior understanding in video surveillance. The Visual Computer 29(10), 983–1009 (2013)
Hongeng, S., Nevatia, R.: Multi-agent event recognition. In: ICCV, pp. 84–93 (2001)
Ryoo, M.S., Aggarwal, J.K.: Stochastic representation and recognition of high-level group activities. IJCV 93(2), 183–200 (2011)
Bingbing Ni, S.Y., Kassim, A.: Recognizing human group activities with localized causalities, pp. 1470–1477 (2009)
Yin, Y., Yang, G., Xu, J., Man, H.: Small group human activity recognition. In: ICIP, pp. 2709–2712 (2012)
Andrade, E., Fisher, R., Blunsden, S.: Detection of emergency events in crowded scenes. In: The Institution of Engineering and Technology Conference on Crime and Security, pp. 528–533 (June 2006)
Lin, W., Sun, M.-T., Poovendran, R., Zhang, Z.: Group event detection for video surveillance. In: IEEE International Symposium on Circuits and Systems, ISCAS 2009, pp. 2830–2833 (May 2009)
Saxena, S., Brémond, F., Thonnat, M., Ma, R.: Crowd behavior recognition for video surveillance. In: Blanc-Talon, J., Bourennane, S., Philips, W., Popescu, D., Scheunders, P. (eds.) ACIVS 2008. LNCS, vol. 5259, pp. 970–981. Springer, Heidelberg (2008)
Kwak, S., Han, B., Han, J.H.: Multi-agent event detection: Localization and role assignment. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2682–2689 (June 2013)
Chang, M.-C., Krahnstoever, N., Lim, S., Yu, T.: Group level activity recognition in crowded environments across multiple cameras. In: AVSS, pp. 56–63 (2010)
Rodrigues de Almeida, I., Rosito Jung, C.: Change detection in human crowds. In: 2013 26th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), pp. 63–69 (August 2013)
Bouwmans, T., Baf, F.E., Vachon, B.: Background modeling using mixture of gaussians for foreground detection a survey. In: Recent Patents on Computer Science, pp. 219–237 (2008)
Yves Bouguet, J.: Pyramidal implementation of the lucas kanade feature tracker. Intel Corporation, Microprocessor Research Labs (2000)
Zhang, Y., Liu, X., Chang, M.-C., Ge, W., Chen, T.: Spatio-temporal phrases for activity recognition. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part III. LNCS, vol. 7574, pp. 707–721. Springer, Heidelberg (2012)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 1, pp. 886–893 (June 2005)
Bradski, G.: Opencv library. Dr. Dobb’s Journal of Software Tools (2000)
Doermann, D., Mihalcik, D.: Tools and techniques for video performance evaluation. In: Proceedings of the 15th International Conference on Pattern Recognition, vol. 4, pp. 167–170 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Mora-Colque, R.V.H., Cámara-Chávez, G., Schwartz, W.R. (2014). Detection of Groups of People in Surveillance Videos Based on Spatio-Temporal Clues. In: Bayro-Corrochano, E., Hancock, E. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2014. Lecture Notes in Computer Science, vol 8827. Springer, Cham. https://doi.org/10.1007/978-3-319-12568-8_115
Download citation
DOI: https://doi.org/10.1007/978-3-319-12568-8_115
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12567-1
Online ISBN: 978-3-319-12568-8
eBook Packages: Computer ScienceComputer Science (R0)