Elsevier

Neurocomputing

Volume 290, 17 May 2018, Pages 74-86
Neurocomputing

Anomalous entities detection and localization in pedestrian flows

https://doi.org/10.1016/j.neucom.2018.02.045Get rights and content

Abstract

We propose a novel Gaussian kernel based integration model (GKIM) for anomalous entities detection and localization in pedestrian flows. The GKIM integrates spatio-temporal features for efficient and robust motion representation to capture the distinctive and meaningful information about the anomalous entities. We next propose a block based detection framework by training a recurrent conditional random field (R-CRF) using the GKIM features. The trained R-CRF model is then used to detect and localize the anomalous entities during the online testing stage. We conduct comprehensive experiments on three benchmark datasets and compare the performance of the proposed method with the state-of-the-art anomalous entities detection methods. Our experiments show that the proposed GKIM outperforms the compared methods in terms of equal error rate (EER) and detection rate (DR) in both frame-level and pixel-level comparisons. The frame-level analysis detects the presence of an anomalous entity in a frame regardless of its location. The pixel-level analysis localizes the anomalous entity in term of its pixels.

Introduction

The population growth and traffic congestion in urban areas are rapidly increasing which makes the safety of pedestrians a key concern [1]. Therefore, it is important to analyze the pedestrian flows to facilitate smart video surveillance for ensuring pedestrian safety. For this purpose, an important task to ensure pedestrian safety is the detection and localization of anomalous entities in the pedestrian flows which can be used to warn against the possible risks.

The term anomalous entity may have different interpretations varying significantly depending on the given context. Therefore, learning a generic model for anomaly detection is a challenging problem. In this paper, an anomalous entity is considered to be a moving object exhibiting motion patterns that do not conform to the expected behavior in the pedestrian flow and may warrant special attention. Such entities present infrequent behavior compared with the normal prevalent behaviors. This definition is not limited to specific entity types (a running pedestrian or a cyclist) or scenes (sparse or dense). Similar definitions are presented by a number of papers that addressed the problem in recent years [2], [3], [4], [5], [6], [7]. We consider that anomalous entities are rare in the pedestrian flow and they are different from the majority. The motivation of our paper is to build a generic model for the detection and localization of anomalous entities in pedestrian flows. We consider each moving object as part of the pedestrian flows and non-moving objects or groups of pedestrians as a background.

A number of computer vision methods for video surveillance [2], [3], [4], [5], [7], [8] have previously addressed anomalous entities detection. Most of these methods assume that the pedestrian flows are very consistent in motion. In fact, this assumption is not realistic since the pedestrian flows may be scattered and sparse. For example, the frequency and crowdedness of the pedestrians at a certain location may be higher in the official hours and lower during the weekends and later hours. Furthermore, the problem of detecting and locating anomalous entities in the pedestrian flows is very challenging due to the appearance variations of individual entities, temporal variations, and view angle changes.

To address the above mentioned challenges, we present an efficient method for anomalous entities detection and localization in pedestrian flows that does not rely on the assumption of pedestrian flow consistency. We propose a novel Gaussian kernel based integration model (GKIM). Our GKIM is based on a Gaussian kernel based integration of local difference binary patterns (LDBP) [9] and nested motion descriptor (NMD) [10]. We consider the LDBP since it is characterized by the compact representation of spatial information while the NMD is employed to encode temporal information. The distinguishing feature of NMD is the representation of motion information without requiring an explicit optical flow estimate. Furthermore, we exploit both LDBP and NMD simultaneously to integrate their strengths into a unified model. We propose a Gaussian kernel based approach to integrate the spatio-temporal features by transforming the trace and the determinant of our feature Jacobian matrix into a distinctive space. Therefore, the GKIM represents high quality description of anomalous entities in term of most distinctive information. GKIM models the evolving relative spatial relationships and captures a specific nuance of the underlying motion considering temporal variations. Due to the aforementioned properties, our proposed GKIM is independent of the scattered, sparse, and dense nature of the pedestrian flows.

The complete flow of our proposed method is shown in Fig. 1. In order to detect and localize anomalous entities, we divide each video frame into blocks of equal size where the spatio-temporal features for each block are extracted. To this end, the features are used as a-priori for recurrent conditional random field (R-CRF) [11] training which detect and localize anomalous entities during the testing stage. We propose to use the R-CRF since it can deal efficiently with the label bias problem [11] by integrating the traditional conditional random field (CRF) [12] and recurrent neural networks (RNN) [13]. The main contributions of this paper are:

  • 1.

    We propose a novel model called GKIM for anomalous entities detection and localization. One of the major attraction of the GKIM is its capability to model anomalous entities distinctively in pedestrian flows representing different degrees of scatteredness and sparseness. In the GKIM, we propose Jacobian matrix and Gaussian based kernel integration to transform spatio-temporal features into discriminant and unique representation. To the best of our knowledge, we are the first to explore R-CRF for entities classification in pedestrian flows.

  • 2.

    We extensively evaluate the proposed method on three standard datasets and compared to 11 state-of-the art methods. Our results show that the proposed method significantly outperforms all 10 state-of-the-art methods.

  • 3.

    We categorize state-of-the-art methods and present a comprehensive survey in this area in the next section.

To assess the proposed GKIM model, we perform extensive experiments on three benchmark datasets and compare the results with 11 state-of-the-art methods: the mixture of dynamic texture (MDT) [8], the mixture of optical flow (MPPCA) [14], the social force (SF) [15], the multiple location monitors (MLM) [16], the clustering and sparse coding (CSC) [4], the holistic features (HF) [5], hierarchical feature representation (HFR) [17], the pedestrian energy map (PEM) [18], the statistical histograms model (SHM) [6], the change detection model (CDM) [7], and the spatial-aware motion reconstruction (SMR)[19]. Our results show that GKIM achieves superior anomalous entities detection. Moreover, our proposed GKIM outperforms the compared methods in both frame-level and pixel-level analysis in terms of equal error rate (EER) and detection rate (DR). The frame-level analysis detects the presence of an anomalous entity in a frame regardless of its location. The pixel-level analysis localizes the anomalous entity in term of its pixels.

The rest of the paper is organized as follows. In Section 2, an overview of related work is provided. The proposed method for the detection and localization of anomalous entities is presented in Section 3. Experimental results on the benchmark datasets are shown in Section 4 and the conclusion is presented in Section 5.

Section snippets

Related work

Anomaly detection and motion segmentation methods are often correlated with each other, therefore, we discuss both by dividing them into three related categories. Methods considering only segmentation are categorized under the term motion segmentation and methods considering only anomaly detection are categorized under the term anomaly detection. Similarly, methods targeting both segmentation and anomaly are categorized under the term motion segmentation and anomaly detection.

In the motion

Proposed method for anomalous entities detection and localization

The proposed GKIM method consists of two main steps namely Gaussian kernel based feature integration and R-CRF model based classification. These steps are explained in detail in the following sections.

Experiments

We evaluate the performance of our proposed method for anomalous entities detection on three benchmark datasets available publicly. These include UCSD [8], UMN [45], and UCD [34].

Conclusion

We propose a novel GKIM method for anomalous entities detection and localization in pedestrian flows. The GKIM represents high quality description of anomalous entities in term of most distinctive information. The performance of our proposed method is tested on three datasets and compared to 10 closely related state-of-the-art methods. The performance metrics EER, DR, and ROC curves show that our method outperforms all the reference methods in both frame-level and pixel-level analysis.

As a

Acknowledgments

The work described in this paper is jointly supported by the University of Hail, Saudi Arabia and COMSATS Institute of Information Technology, Pakistan.

Habib Ullah did Ph.D. in 2015 from the University of Trento, Italy and the M.Sc. degree in Computer Engineering in 2009 from Hanyang University, Seoul, South Korea. He is currently working as Assistant Professor in the College of Computer Science and Engineering at the University of Ha’il, Ha’il, Saudi Arabia. His research interests include pedestrian motion analysis, video surveillance, and machine learning.

References (48)

  • J.D. Dávila

    Cities as innovation towards a new understanding of population growth, social inequality and urban sustainability

    Cities in the 21st Century

    (2016)
  • M. Marsden et al.

    Holistic features for real-time crowd behaviour anomaly detection

    Proceedings of the International Conference on Image Processing, IEEE ICIP

    (2016)
  • I.R. de Almeida et al.

    Detection of global and local motion changes in human crowds

    Trans. Circuits Syst. Video Technol. IEEE TCSVT

    (2017)
  • V. Mahadevan et al.

    Anomaly detection in crowded scenes

    Proceedings of the IEEE CVPR

    (2010)
  • YangX. et al.

    Local difference binary for ultrafast and distinctive feature description

    Trans. Pattern Anal. Mach. Intell. IEEE PAMI

    (2014)
  • J. Byrne

    Nested motion descriptors

    Proceedings of the International Conference on Computer Vision and Pattern Recognition, IEEE CVPR

    (2015)
  • YaoK. et al.

    Recurrent conditional random fields

    Proceedings of the Neural Information Processing Systems, NIPS Deep Learning Workshop

    (2013)
  • LaffertyJ. et al.

    Conditional random fields: probabilistic models for segmenting and labeling sequence data

    Proceedings of the International Conference on Machine Learning, ICML

    (2001)
  • YaoK. et al.

    Recurrent neural networks for language understanding.

    Interspeech

    (2013)
  • KimJ. et al.

    Observe locally, infer globally: a space-time MRF for detecting abnormal activities with incremental updates

    Proceedings of the IEEE CVPR

    (2009)
  • R. Mehran et al.

    Abnormal crowd behavior detection using social force model

    Proceedings of the IEEE CVPR

    (2009)
  • A. Adam et al.

    Robust real-time unusual event detection using multiple fixed-location monitors

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2008)
  • ChengK.-W. et al.

    Video anomaly detection and localization using hierarchical feature representation and Gaussian process regression

    Proceedings of the International Conference on Computer Vision and Pattern Recognition, IEEE CVPR

    (2015)
  • YiS. et al.

    Pedestrian behavior modeling from stationary crowds with applications to intelligent surveillance

    Trans. Image Process. IEEE TIP

    (2016)
  • Cited by (0)

    Habib Ullah did Ph.D. in 2015 from the University of Trento, Italy and the M.Sc. degree in Computer Engineering in 2009 from Hanyang University, Seoul, South Korea. He is currently working as Assistant Professor in the College of Computer Science and Engineering at the University of Ha’il, Ha’il, Saudi Arabia. His research interests include pedestrian motion analysis, video surveillance, and machine learning.

    Ahmed B. Altamimi did Ph.D. and M.S. from the University of Victoria, Canada. He is currently working as Vice-Dean and Assistant Professor in the College of Computer Science and Engineering at the University of Ha’il, Ha’il, Saudi Arabia.His research interests include communication networks, distributed systems, and computer vision.

    Muhammad Uzair did Ph.D. from The University of Western Australia and MS from Hanyang University South Korea. He is currently working as Assistant Professor in the Department of Electrical Engineering at COMSATS Institute of Information Technology Wah Campus, Pakistan.His primary research areas are computer vision and machine learning. His research interests include hyperspectral image analysis, spectroscopy, face recognition, domain adaptation and video based classification.

    Mohib Ullah received bachelor degree in Electronic and Computer engineering from Politecnico Di Torino, Italy, in 2012, and Master degree in Telecommunication engineering from the University of Trento, Italy, in 2015. He is currently pursuing the Ph.D. degree in computer science from Norwegian University of Science and Technology (NTNU), Gjøvik, Norway. His research interests include crowd analysis, object detection and tracking.

    View full text