DOI: 10.1145/3394171.3413827
research-article

Uncertainty-based Traffic Accident Anticipation with Spatio-Temporal Relational Learning

Published: 12 October 2020

ABSTRACT

Traffic accident anticipation aims to predict accidents from dashcam videos as early as possible, which is critical to safety-guaranteed self-driving systems. Because traffic scenes are cluttered and visual cues are limited, it is very challenging to predict how soon an accident will occur from the early observed frames. Most existing approaches learn features of accident-relevant agents for accident anticipation but ignore the features of their spatial and temporal relations. Moreover, current deterministic deep neural networks can be overconfident in false predictions, which raises the risk of traffic accidents caused by self-driving systems. In this paper, we propose an uncertainty-based accident anticipation model with spatio-temporal relational learning. It sequentially predicts the probability of traffic accident occurrence from dashcam videos. Specifically, we use graph convolution and recurrent networks for relational feature learning, and leverage Bayesian neural networks to address the intrinsic variability of latent relational representations. The derived uncertainty-based ranking loss is found to significantly boost model performance by improving the quality of relational features. In addition, we collect a new Car Crash Dataset (CCD) for traffic accident anticipation, which contains annotations of environmental attributes and accident reasons. Experimental results on both public datasets and the newly compiled dataset show state-of-the-art performance of our model. Our code and the CCD dataset are available at https://github.com/Cogito2012/UString.
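
The abstract does not say how uncertainty is computed from the Bayesian network. As a rough illustration only, assume the model is sampled several times for a single frame and each stochastic forward pass yields one accident probability. A commonly used decomposition for Bayesian classifiers then splits the predictive variance into an aleatoric part (noise inherent in the data) and an epistemic part (disagreement between sampled models). The function name `decompose_uncertainty` and the sample values below are hypothetical and are not taken from the authors' code.

```python
from statistics import fmean


def decompose_uncertainty(probs):
    """Split the predictive variance of a binary probability into two parts.

    `probs` holds accident probabilities for ONE frame, each produced by a
    separate stochastic forward pass (hypothetical inputs, not model output).
    Returns (mean probability, aleatoric, epistemic).
    """
    if not probs:
        raise ValueError("need at least one sampled probability")
    mean_p = fmean(probs)
    # Aleatoric: average Bernoulli variance p * (1 - p) over the samples.
    aleatoric = fmean([p * (1.0 - p) for p in probs])
    # Epistemic: spread of the sampled probabilities around their mean.
    epistemic = fmean([(p - mean_p) ** 2 for p in probs])
    return mean_p, aleatoric, epistemic


if __name__ == "__main__":
    # Made-up probabilities from five stochastic passes on one frame.
    samples = [0.62, 0.70, 0.55, 0.81, 0.66]
    mean_p, alea, epis = decompose_uncertainty(samples)
    print(f"mean={mean_p:.3f} aleatoric={alea:.4f} epistemic={epis:.4f}")
```

In this sketch the two parts always sum to `mean_p * (1 - mean_p)`, the variance of a Bernoulli variable with the averaged probability. When every sampled pass agrees, the epistemic term is zero and all remaining uncertainty is aleatoric.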

Supplemental Material

3394171.3413827.mp4 (MP4, 192 MB)

Published in

MM '20: Proceedings of the 28th ACM International Conference on Multimedia
October 2020
4889 pages
ISBN: 9781450379885
DOI: 10.1145/3394171

Copyright © 2020 ACM


Publisher

Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Overall Acceptance Rate: 995 of 4,171 submissions, 24%
