skip to main content
research-article

Detection and Recognition of Driver Distraction Using Multimodal Signals

Published:12 December 2022Publication History
Skip Abstract Section

Abstract

Distracted driving is a leading cause of accidents worldwide. The tasks of distraction detection and recognition have been traditionally addressed as computer vision problems. However, distracted behaviors are not always expressed in a visually observable way. In this work, we introduce a novel multimodal dataset of distracted driver behaviors, consisting of data collected using twelve information channels coming from visual, acoustic, near-infrared, thermal, physiological and linguistic modalities. The data were collected from 45 subjects while being exposed to four different distractions (three cognitive and one physical). For the purposes of this paper, we performed experiments with visual, physiological, and thermal information to explore potential of multimodal modeling for distraction recognition. In addition, we analyze the value of different modalities by identifying specific visual, physiological, and thermal groups of features that contribute the most to distraction characterization. Our results highlight the advantage of multimodal representations and reveal valuable insights for the role played by the three modalities on identifying different types of driving distractions.

REFERENCES

  1. [1] Abouelenien Mohamed, Pérez-Rosas Verónica, Mihalcea Rada, and Burzo Mihai. 2017. Detecting deceptive behavior via integration of discriminative features from multiple modalities. IEEE Transactions on Information Forensics and Security 12, 5 (2017), 10421055.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Abouelnaga Yehya, Eraqi Hesham M., and Moustafa Mohamed N.. 2017. Real-time distracted driver posture classification. arXiv preprint arXiv:1706.09498 (2017).Google ScholarGoogle Scholar
  3. [3] Avinash W., Dvijesh S., and Ioannis P.. 2010. A novel method to monitor driver’s distractions. In Proceedings of the 28th International Conference Extended Abstracts on Human Factors in Computing Systems, Atlanta, Georgia, USA.Google ScholarGoogle Scholar
  4. [4] Baltrušaitis Tadas, Robinson Peter, and Morency Louis-Philippe. 2016. Openface: An open source facial behavior analysis toolkit. In 2016 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 110.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Begum Shahina. 2013. Intelligent driver monitoring systems based on physiological sensor signals: A review. In 16th International IEEE Conference on Intelligent Transportation Systems (ITSC’13). IEEE, 282289.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Cardone Daniela, Perpetuini David, Filippini Chiara, Spadolini Edoardo, Mancini Lorenza, Chiarelli Antonio Maria, and Merla Arcangelo. 2020. Driver stress state evaluation by means of thermal imaging: A supervised machine learning approach based on ECG signal. Applied Sciences 10, 16 (Aug.2020), 5673. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Chen Lan-lan, Zhao Yu, Zhang Jian, and Zou Jun-zhong. 2015. Automatic detection of alertness/drowsiness from physiological signals using wavelet-based nonlinear features and machine learning. Expert Systems with Applications 42, 21 (2015), 73447355.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Cho Hyun-Seung, Yang Jin-Hee, Kim Sang-Min, Lee Jeong-Whan, Kwak Hwi-Kuen, Chae Je-Wook, and Lee Joo-Hyeon. 2020. Development of a chest-belt-type biosignal-monitoring wearable platform system. Journal of Electrical Engineering & Technology 15, 4 (2020), 18471855.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Dobbins Chelsea and Fairclough Stephen. 2018. Detecting negative emotions during real-life driving via dynamically labelled physiological data. In 2018 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops). IEEE, 830835.Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Florence Curtis, Simon Thomas, Haegerich Tamara, Luo Feijun, and Zhou Chao. 2015. Estimated lifetime medical and work-loss costs of fatal injuries-United States, 2013. Morbidity and Mortality Weekly Report 64, 38 (2015), 10741077.Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Forczmański Paweł and Smoliński Anton. 2021. Supporting driver physical state estimation by means of thermal image processing. In International Conference on Computational Science. Springer, 149163.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] Friedman Jerome H.. 2002. Stochastic gradient boosting. Computational Statistics & Data Analysis 38, 4 (2002), 367378.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Grassi Guido, Vailati Sabrina, Bertinieri Giovanni, Seravalle Gino, Stella Maria Luisa, Dell’Oro Raffaella, and Mancia Giuseppe. 1998. Heart rate as marker of sympathetic activity. Journal of Hypertension 16, 11 (1998), 16351639.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Herman David Michael. 2020. Monitoring of steering wheel engagement for autonomous vehicles. (Sept.2020). US Patent App. 16/294,541.Google ScholarGoogle Scholar
  15. [15] Kane Michael J., Conway Andrew R. A., Miura Timothy K., and Colflesh Gregory J. H.. 2007. Working memory, attention control, and the N-back task: A question of construct validity.Journal of Experimental Psychology: Learning, Memory, and Cognition 33, 3 (2007), 615.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Knapik Mateusz and Cyganek Bogusław. 2019. Driver’s fatigue recognition based on yawn detection in thermal images. Neurocomputing 338 (2019), 274292.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] Kolli Abhiram, Fasih Alireza, Machot Fadi Al, and Kyamakya Kyandoghere. 2011. Non-intrusive car driver’s emotion recognition using thermal camera. In Proceedings of the Joint INDS’11 & ISTET’11. IEEE, 15.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Kose Neslihan, Kopuklu Okan, Unnervik Alexander, and Rigoll Gerhard. 2019. Real-time driver state monitoring using a CNN based spatio-temporal approach. In 2019 IEEE Intelligent Transportation Systems Conference (ITSC). IEEE, 32363242.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Lee John D., McGehee Daniel V., Brown Timothy L., and Reyes Michelle L.. 2002. Collision warning timing, driver distraction, and driver response to imminent rear-end collisions in a high-fidelity driving simulator. Human Factors 44, 2 (2002), 314334. DOI: arXiv:PMID: 12452276.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Levy Matthew N.. 1971. Brief reviews: Sympathetic-parasympathetic interactions in the heart. Circulation Research 29, 5 (1971), 437445.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Liaw Andy, Wiener Matthew, et al. 2002. Classification and regression by Random Forest. R News 2, 3 (2002), 1822.Google ScholarGoogle Scholar
  22. [22] Liu Tianchi, Yang Yan, Huang Guang-Bin, Yeo Yong Kiang, and Lin Zhiping. 2015. Driver distraction detection using semi-supervised machine learning. IEEE Transactions on Intelligent Transportation Systems 17, 4 (2015), 11081120.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Lopez Miguel Bordallo, Blanco Carlos R del, and Garcia Narciso. 2017. Detecting exercise-induced fatigue using thermal imaging and deep learning. In 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA). IEEE, 16.Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Mbouna Ralph Oyini, Kong Seong G., and Chun Myung-Geun. 2013. Visual analysis of eye state and head pose for driver alertness monitoring. IEEE Transactions on Intelligent Transportation Systems 14, 3 (2013), 14621469.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] McDonald Anthony D., Ferris Thomas K., and Wiener Tyler A.. 2020. Classification of driver distraction: A comprehensive analysis of feature generation, machine learning, and input measures. Human Factors 62, 6 (2020), 10191035. DOI: arXiv:PMID: 31237788.Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Meyers H.. 2010. ProComp infiniti/biograph infiniti biofeedback system (version 5.1. 2). Montreal, QB: Thought Technology Ltd (2010).Google ScholarGoogle Scholar
  27. [27] Mühlbacher-Karrer Stephan, Mosa Ahmad Haj, Faller Lisa-Marie, Ali Mouhannad, Hamid Raiyan, Zangl Hubert, and Kyamakya Kyandoghere. 2017. A driver state detection system-combining a capacitive hand detection sensor with physiological sensors. IEEE Transactions on Instrumentation and Measurement 66, 4 (2017), 624636.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Control Centers for Disease Control & Prevention (CDC) National Center for Injury Prevention &. 2019. Distracted Driving. https://www.cdc.gov/motorvehiclesafety/distracted_driving/index.html. (2019). [Online; accessed 13-April-2020].Google ScholarGoogle Scholar
  29. [29] Control Centers for Disease Control & Prevention (CDC) National Center for Injury Prevention &. 2019. Road Traffic Injuries and Deaths—A Global Problem. https://www.cdc.gov/injury/features/global-road-safety/index.html. (2019). [Online; accessed 13-April-2020].Google ScholarGoogle Scholar
  30. [30] Control Centers for Disease Control & Prevention (CDC) National Center for Injury Prevention &. 2020. Cost of Injury Data. https://www.cdc.gov/injury/wisqars/cost/index.html. (2020). [Online; accessed 13-April-2020].Google ScholarGoogle Scholar
  31. [31] (NHTSA) US Department of Transportation National Highway Traffic Safety Administration. 2019. Distracted Driving. https://www.nhtsa.gov/risky-driving/distracted-d riving. (2019). [Online; accessed 13-April-2020].Google ScholarGoogle Scholar
  32. [32] Odate Shotaro, Sakamoto Naohiro, and Midorikawa Yukinori. 2020. Development of Electrostatic Capacity Type Steering Sensor Using Conductive Leather. Technical Report. SAE Technical Paper.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Papakostas Michalis, Rajavenkatanarayanan Akilesh, and Makedon Fillia. 2019. CogBeacon: A multi-modal dataset and data-collection platform for modeling cognitive fatigue. Technologies 7, 2 (2019), 46.Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] Pedregosa Fabian, Varoquaux Gaël, Gramfort Alexandre, Michel Vincent, Thirion Bertrand, Grisel Olivier, Blondel Mathieu, Prettenhofer Peter, Weiss Ron, Dubourg Vincent, et al. 2011. Scikit-learn: Machine learning in Python. The Journal of Machine Learning Research 12 (2011), 28252830.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. [35] Rao Xuli, Lin Feng, Chen Zhide, and Zhao Jiaxu. 2019. Distracted driving recognition method based on deep convolutional neural network. Journal of Ambient Intelligence and Humanized Computing (2019), 18.Google ScholarGoogle Scholar
  36. [36] Rastgoo Mohammad Naim, Nakisa Bahareh, Maire Frederic, Rakotonirainy Andry, and Chandran Vinod. 2019. Automatic driver stress level classification using multimodal deep learning. Expert Systems with Applications 138 (2019), 112793. DOI:Google ScholarGoogle Scholar
  37. [37] Riani Kais, Papakostas Michalis, Kokash Hussein, Abouelenien Mohamed, Burzo Mihai, and Mihalcea Rada. 2020. Towards detecting levels of alertness in drivers using multiple modalities. In Proceedings of the 13th ACM International Conference on Pervasive Technologies Related to Assistive Environments (PETRA’20). Association for Computing Machinery, New York, NY, USA, Article 12, 9 pages. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. [38] Ringeval Fabien, Schuller Björn, Valstar Michel, Jaiswal Shashank, Marchi Erik, Lalanne Denis, Cowie Roddy, and Pantic Maja. 2015. Av+ ec 2015: The first affect recognition challenge bridging across audio, video, and physiological data. In Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge. 38.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. [39] Rodríguez-Colón Sol M., He Fan, Bixler Edward O., Fernandez-Mendoza Julio, Vgontzas Alexandros N., Calhoun Susan, Zheng Zhi-Jie, and Liao Duanping. 2015. Sleep variability and cardiac autonomic modulation in adolescents–Penn State Child Cohort (PSCC) study. Sleep Medicine 16, 1 (2015), 6772.Google ScholarGoogle ScholarCross RefCross Ref
  40. [40] Rongben Wang, Lie Guo, Bingliang Tong, and Lisheng Jin. 2004. Monitoring mouth movement for driver fatigue or distraction with one camera. In Proceedings of the 7th International IEEE Conference on Intelligent Transportation Systems (IEEE Cat. No. 04TH8749). IEEE, 314319.Google ScholarGoogle ScholarCross RefCross Ref
  41. [41] Saragih Jason M., Lucey Simon, and Cohn Jeffrey F.. 2011. Deformable model fitting by regularized landmark mean-shift. International Journal of Computer Vision 91, 2 (2011), 200215.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. [42] Schif Diana, Forst Holger, and Schwarz Ulrich T.. 2021. Methods for sweat detection in cars based on thermal images of the face. IEEE Sensors Journal (2021).Google ScholarGoogle ScholarCross RefCross Ref
  43. [43] Sharma Pragya, Hui Xiaonan, Zhou Jianlin, Conroy Thomas B., and Kan Edwin C.. 2020. Wearable radio-frequency sensing of respiratory rate, respiratory volume, and heart rate. NPJ Digital Medicine 3, 1 (2020), 110.Google ScholarGoogle ScholarCross RefCross Ref
  44. [44] Shi Jianbo et al. 1994. Good features to track. In 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 593600.Google ScholarGoogle Scholar
  45. [45] Shin Heung-Sub, Jung Sang-Joong, Kim Jong-Jin, and Chung Wan-Young. 2010. Real time car driver’s condition monitoring system. In Sensors, 2010 IEEE, 951954.Google ScholarGoogle Scholar
  46. [46] Sinha Sudipta N., Frahm Jan-Michael, Pollefeys Marc, and Genc Yakup. 2006. GPU-based video feature tracking and matching. In EDGE, Workshop on Edge Computing Using New Commodity Architectures, Vol. 278. 4321.Google ScholarGoogle Scholar
  47. [47] Sinnapolu GiriBabu and Alawneh Shadi. 2020. Intelligent wearable heart rate sensor implementation for in-vehicle infotainment and assistance. Internet of Things 12 (2020), 100277.Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Solovey Erin T., Zec Marin, Perez Enrique Abdon Garcia, Reimer Bryan, and Mehler Bruce. 2014. Classifying driver workload using physiological and driving performance data: Two field studies. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 40574066.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. [49] Taamneh Salah, Tsiamyrtzis Panagiotis, Dcosta Malcolm, Buddharaju Pradeep, Khatri Ashik, Manser Michael, Ferris Thomas, Wunderlich Robert, and Pavlidis Ioannis. 2017. A multimodal dataset for various forms of distracted driving. Scientific Data 4 (2017), 170110.Google ScholarGoogle ScholarCross RefCross Ref
  50. [50] Tian Y.-I., Kanade Takeo, and Cohn Jeffrey F.. 2001. Recognizing action units for facial expression analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 23, 2 (2001), 97115.Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. [51] Verma R., Mitra B., and Chakraborty Sandip. 2019. Avoiding stress driving: Online trip recommendation from driving behavior prediction. 2019 IEEE International Conference on Pervasive Computing and Communications (PerCom’19), 110.Google ScholarGoogle Scholar
  52. [52] Wang K., Murphey Y. L., Zhou Y., Hu X., and Zhang X.. 2019. Detection of driver stress in real-world driving environment using physiological signals. In 2019 IEEE 17th International Conference on Industrial Informatics (INDIN), Vol. 1. 18071814.Google ScholarGoogle ScholarCross RefCross Ref
  53. [53] Xie Yongquan, Murphey Yi L., and Kochhar Dev. 2019. Personalized driver workload estimation using deep neural network learning from physiological and vehicle signals. IEEE Transactions on Intelligent Vehicles (2019).Google ScholarGoogle Scholar
  54. [54] Yang Shuo, Yin Zhong, Wang Yagang, Zhang Wei, Wang Yongxiong, and Zhang Jianhua. 2019. Assessing cognitive mental workload via EEG signals and an ensemble deep learning classifier based on denoising autoencoders. Computers in Biology and Medicine 109 (2019), 159170. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. [55] Zepf Sebastian, Haouij Neska El, Lee Jinmo, Ghandeharioun Asma, Hernandez Javier, and Picard Rosalind W.. 2020. Studying personalized just-in-time auditory breathing guides and potential safety implications during simulated driving(UMAP’20). Association for Computing Machinery, New York, NY, USA, 275283. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Detection and Recognition of Driver Distraction Using Multimodal Signals

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Interactive Intelligent Systems
          ACM Transactions on Interactive Intelligent Systems  Volume 12, Issue 4
          December 2022
          321 pages
          ISSN:2160-6455
          EISSN:2160-6463
          DOI:10.1145/3561952
          Issue’s Table of Contents

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 12 December 2022
          • Online AM: 27 April 2022
          • Accepted: 15 February 2022
          • Revised: 16 November 2021
          • Received: 9 August 2021
          Published in tiis Volume 12, Issue 4

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        View Full Text

        HTML Format

        View this article in HTML Format .

        View HTML Format