skip to main content
10.1145/3173574.3173576acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
research-article

Deep Thermal Imaging: Proximate Material Type Recognition in the Wild through Deep Learning of Spatial Surface Temperature Patterns

Published:19 April 2018Publication History

ABSTRACT

We introduce Deep Thermal Imaging, a new approach for close-range automatic recognition of materials to enhance the understanding of people and ubiquitous technologies of their proximal environment. Our approach uses a low-cost mobile thermal camera integrated into a smartphone to capture thermal textures. A deep neural network classifies these textures into material types. This approach works effectively without the need for ambient light sources or direct contact with materials. Furthermore, the use of a deep learning network removes the need to handcraft the set of features for different materials. We evaluated the performance of the system by training it to recognize 32 material types in both indoor and outdoor environments. Our approach produced recognition accuracies above 98% in 14,860 images of 15 indoor materials and above 89% in 26,584 images of 17 outdoor materials. We conclude by discussing its potentials for real-time use in HCI applications and future directions.

Skip Supplemental Material Section

Supplemental Material

pn1011-file5.mp4

mp4

22.6 MB

References

  1. Miika Aittala, Tim Weyrich, and Jaakko Lehtinen. 2015. Two-shot SVBRDF capture for stationary materials. ACM Transactions on Graphics (TOG) - Proceedings of ACM SIGGRAPH 2015 34, 4: 110--122. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Sean Bell, Paul Upchurch, Noah Snavely, and Kavita Bala. 2015. Material Recognition in the Wild With the Materials in Context Database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3479--3487.Google ScholarGoogle ScholarCross RefCross Ref
  3. LouAnne E. Boyd, Xinlong Jiang, and Gillian R. Hayes. 2017. ProCom: Designing and Evaluating a Mobile and Wearable System to Support Proximity Awareness for People with Autism. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems (CHI '17), 2865--2877. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. William D. Callister and David G. Rethwisch. 2011. Materials science and engineering: an introduction. John Wiley & Sons NY.Google ScholarGoogle Scholar
  5. Victor C. Chen and Hao Ling. 2002. Time-frequency transforms for radar imaging and signal analysis. Artech House.Google ScholarGoogle Scholar
  6. E. Cheung and V. J. Lumelsky. 1989. Proximity sensing in robot manipulator motion planning: system and implementation issues. IEEE Transactions on Robotics and Automation 5, 6: 740--751.Google ScholarGoogle ScholarCross RefCross Ref
  7. Youngjun Cho, Andrea Bianchi, Nicolai Marquardt, and Nadia Bianchi-Berthouze. 2016. RealPen: Providing Realism in Handwriting Tasks on Touch Surfaces Using Auditory-Tactile Feedback. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology (UIST '16), 195-- 205. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Youngjun Cho, Nadia Bianchi-Berthouze, and Simon J. Julier. 2017. DeepBreath: Deep Learning of Breathing Patterns for Automatic Stress Recognition using LowCost Thermal Imaging in Unconstrained Settings. In the 7th International Conference on Affective Computing and Intelligent Interaction, ACII 2017, 456--463.Google ScholarGoogle Scholar
  9. Youngjun Cho, Munchae Joung, and Sunuk Kim. 2015. Electronic device having proximity touch function and control method thereof, US Patent, Publication Number: US2015/0062087. https://patents.google.com/patent/US20150062087A1Google ScholarGoogle Scholar
  10. Youngjun Cho, Simon J. Julier, Nicolai Marquardt, and Nadia Bianchi-Berthouze. 2017. Robust tracking of respiratory rate in high-dynamic range scenes using mobile thermal imaging. Biomedical Optics Express 8, 10: 4480--4503.Google ScholarGoogle ScholarCross RefCross Ref
  11. Mircea Cimpoi, Subhransu Maji, and Andrea Vedaldi. 2015. Deep Filter Banks for Texture Recognition and Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3828-- 3836.Google ScholarGoogle ScholarCross RefCross Ref
  12. Mike Crang and Nigel Thrift. 2002. Thinking Space. Routledge.Google ScholarGoogle Scholar
  13. Anind K. Dey and Jonna Häkkilä. 2008. ContextAwareness and Mobile Devices. Handbook of research on user interface design and evaluation for mobile technology: 205--217.Google ScholarGoogle Scholar
  14. Andrey Dimitrov and Mani Golparvar-Fard. 2014. Vision-based material recognition for automated monitoring of construction progress and generating building information modeling from unordered site image collections. Advanced Engineering Informatics 28, 1: 37--49. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Paul Dourish. 2004. What we talk about when we talk about context. Personal and Ubiquitous Computing 8, 1: 19--30.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Marks Eric and Teizer Jochen. Proximity Sensing and Warning Technology for Heavy Construction Equipment Operation. Construction Research Congress 2012: 981--990.Google ScholarGoogle Scholar
  17. Jakob Eriksson, Lewis Girod, Bret Hull, Ryan Newton, Samuel Madden, and Hari Balakrishnan. 2008. The pothole patrol: using a mobile sensor network for road surface monitoring. In Proceedings of the 6th international conference on Mobile systems, applications, and services, 29--39. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Ross Girshick. 2015. Fast R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, 1440--1448. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Kotaro Hara, Vicki Le, and Jon Froehlich. 2013. Combining Crowdsourcing and Google Street View to Identify Street-level Accessibility Problems. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13), 631--640. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Chris Harrison and Scott E. Hudson. 2008. Lightweight Material Detection for Placement-aware Mobile Computing. In Proceedings of the 21st Annual ACM Symposium on User Interface Software and Technology (UIST '08), 279--282. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Antonius Hendriks, Damian M. Lyons, and Frank Guida. 2001. Vacuum cleaner with obstacle avoidance, US Patent, Publication Number: US6226830 B1. https://patents.google.com/patent/US6226830B1Google ScholarGoogle Scholar
  22. Ken Hinckley and Mike Sinclair. 1999. Touch-sensing Input Devices. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '99), 223--230. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Catherine Holloway and Nick Tyler. 2013. A microlevel approach to measuring the accessibility of footways for wheelchair users using the Capability Model. Transportation planning and technology 36, 7: 636--649.Google ScholarGoogle Scholar
  24. H. Holone, G. Misund, and H. Holmstedt. 2007. Users Are Doing It For Themselves: Pedestrian Navigation With User Generated Content. In The 2007 International Conference on Next Generation Mobile Applications, Services and Technologies (NGMAST 2007), 91--99. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Harald Holone, Gunnar Misund, H\a akon Tolsby, and Steinar Kristoffersen. 2008. Aspects of Personal Navigation with Collaborative User Feedback. In Proceedings of the 5th Nordic Conference on Humancomputer Interaction: Building Bridges (NordiCHI '08), 182--191. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Max Jaderberg, Karen Simonyan, Andrew Zisserman, and koray kavukcuoglu. 2015. Spatial Transformer Networks. In Proceedings of Advances in Neural Information Processing Systems 28 (NIPS), 2017-- 2025. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Piyawan Kasemsuppakorn and Hassan A. Karimi. 2009. Personalised routing for wheelchair navigation. Journal of Location Based Services 3, 1: 24--54. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. Kölzer, E. Oesterschulze, and G. Deboy. 1996. Thermal imaging and measurement techniques for electronic materials and devices. Microelectronic Engineering 31, 1: 251--270.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Proceedings of Advances in Neural Information Processing Systems 25 (NIPS), 1097--1105. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Daniel Kurz. 2014. Thermal touch: Thermographyenabled everywhere touch interfaces for mobile augmented reality applications. In Mixed and Augmented Reality (ISMAR), 2014 IEEE International Symposium on, 9--16.Google ScholarGoogle Scholar
  31. Gierad Laput, Robert Xiao, and Chris Harrison. 2016. ViBand: High-Fidelity Bio-Acoustic Sensing Using Commodity Smartwatch Accelerometers. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology (UIST '16), 321-- 333. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Eric Larson, Gabe Cohn, Sidhant Gupta, Xiaofeng Ren, Beverly Harrison, Dieter Fox, and Shwetak Patel. 2011. HeatWave: Thermal Imaging for Surface User Interaction. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '11), 2565--2574. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. John J. Leonard and Hugh F. Durrant-Whyte. 2012. Directed Sonar Sensing for Mobile Robot Navigation. Springer Science & Business Media.Google ScholarGoogle Scholar
  34. C. Liu, L. Sharan, E. H. Adelson, and R. Rosenholtz. 2010. Exploring features in a Bayesian framework for material recognition. In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 239--246.Google ScholarGoogle Scholar
  35. H. Liu, X. Song, J. Bimbo, L. Seneviratne, and K. Althoefer. 2012. Surface material recognition through haptic exploration using an intelligent contact sensing finger. In 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 52--57.Google ScholarGoogle Scholar
  36. J. M. Lloyd. 2013. Thermal Imaging Systems. Springer Science & Business Media.Google ScholarGoogle Scholar
  37. Sapan Naik and Bankim Patel. 2017. Thermal imaging with fuzzy classifier for maturity and size based nondestructive mango (Mangifera Indica L.) grading. In Emerging Trends & Innovation in ICT (ICEI), 2017 International Conference on, 15--20.Google ScholarGoogle ScholarCross RefCross Ref
  38. Vinod Nair and Geoffrey E. Hinton. 2010. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th international conference on machine learning (ICML-10), 807--814. Retrieved from http://machinelearning.wustl.edu/mlpapers/paper_files/ icml2010_NairH10.pdf Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. I. Pavlidis, Peter Symosek, B. Fritz, Mike Bazakos, and Nikolaos Papanikolopoulos. 2000. Automatic detection of vehicle occupants: the imaging problemand its solution. Machine Vision and Applications 11, 6: 313-- 320. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In Proceedings of Advances in Neural Information Processing Systems 28 (NIPS), 91--99. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. E. F. J. Ring and K. Ammer. 2012. Infrared thermal imaging in medicine. Physiological Measurement 33, 3: R33.Google ScholarGoogle ScholarCross RefCross Ref
  42. Marc Rioux. 1984. Laser range finder based on synchronized scanners. Applied Optics 23, 21: 3837-- 3844.Google ScholarGoogle ScholarCross RefCross Ref
  43. Yvonne Rogers. 2011. Interaction design gone wild: striving for wild theory. Interactions 18, 4: 58--62. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Bernardino Romera-Paredes and Philip Torr. 2015. An embarrassingly simple approach to zero-shot learning. In Proceedings of the 32nd International Conference on International Conference on Machine Learning (ICML), 2152--2161. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Chandra Roychoudhuri, Al F. Kracklauer, and Kathy Creath. 2008. The nature of light: what is a photon? CRC Press.Google ScholarGoogle Scholar
  46. Fauzia Saeed, Siti Qamariatul, Mujib Rahman, and Alan Woodside. 2015. The state of pothole management in UK local authority. Bituminous Mixtures and Pavements VI: 153--159.Google ScholarGoogle Scholar
  47. Alireza Sahami Shirazi, Yomna Abdelrahman, Niels Henze, Stefan Schneegass, Mohammadreza Khalilbeigi, and Albrecht Schmidt. 2014. Exploiting Thermal Reflection for Interactive Systems. In Proceedings of the 32Nd Annual ACM Conference on Human Factors in Computing Systems (CHI '14), 3483--3492. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Munehiko Sato, Shigeo Yoshida, Alex Olwal, Boxin Shi, Atsushi Hiyama, Tomohiro Tanikawa, Michitaka Hirose, and Ramesh Raskar. 2015. SpecTrans: Versatile Material Classification for Interaction with Textureless, Specular and Transparent Surfaces. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15), 2191--2200. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Albrecht Schmidt, Michael Beigl, and Hans-W. Gellersen. 1999. There is more to context than location. Computers & Graphics 23, 6: 893--901.Google ScholarGoogle ScholarCross RefCross Ref
  50. Igor V. Tetko, David J. Livingstone, and Alexander I. Luik. 1995. Neural network studies. 1. Comparison of overfitting and overtraining. Journal of Chemical Information and Computer Sciences 35, 5: 826--833.Google ScholarGoogle ScholarCross RefCross Ref
  51. Andrea Vedaldi and Karel Lenc. 2015. MatConvNet: Convolutional Neural Networks for MATLAB. In Proceedings of the 23rd ACM International Conference on Multimedia (MM '15), 689--692. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Michael Vollmer and MÃ Klaus-Peter. 2017. Infrared thermal imaging: fundamentals, research and applications. John Wiley & Sons.Google ScholarGoogle Scholar
  53. Ting-Chun Wang, Jun-Yan Zhu, Ebi Hiroaki, Manmohan Chandraker, Alexei A. Efros, and Ravi Ramamoorthi. 2016. A 4D Light-Field Dataset and CNN Architectures for Material Recognition. In Computer Vision -- ECCV 2016 (Lecture Notes in Computer Science), 121--138.Google ScholarGoogle Scholar
  54. Jason Wiese, T. Scott Saponas, and A.J. Bernheim Brush. 2013. Phoneprioception: Enabling Mobile Phones to Infer Where They Are Kept. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13), 2157--2166. Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Hui-Shyong Yeo, Gergely Flamich, Patrick Schrempf, David Harris-Birtill, and Aaron Quigley. 2016. RadarCat: Radar Categorization for Input & Interaction. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology (UIST '16), 833--841. Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Hui-Shyong Yeo, Juyoung Lee, Andrea Bianchi, David Harris-Birtill, and Aaron Quigley. 2017. SpeCam: Sensing Surface Color and Material with the Frontfacing Camera of a Mobile Device. In Proceedings of the 19th International Conference on HumanComputer Interaction with Mobile Devices and Services (MobileHCI '17), 25:1--25:9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Xiangxin Zhu, Carl Vondrick, Deva Ramanan, and Charless C. Fowlkes. 2012. Do we need more training data or better models for object detection. In Proceedings of the British Machine Vision Conference (BMVC '12).Google ScholarGoogle Scholar
  58. Seeing AI. Retrieved from https://www.microsoft.com/en-us/seeing-ai/Google ScholarGoogle Scholar
  59. Encoded Reality. Retrieved from http://viral.media.mit.edu/projects/encoded_reality/Google ScholarGoogle Scholar
  60. Porsche Panamera 2017. Retrieved from http://www.motorauthority.com/news/1105162_2017- porsche-panamera-deep-dive/page-3#Google ScholarGoogle Scholar
  61. CAT S60: The World's First Thermal Imaging Smartphone. Retrieved from https://www.catphones.com/?product=cat-s60- smartphoneGoogle ScholarGoogle Scholar
  62. JETSON TK1. Retrieved from http://www.nvidia.com/object/jetson-tk1-embeddeddev-kit.htmlGoogle ScholarGoogle Scholar
  63. Caffe deep learning framework. Retrieved from http://caffe.berkeleyvision.org/Google ScholarGoogle Scholar
  64. TensorFlow Mobile. Retrieved from https://www.tensorflow.org/mobile/Google ScholarGoogle Scholar
  65. Dyson 360 Eye robot. Retrieved from https://www.dyson.co.uk/robot-vacuums/dyson-360- eye-overview.htmlGoogle ScholarGoogle Scholar
  66. LG HOM-BOT Square. Retrieved from http://www.lg.com/uk/hom-botGoogle ScholarGoogle Scholar
  67. Google Maps now lets users add wheelchair accessibility details for locations. Retrieved from https://techcrunch.com/2017/07/08/google-maps-nowlets-users-add-wheelchair-accessibility-details-forlocations/Google ScholarGoogle Scholar

Index Terms

  1. Deep Thermal Imaging: Proximate Material Type Recognition in the Wild through Deep Learning of Spatial Surface Temperature Patterns

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems
      April 2018
      8489 pages
      ISBN:9781450356206
      DOI:10.1145/3173574

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 19 April 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      CHI '18 Paper Acceptance Rate666of2,590submissions,26%Overall Acceptance Rate6,199of26,314submissions,24%

      Upcoming Conference

      CHI '24
      CHI Conference on Human Factors in Computing Systems
      May 11 - 16, 2024
      Honolulu , HI , USA

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader