research-article

Deep Thermal Imaging: Proximate Material Type Recognition in the Wild through Deep Learning of Spatial Surface Temperature Patterns

Authors:
Youngjun Cho, University College London, London, United Kingdom
Nadia Bianchi-Berthouze, University College London, London, United Kingdom
Nicolai Marquardt, University College London, London, United Kingdom
Simon J. Julier, University College London, London, United Kingdom

CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, April 2018, Paper No.: 2, Pages 1–13
https://doi.org/10.1145/3173574.3173576

Published: 19 April 2018

ABSTRACT

We introduce Deep Thermal Imaging, a new approach for close-range automatic recognition of materials that helps people and ubiquitous technologies better understand their proximal environment. Our approach uses a low-cost mobile thermal camera integrated into a smartphone to capture thermal textures, which a deep neural network then classifies into material types. The approach works effectively without ambient light sources and without direct contact with materials. Furthermore, using a deep learning network removes the need to handcraft a set of features for different materials. We evaluated the system by training it to recognize 32 material types in both indoor and outdoor environments. It achieved recognition accuracies above 98% on 14,860 images of 15 indoor materials and above 89% on 26,584 images of 17 outdoor materials. We conclude by discussing its potential for real-time use in HCI applications and future directions.
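
The pipeline the abstract describes (a thermal texture goes in, a material type comes out) can be illustrated with a toy example. The following is only a minimal sketch under stated assumptions, not the authors' network: the label names, the 3x3 kernels, the min-max normalisation step, the single convolution layer, and the random untrained weights are all hypothetical and chosen purely to show the shape of a convolutional classifier over a grid of surface temperatures.

```python
import math
import random

# Hypothetical label set: the abstract does not list the material names.
MATERIALS = ["material_a", "material_b", "material_c"]


def conv2d_valid(image, kernel):
    """Valid (no padding) 2D cross-correlation of a 2D list with a square kernel."""
    k = len(kernel)
    rows, cols = len(image) - k + 1, len(image[0]) - k + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j] for i in range(k) for j in range(k))
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def relu_mean(feature_map):
    """Apply ReLU, then average the map to one number (global average pooling)."""
    values = [max(0.0, v) for row in feature_map for v in row]
    return sum(values) / len(values)


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(thermal_patch, kernels, weights, biases):
    """Return (label, probabilities) for one patch of surface temperatures."""
    # Assumption: min-max normalise so the spatial pattern, not the absolute
    # temperature, drives the features.
    lo = min(min(row) for row in thermal_patch)
    hi = max(max(row) for row in thermal_patch)
    span = (hi - lo) or 1.0
    norm = [[(v - lo) / span for v in row] for row in thermal_patch]
    # One pooled feature per kernel, then a linear layer and softmax.
    features = [relu_mean(conv2d_valid(norm, k)) for k in kernels]
    scores = [
        sum(w * f for w, f in zip(ws, features)) + b
        for ws, b in zip(weights, biases)
    ]
    probs = softmax(scores)
    return MATERIALS[probs.index(max(probs))], probs


# Untrained, randomly initialised parameters (illustrative only).
rng = random.Random(0)
kernels = [[[rng.uniform(-1, 1) for _ in range(3)] for _ in range(3)] for _ in range(4)]
weights = [[rng.uniform(-1, 1) for _ in range(4)] for _ in MATERIALS]
biases = [0.0] * len(MATERIALS)

# A synthetic 8x8 patch of temperatures in degrees Celsius.
patch = [[20.0 + rng.random() for _ in range(8)] for _ in range(8)]
label, probs = classify(patch, kernels, weights, biases)
print(label, [round(p, 3) for p in probs])
```

Because the weights are random, the predicted label is meaningless here. In a real system the kernels and weights would be learned from labelled thermal images, which is what removes the need for handcrafted features.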

Index Terms

1. Human-centered computing
  1. Human computer interaction (HCI)

Published in
CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems
April 2018
8489 pages
ISBN: 9781450356206
DOI: 10.1145/3173574
General Chairs: Regan Mandryk (University of Saskatchewan, Canada), Mark Hancock (University of Waterloo, Canada)
Program Chairs: Mark Perry (Brunel University London, UK), Anna Cox (University College London, UK)
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
context-aware mobile computing
deep learning
in the wild
material recognition
sensing
thermal imaging
Qualifiers
- research-article
Acceptance Rates
CHI '18 Paper Acceptance Rate: 666 of 2,590 submissions, 26%
Overall Acceptance Rate: 6,199 of 26,314 submissions, 24%