skip to main content
research-article

Mobile Multi-Food Recognition Using Deep Learning

Published:10 August 2017Publication History
Skip Abstract Section

Abstract

In this article, we propose a mobile food recognition system that uses the picture of the food, taken by the user’s mobile device, to recognize multiple food items in the same meal, such as steak and potatoes on the same plate, to estimate the calorie and nutrition of the meal. To speed up and make the process more accurate, the user is asked to quickly identify the general area of the food by drawing a bounding circle on the food picture by touching the screen. The system then uses image processing and computational intelligence for food item recognition. The advantage of recognizing items, instead of the whole meal, is that the system can be trained with only single item food images. At the training stage, we first use region proposal algorithms to generate candidate regions and extract the convolutional neural network (CNN) features of all regions. Second, we perform region mining to select positive regions for each food category using maximum cover by our proposed submodular optimization method. At the testing stage, we first generate a set of candidate regions. For each region, a classification score is computed based on its extracted CNN features and predicted food names of the selected regions. Since fast response is one of the important parameters for the user who wants to eat the meal, certain heavy computational parts of the application are offloaded to the cloud. Hence, the processes of food recognition and calorie estimation are performed in cloud server. Our experiments, conducted with the FooDD dataset, show an average recall rate of 90.98%, precision rate of 93.05%, and accuracy of 94.11% compared to 50.8% to 88% accuracy of other existing food recognition systems.

References

  1. www.obesitynetwork.ca.Google ScholarGoogle Scholar
  2. http://www.who.int.Google ScholarGoogle Scholar
  3. Parisa Pouladzadeh, Shervin Shirmohammadi, and Abdulsalam Yassine. 2016. You are what you eat: So, measure what you eat. IEEE Instrum. Meas. Mag. 19, 1, 9--15. Google ScholarGoogle ScholarCross RefCross Ref
  4. Parisa Pouladzadeh, Pallavi Kuhad, Sri Vijay Bharat Peddi, Abdulsalam Yassine, and Shervin Shirmohammadi. 2016. Calorie measurement and food classification using deep learning neural network In Proceedings of the IEEE International Conference on Instrumentation and Measurement Technology (I2MTC’16).Google ScholarGoogle Scholar
  5. P. Pouladzadeh, A. Yassine, and S. Shirmohammadi. 2015 FooDD: Food detection dataset for calorie measurement using food images, in new trends in image analysis and processing. In Proceedings of the ICIAP 2015 Workshops, V. Murino, E. Puppo, D. Sona, M. Cristani, and C. Sansone (eds.). Lecture Notes in Computer Science, Springer, Vol. 9281, 441--448. DOI:10.1007/978-3-319-23222-5_54 Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Sri Vijay Bharat Peddi, Abdulsalam Yassine, and Shervin Shirmohammadi. 2015. Cloud based virtualization for a calorie measurement e-health mobile application. In Proceedings of the 2015 International Conference on Multimedia and Expo Workshops (ICME’15). 1--6Google ScholarGoogle ScholarCross RefCross Ref
  7. Lukas Bossard, Matthieu Guillaumin, and Luc Van Gool. 2014. Food-101--mining discriminative components with random forests. In Proceeding of the European Conference on Computer Vision--(ECCV’14). 446--461.Google ScholarGoogle ScholarCross RefCross Ref
  8. Yuji Matsuda, Hajime Hoashi, and Keiji Yanai. 2012. Recognition of multiple-food images by detecting candidate regions. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME’12). 25--30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Yoshihiro Kawano and Katsuki Yanai. 2013. Real-time mobile food recognition system. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’13). 1--7. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Weiyu Zhang, Qian Yu, Behjat Siddiquie, Ajay Divakaran, and Harpreet Sawhney. 2015. Food recognition and nutrition estimation on a smartphone. J. Diabetes Sci. Technol. 9, 3, 525--533. Google ScholarGoogle ScholarCross RefCross Ref
  11. Satoru Fujishige. 2005. Submodular Functions and Optimization, Vol. 58. Elsevier.Google ScholarGoogle Scholar
  12. Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Cafe: Convolutional architecture for fast feature embedding. In Proceedings of the ACM International Conference on Multimedia. 675--678.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Laurence Awolsey. 1982. An analysis of the greedy algorithm for the submodular set covering problem. Combinatorica 2, 4, 385--393. Google ScholarGoogle ScholarCross RefCross Ref
  14. Fengqing Zhu, Marc Bosch, Nitin Khanna, and Carol J. Boushey. 2015. Multiple hypotheses image segmentation and classification with application to dietary assessment. IEEE J. Biomed. Health Informat. 19, 1, 377--389. Google ScholarGoogle ScholarCross RefCross Ref
  15. Hokuto Kagaya, Kiyoharu Aizawa, and Makoto Ogawa. 2014. Food detection and recognition using convolutional neural network. In Proceedings of the ACM International Conference on Multimedia, 1085--1088. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Kiyoharu Aizawa and Makoto Ogawa. 2015. FoodLog: Multimedia tool for healthcare applications. IEEE MultiMed. 22, 2, 4--9. Google ScholarGoogle ScholarCross RefCross Ref
  17. Sosuke Amano, Kiyoharu Aizawa, and Makoto Ogawa. 2015. Food category representatives: Extracting categories from meal names in food recordings and recipe data. In Proceedings of the IEEE International Conference on Multimedia Big Data. 48--55. DOI:10.1109/BigMM.2015.54 Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Colin Ware. 2008. Toward a perceptual theory of flow visualization. IEEE Comput. Graph. Appl. 28, 2 (2008), 6--11. DOI:http://dx.doi.org/10.1109/MCG.2008.39 Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Xi-Jin Zhang, Yi-Fan Lu, and Song-Hai Zhang. 2016. Multi-task learning for food identification and analysis with deep convolutional neural networks. J. Comput. Sci. Technol. 31, 3, 489--500. Google ScholarGoogle ScholarCross RefCross Ref
  20. Morteza Akbari Fard, Hamed Haddadi, and Alireza Tavakoli Targhi. 2016. Fruits and vegetables calorie counter using, convolutional neural networks. In ACM Dig. Health, 121--122.Google ScholarGoogle Scholar
  21. Ashutosh Singla, Lin Yuan, and Touradj Ebrahimi. 2016. Food/non-food image classification and food categorization using pre-trained googlenet model. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 3--11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Wataru Shimoda Keiji Yanai. 2016. Foodness proposal for multiple food detection by training of single food images. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 13--21.Google ScholarGoogle Scholar
  23. Joachim Dehais, Marios Anthimopoulos, and Stavroula Mougiakakou. 2016. Food image segmentation for dietary assessment. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 23--28. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. A. Krizhevsky, I. Sutskever, and G. Hinton, 2012. Imagenet classification with deep convolutional neural networks. In Proceedings of Neural Information Processing Systems (NIPS’12).Google ScholarGoogle Scholar
  25. M. D. Zeiler, M. Ranzato, R. Monga, M. Mao, K. Yang, Q. V. Le, P. Nguyen, A. Senior, V. Vanhoucke, J. Dean and G. E. Hinton. 2013. Onrectied linear units for speech processing. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP’13).Google ScholarGoogle Scholar
  26. Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. 1998. “Gradientbased learning applied to document recognition. Proc. IEEE, 86, 11: 2278--2324. Google ScholarGoogle ScholarCross RefCross Ref
  27. Parisa Pouladzadeh, Shervin Shirmohammadi, and Rana Almaghrabi. 2014. Measuring calorie and nutrition from food image. IEEE Trans. Instrument. Measure. 63, 8, 1947--1956. Google ScholarGoogle ScholarCross RefCross Ref
  28. Z. Su, Q. XU, M. Fei, and M. Dong. 2016. Game theoretic resource allocation in media cloud with mobile social users. IEEE Trans. Multimed. (TMM) 18, 8, 1650--1660.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Mobile Multi-Food Recognition Using Deep Learning

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 13, Issue 3s
      Special Section on Deep Learning for Mobile Multimedia and Special Section on Best Papers from ACM MMSys/NOSSDAV 2016
      August 2017
      258 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3119899
      Issue’s Table of Contents

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 10 August 2017
      • Accepted: 1 March 2017
      • Revised: 1 February 2017
      • Received: 1 October 2016
      Published in tomm Volume 13, Issue 3s

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader