skip to main content
research-article

Surface Damage Identification for Heritage Site Protection: A Mobile Crowd-sensing Solution Based on Deep Learning

Published:14 March 2023Publication History
Skip Abstract Section

Abstract

This article addresses the general problem of built heritage protection against both deterioration and loss. To continuously monitor and update the structural health status, a crowd-sensing solution based on powerful and automatic deep learning technique is proposed. The aim of this solution is to get rid of the limitations of manual and visual damage detection methods that are costly and time-consuming. Instead, automatic visual inspection for damage detection on walls is efficiently and effectively performed using an embedded Convolutional Neural Network (CNN). This CNN detects the most frequent types of surface damage on wall photos. The study has been conducted in the Kasbah of Algiers where the four following types of damages have been considered: Efflorescence, spall, crack, and mold. The CNN is designed and trained to be integrated into a mobile application for a participatory crowd-sensing solution. The application should be widely and freely deployed, so any user can take a picture of a suspected damaged wall and get an instant and automatic diagnosis through the embedded CNN. In this context, we have chosen MobileNetV2 with a transfer learning approach. A set of real images have been collected and manually annotated and have been used for training, validation, and test. Extensive experiments have been conducted to assess the efficiency and the effectiveness of the proposed solution, using a 5-fold cross-validation procedure. Obtained results show in particular a mean weighted average precision of 0.868 ± 0.00862 (with a 99% of confidence level) and a mean weighted average recall of 0.84 ± 0.00729 (with a 99% of confidence level). To evaluate the performance of MobileNetV2 as a feature extractor, we conducted a comparative study with other small backbones. Further analysis of CNN activation using Grad-Cam has also been done. Obtained results show that our method remains effective even when using a small network and medium- to low-resolution images. MobileNetV2-based CNN size is smaller, and computational cost better, compared to the other CNNs, with similar performance results. Finally, detected surface damages have also been plotted on a geographic map, giving a global view of their distribution.

REFERENCES

  1. [1] Abdel-Qader Ikhlas, Abudayyeh Osama, and Kelly Michael E.. 2003. Analysis of edge-detection techniques for crack identification in bridges. J. Comput. Civil Eng. 17, 4 (2003), 255263. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Abdi Mohamed E. H., Naili Qais, Habbache Mahdi, Said Bilhadj, Boumenir Abdelhak, Douibi Taouba, Djermane Dahlia, and Berrani Sid-Ahmed. 2022. Effectively detecting left bundle branch block false defects in myocardial perfusion imaging (MPI) with a convolutional neural network (CNN). Stud. Health Technol. Inform. 289 (2022), 216219.Google ScholarGoogle Scholar
  3. [3] Alam Firoj, Imran Muhammad, and Ofli Ferda. 2017. Image4Act: Online social media image processing for disaster response. In IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). 601604.Google ScholarGoogle Scholar
  4. [4] Bayoudh Khaled, Hamdaoui Fayçal, and Mtibaa Abdellatif. 2021. Transfer learning based hybrid 2D-3D CNN for traffic sign recognition and semantic road detection applied in advanced driver assistance systems. Appl. Intell. 51, 1 (2021), 124142.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Bianco Simone, Cadene Remi, Celona Luigi, and Napoletano Paolo. 2018. Benchmark analysis of representative deep neural network architectures. IEEE Access 6 (2018), 6427064277. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Cawley Gavin C. and Talbot Nicola L. C.. 2010. On over-fitting in model selection and subsequent selection bias in performance evaluation. J. Mach. Learn. Res. 11, 70 (2010), 20792107. Retrieved from http://jmlr.org/papers/v11/cawley10a.html.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Cha Young-Jin, Choi Wooram, and Büyüköztürk Oral. 2017. Deep learning-based crack damage detection using convolutional neural networks. Comput.-aid. Civil Infrast. Eng. 32, 5 (2017), 361378. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Cha Young-Jin, Choi Wooram, Suh Gahyun, Mahmoudkhani Sadegh, and Büyüköztürk Oral. 2018. Autonomous structural visual inspection using region-based deep learning for detecting multiple damage types. Comput.-aid. Civil Infrast. Eng. 33, 9 (2018), 731747. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Chollet François. 2017. Transfer Learning and Fine-tuning. Tensorflow. Retrieved from https://www.tensorflow.org/tutorials/images/transfer_learning.Google ScholarGoogle Scholar
  10. [10] Du Yumin, Chen Wenwu, Cui Kai, Zhang Jingke, Chen Zhuo, and Zhang Qiyong. 2020. Damage assessment of earthen sites of the Ming Great Wall in Qinghai Province: A comparison between support vector machine (SVM) and BP neural network. J. Comput. Cult. Herit. 13, 2 (May2020). DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Dung Cao Vu and Anh Le Duc. 2019. Autonomous concrete crack detection using deep fully convolutional neural network. Autom. Construct. 99 (2019), 5258. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Fawcett Tom. 2006. An introduction to ROC analysis. Patt. Recog. Lett. 27, 8 (2006), 861874. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Fiorucci Marco, Khoroshiltseva Marina, Pontil Massimiliano, Traviglia Arianna, Bue Alessio Del, and James Stuart. 2020. Machine learning for cultural heritage: A survey. Patt. Recog. Lett. 133 (2020), 102108. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Fushiki Tadayoshi. 2011. Estimation of prediction error by using K-fold cross-validation. Statist. Comput. 21, 2 (1 Apr.2011), 137146. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Gibb Spencer, La Hung Manh, and Louis Sushil. 2018. A genetic algorithm for convolutional network structure optimization for concrete crack detection. In IEEE Congress on Evolutionary Computation (CEC’18). 18. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. [16] He Kaiming, Zhang Xiangyu, Ren Shaoqing, and Sun Jian. 2015. Deep residual learning for image recognition. CoRR abs/1512.03385 (2015).Google ScholarGoogle Scholar
  17. [17] He Kaiming, Zhang Xiangyu, Ren Shaoqing, and Sun Jian. 2016. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). 770778. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Heidari Mojtaba, Torabi-Kaveh Mehdi, Chastre Carlos, Ludovico-Marques Marco, Mohseni Hassan, and Akefi Hossein. 2017. Determination of weathering degree of the Persepolis stone under laboratory and natural conditions using fuzzy inference system. Construct. Build. Mater. 145 (2017), 2841. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Heritage Centre UNESCO World. 2021. UNESCO World Heritage Centre - State of Conservation (SOC 2021) Kasbah of Algiers (Algeria). UNESCO. Retrieved from https://whc.unesco.org/en/soc/4230/.Google ScholarGoogle Scholar
  20. [20] Howard Andrew G., Zhu Menglong, Chen Bo, Kalenichenko Dmitry, Wang Weijun, Weyand Tobias, Andreetto Marco, and Adam Hartwig. 2017. MobileNets: Efficient convolutional neural networks for mobile vision applications. CoRR abs/1704.04861 (2017).Google ScholarGoogle Scholar
  21. [21] Huang Gao, Liu Zhuang, and Weinberger Kilian Q.. 2016. Densely connected convolutional networks. CoRR abs/1608.06993 (2016).Google ScholarGoogle Scholar
  22. [22] jdariasl. 2021. Global Multi-label Confusion Matrix. Retrieved from https://github.com/jdariasl/global_multiLabel_confusion_matrix.Google ScholarGoogle Scholar
  23. [23] Kiliszek Damian and Kroszczyński Krzysztof. 2020. Performance of the precise point positioning method along with the development of GPS, GLONASS and Galileo systems. Measurement 164 (2020), 108009. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Krizhevsky Alex, Sutskever Ilya, and Hinton Geoffrey E.. 2012. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, Vol. 25. Curran Associates, Inc. Retrieved from https://proceedings.neurips.cc/paper/2012/file/c399862d3b9d6b76c8436e924a68c45b-Paper.pdf.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] Krstinić Damir, Braović Maja, Šerić Ljiljana, and Božić-Štulić Dunja. 2020. Multi-label classifier performance evaluation with confusion matrix. Comput. Sci. Inf. Technol. 10, 8 (2020).Google ScholarGoogle Scholar
  26. [26] Kumar Pakhee, Ofli Ferda, Imran Muhammad, and Castillo Carlos. 2020. Detection of disaster-affected cultural heritage sites from social media images using deep learning techniques. J. Comput. Cult. Herit. 13, 3 (Aug.2020). DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. [27] Lawrence S., Giles C. L., Tsoi Ah Chung, and Back A. D.. 1997. Face recognition: A convolutional neural-network approach. IEEE Trans. Neural Netw. 8, 1 (1997), 98113. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] LeCun Y., Boser B., Denker J. S., Henderson D., Howard R. E., Hubbard W., and Jackel L. D.. 1989. Backpropagation applied to handwritten zip code recognition. Neural Computat. 1, 4 (1989), 541551. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Li Shengyuan, Zhao Xuefeng, and Zhou Guangyi. 2019. Automatic pixel-level multiple damage detection of concrete structure using fully convolutional network. Comput.-aid. Civil Infrast. Eng. 34, 7 (2019), 616634.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Mishra Mayank. 2021. Machine learning techniques for structural health monitoring of heritage buildings: A state-of-the-art review and case studies. J. Cultur. Herit. 47 (2021), 227245. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Moens Marie-Francine, Jones Gareth J. F., Ghosh Saptarshi, Ganguly Debasis, Chakraborty Tanmoy, and Ghosh Kripabandhu. 2018. WWW’18 workshop on exploitation of social media for emergency relief and preparedness: Chairs’ welcome & organization. In the Web Conference (WWW’18). International World Wide Web Conferences Steering Committee, 16091611. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Patel Rupa and Chaware Anita. 2020. Transfer learning with fine-tuned MobileNetV2 for diabetic retinopathy. In International Conference for Emerging Technology (INCET’20). 14. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Perumal Ramani and Venkatachalam Subbiah Bharathi. 2021. Non invasive detection of moss and crack in monuments using image processing techniques. J. Amb. Intell. Human. Comput. 12, 5 (1 May2021), 52775285. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] Plotly. 2022. Mapbox Layers. Plotly. Retrieved from https://plotly.com/python/mapbox-layers/.Google ScholarGoogle Scholar
  35. [35] Pragalath Haran, Seshathiri Sankarasrinivasan, Rathod Harsh, Esakki Balasubramanian, and Gupta Rishi. 2018. Deterioration assessment of infrastructure using fuzzy logic and image processing algorithm. J. Perform. Construct. Facil. 32, 2 (2018), 04018009. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Refaeilzadeh Payam, Tang Lei, and Liu Huan. 2009. Cross-validation. Springer US, Boston, MA, 532538. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Ren Shaoqing, He Kaiming, Girshick Ross B., and Sun Jian. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. CoRR abs/1506.01497 (2015).Google ScholarGoogle Scholar
  38. [38] Russakovsky Olga, Deng Jia, Su Hao, Krause Jonathan, Satheesh Sanjeev, Ma Sean, Huang Zhiheng, Karpathy Andrej, Khosla Aditya, Bernstein Michael S., Berg Alexander C., and Fei-Fei Li. 2014. ImageNet large scale visual recognition challenge. CoRR abs/1409.0575 (2014).Google ScholarGoogle Scholar
  39. [39] Sandler Mark, Howard Andrew G., Zhu Menglong, Zhmoginov Andrey, and Chen Liang-Chieh. 2018. Inverted residuals and linear bottlenecks: Mobile networks for classification, detection and segmentation. CoRR abs/1801.04381 (2018).Google ScholarGoogle Scholar
  40. [40] Selvaraju Ramprasaath R., Das Abhishek, Vedantam Ramakrishna, Cogswell Michael, Parikh Devi, and Batra Dhruv. 2016. Grad-CAM: Why did you say that? Visual explanations from deep networks via gradient-based localization. CoRR abs/1610.02391 (2016).Google ScholarGoogle Scholar
  41. [41] Shorten Connor and Khoshgoftaar Taghi M.. 2019. A survey on image data augmentation for deep learning. J. Big Data 6, 1 (6 July2019), 60. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Simonyan Karen and Zisserman Andrew. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).Google ScholarGoogle Scholar
  43. [43] Szegedy Christian, Liu Wei, Jia Yangqing, Sermanet Pierre, Reed Scott E., Anguelov Dragomir, Erhan Dumitru, Vanhoucke Vincent, and Rabinovich Andrew. 2014. Going deeper with convolutions. CoRR abs/1409.4842 (2014).Google ScholarGoogle Scholar
  44. [44] Szegedy Christian, Vanhoucke Vincent, Ioffe Sergey, Shlens Jonathon, and Wojna Zbigniew. 2015. Rethinking the inception architecture for computer vision. CoRR abs/1512.00567 (2015).Google ScholarGoogle Scholar
  45. [45] Tan Mingxing and Le Quoc V.. 2021. EfficientNetV2: Smaller models and faster training. CoRR abs/2104.00298 (2021).Google ScholarGoogle Scholar
  46. [46] Team Keras. 2021. Keras Documentation: Keras Applications. Keras. Retrieved from https://keras.io/api/applications/.Google ScholarGoogle Scholar
  47. [47] Wang Niannian, Zhao Qingan, Li Shengyuan, Zhao Xuefeng, and Zhao Peng. 2018. Damage classification for masonry historic structures using convolutional neural networks based on still images. Comput.-aid. Civil Infrast. Eng. 33, 12 (2018), 10731089. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. [48] Wang Niannian, Zhao Xuefeng, Wang Linan, and Zou Zheng. 2019. Novel system for rapid investigation and damage detection in cultural heritage conservation based on deep learning. J. Infrast. Syst. 25, 3 (2019), 04019020. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  49. [49] Wang Niannian, Zhao Xuefeng, Zhao Peng, Zhang Yang, Zou Zheng, and Ou Jinping. 2019. Automatic damage detection of historic masonry buildings based on mobile deep learning. Autom. Construct. 103 (2019), 5366. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  50. [50] Wang Niannian, Zhao Xuefeng, Zou Zheng, Zhao Peng, and Qi Fei. 2020. Autonomous damage segmentation and measurement of glazed tiles in historic buildings via deep learning. Comput.-aid. Civil Infrast. Eng. 35, 3 (2020), 277291. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. [51] Xiang Qian, Wang Xiaodan, Li Rui, Zhang Guoling, Lai Jie, and Hu Qingshuang. 2019. Fruit image classification based on MobileNetV2 with transfer learning technique. In 3rd International Conference on Computer Science and Application Engineering (CSAE’19). Association for Computing Machinery, New York, NY. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. [52] Zeiler Matthew D. and Fergus Rob. 2014. Visualizing and understanding convolutional networks. In Computer Vision – ECCV 2014, Fleet David, Pajdla Tomas, Schiele Bernt, and Tuytelaars Tinne (Eds.). Springer International Publishing, Cham, 818833.Google ScholarGoogle Scholar
  53. [53] Zhang Xinglin, Yang Zheng, Sun Wei, Liu Yunhao, Tang Shaohua, Xing Kai, and Mao Xufei. 2016. Incentives for mobile crowd sensing: A survey. IEEE Commun. Surv. Tutor. 18, 1 (2016), 5467. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. [54] Zoph Barret, Vasudevan Vijay, Shlens Jonathon, and Le Quoc V.. 2017. Learning transferable architectures for scalable image recognition. CoRR abs/1707.07012 (2017).Google ScholarGoogle Scholar
  55. [55] Zorlu Kivanc. 2008. Description of the weathering states of building stones by fractal geometry and fuzzy inference system in the Olba ancient city (Southern Turkey). Eng. Geol. 101, 3 (2008), 124133. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  56. [56] Zou Zheng, Zhao Xuefeng, Zhao Peng, Qi Fei, and Wang Niannian. 2019. CNN-based statistics and location estimation of missing components in routine inspection of historic buildings. J. Cultur. Herit. 38 (2019), 221230. DOI:Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Surface Damage Identification for Heritage Site Protection: A Mobile Crowd-sensing Solution Based on Deep Learning

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        View Full Text

        HTML Format

        View this article in HTML Format .

        View HTML Format