Skip to main content

A CNN Approach for Audio Classification in Construction Sites

  • Chapter
  • First Online:

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 184))

Abstract

Convolutional Neural Networks (CNNs) have been widely used in the field of audio recognition and classification, since they often provide positive results. Motivated by the success of this kind of approach and the lack of practical methodologies for the monitoring of construction sites by using audio data, we developed an application for the classification of different types and brands of construction vehicles and tools, which operates on the emitted audio through a stack of convolutional layers. The proposed architecture works on the mel-spectrogram representation of the input audio frames and it demonstrates its effectiveness in environmental sound classification (ESC) achieving a high accuracy. In summary, our contribution shows that techniques employed for general ESC can be also successfully adapted to a more specific environmental sound classification task, such as event recognition in construction sites.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Available at: https://librosa.github.io/.

References

  1. Scardapane, S., Scarpiniti, M., Bucciarelli, M., Colone, F., Mansueto, M.V., Parisi, R.: Microphone array based classification for security monitoring in unstructured environments. AEÜ Int. J. Electron. Commun. 69(11), 1715–1723 (2015)

    Article  Google Scholar 

  2. Weinstein, E., Steele, K., Agarwal, A., Glass, J.: LOUD: a 1020-node modular micro-phone array and beamformer for intelligent computing spaces. Technical Report MIT/LCS Technical Memo MIT-LCS-TM-642 (2004)

    Google Scholar 

  3. Kaushik, B., Nance, D., Ahuja, K.K.: A review of the role of acoustic sensors in the modern battlefield. In: Proceedings of the 11th AIAA/CEAS Aeroacoustics Conference, pp. 1–13 (2005)

    Google Scholar 

  4. Wang, D., Brown, G.J.: Computational Auditory Scene Analysis: Principles, Algorithms, and Applications. Wiley-IEEE Press (2006)

    Google Scholar 

  5. Fu, Z., Lu, G., Ting, K.M., Zhang, D.: A survey of audio-based music classification and annotation. IEEE Trans. Multimed. 13(2), 303–319 (2011)

    Article  Google Scholar 

  6. Cheng, C.-F., Rashidi, A., Davenport, M.A., Anderson, D.V.: Activity analysis of construction equipment using audio signals and support vector machines. Autom. Constr. 81, 240–253 (2017)

    Article  Google Scholar 

  7. Zhang, T., Lee, Y.-C., Scarpiniti, M., Uncini, A.: A supervised machine learning-based sound identification for construction activity monitoring and performance evaluation. In: Proceedings of 2018 Construction Research Congress (CRC 2018), New Orleans, Louisiana, USA, pp. 358–366, 2–4 April 2018

    Google Scholar 

  8. Lee, Y.-C., Scarpiniti, M., Uncini, A.: Advanced sound identification classifiers using a grid search algorithm for accurate audio-based construction progress monitoring. J. Comput. Civil Eng. (2020)

    Google Scholar 

  9. Sherafat, B., Rashidi, A., Lee, Y.-C., Ahn, C.R.: A hybrid kinematic-acoustic system for automated activity detection of construction equipment. Sensors 19(19), 4286 (2019)

    Google Scholar 

  10. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press (2016)

    Google Scholar 

  11. Piczak, K.J.: Environmental sound classification with convolutional neural networks. In: 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1–6, Sept. 2015

    Google Scholar 

  12. Tokozume, Y., Harada, T.: Learning environmental sounds with end-to-end convolutional neural network. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2721–2725, March 2017

    Google Scholar 

  13. Xie, Y., Lee, Y.-C., Scarpiniti, M.: Deep Learning-Based Highway Construction and Maintenance Activities Monitoring in Night Time. Construction Research Congress (CRC 2020), Tempe, AZ, USA, 8–10 March 2020

    Google Scholar 

  14. Li, S., Yao, Y., Hu, J., Liu, G., Yao, X., Hu, J.: An ensemble stacked convolutional neural network model for environmental event sound recognition. Appl. Sci. 8(7) (2018)

    Google Scholar 

  15. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: ICLR (2015)

    Google Scholar 

  16. Stevens, S.S., Volkmann, J., Newman, E.B.: A scale for the measurement of the psychological magnitude pitch (1937)

    Google Scholar 

  17. Alpaydin, E.: Introduction to Machine Learning, 3rd edn. MIT Press (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michele Scarpiniti .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Maccagno, A., Mastropietro, A., Mazziotta, U., Scarpiniti, M., Lee, YC., Uncini, A. (2021). A CNN Approach for Audio Classification in Construction Sites. In: Esposito, A., Faundez-Zanuy, M., Morabito, F., Pasero, E. (eds) Progresses in Artificial Intelligence and Neural Systems. Smart Innovation, Systems and Technologies, vol 184. Springer, Singapore. https://doi.org/10.1007/978-981-15-5093-5_33

Download citation

Publish with us

Policies and ethics