Paper
15 June 2022 Research on audio recognition and processing based on MLP model
Chuxin Hang, Mandan Zhuang, Tongyuan Bai, Peng Yuan, Kang Sun
Author Affiliations +
Proceedings Volume 12285, International Conference on Advanced Algorithms and Neural Networks (AANN 2022); 1228504 (2022) https://doi.org/10.1117/12.2637054
Event: International Conference on Advanced Algorithms and Neural Networks (AANN 2022), 2022, Zhuhai, China
Abstract
This article is based on deep learning theory and big data technology to build a model on how to analyse massive amounts of audio data and use it to provide better services. Firstly, the spectrograms and waveforms are visualised to initially analyse the audio features. Then, the MFCC and Chroma features of audio were extracted respectively, and the MLP model was built to classify the two features and trained separately. In order to make the audio recognition technique highly efficient, this paper also adopts the non-negative matrix decomposition method (NMF) to enhance the audio data, which makes the differentiation between different audio data more significant, and the accuracy of the MLP model built based on the reconstructed new audio data finally reaches 89.12%.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Chuxin Hang, Mandan Zhuang, Tongyuan Bai, Peng Yuan, and Kang Sun "Research on audio recognition and processing based on MLP model", Proc. SPIE 12285, International Conference on Advanced Algorithms and Neural Networks (AANN 2022), 1228504 (15 June 2022); https://doi.org/10.1117/12.2637054
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Feature extraction

Signal processing

Neural networks

Process modeling

Visualization

Back to Top