Person identification based on voice biometric using deep neural network

AL-Shakarchy, Noor D.; Obayes, Hadab Khalid; Abdullah, Zahraa Najm

doi:10.1007/s41870-022-01142-1

Original Research
Published: 28 December 2022

Volume 15, pages 789–795, (2023)
International Journal of Information Technology

Noor D. AL-Shakarchy ORCID: orcid.org/0000-0001-9416-9680¹,
Hadab Khalid Obayes² &
Zahraa Najm Abdullah¹

Abstract

Everyday transactions increasingly depend on electronic applications for financial and banking transfers, health care, project management, and other crucial aspects of life. At the core of these applications are person identification and/or verification steps, which are among their most difficult parts. Biometric attributes can therefore yield promising outcomes in these fields. A person’s voice is a unique biometric feature by which people can be authenticated and which prevents others from assuming a person’s identity without their prior knowledge or consent. This work proposes a deep learning model with a new architecture that identifies a person by exploiting the unique individual characteristics of their voice. An augmentation method is used to increase the number of samples in the available dataset. The temporal information in an input audio file is analysed, and feature maps representing the salient temporal (time-domain) features are extracted from it. The decision is made by tracking these voice features over time. The results are promising: on the VoxCeleb1 dataset, for identifying 40 subjects, the accuracy is close to 99.81% (± 1.78%) and the loss is close to 0.009.
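The abstract does not say which augmentation method is used. As a rough illustration only, the sketch below shows one common way to enlarge a set of raw waveforms in the time domain: a random circular time shift followed by additive Gaussian noise at a chosen signal-to-noise ratio. The function names, the `snr_db` and `max_shift` parameters, and their default values are assumptions made for this example, not details taken from the article.

```python
import numpy as np


def augment_waveform(wave, rng, snr_db=20.0, max_shift=1600):
    """Return a time-shifted, noisy copy of a 1-D waveform (illustrative only)."""
    wave = np.asarray(wave, dtype=np.float64)
    # Circular shift by a random number of samples in [-max_shift, max_shift].
    shift = int(rng.integers(-max_shift, max_shift + 1))
    shifted = np.roll(wave, shift)
    # White Gaussian noise scaled so that signal power / noise power matches snr_db.
    signal_power = np.mean(shifted ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=shifted.shape)
    return shifted + noise


def augment_dataset(waves, labels, copies=2, seed=0):
    """Keep every original waveform and append `copies` augmented versions of each."""
    rng = np.random.default_rng(seed)
    out_waves, out_labels = list(waves), list(labels)
    for wave, label in zip(waves, labels):
        for _ in range(copies):
            out_waves.append(augment_waveform(wave, rng))
            out_labels.append(label)  # augmentation never changes the speaker label
    return out_waves, out_labels
```

With `copies=2`, a set of N labelled utterances becomes 3N, each augmented copy keeping the speaker label of the utterance it was derived from. Other transformations (pitch or speed perturbation, for instance) could be swapped in under the same structure.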

Author information

Authors and Affiliations

Department of Computer Science, Faculty of Computer Science and Information Technology, Kerbala University, Kerbala, Iraq
Noor D. AL-Shakarchy & Zahraa Najm Abdullah
College of Education for Humanities Studies, University of Babylon, Babylon, Iraq
Hadab Khalid Obayes

Corresponding author

Correspondence to Noor D. AL-Shakarchy.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

AL-Shakarchy, N.D., Obayes, H. & Abdullah, Z.N. Person identification based on voice biometric using deep neural network. Int. j. inf. tecnol. 15, 789–795 (2023). https://doi.org/10.1007/s41870-022-01142-1

Received: 06 July 2022
Accepted: 13 December 2022
Published: 28 December 2022
Issue Date: February 2023
DOI: https://doi.org/10.1007/s41870-022-01142-1
