Skip to main content

Environmental Noise Analysis for Robust Automatic Speech Recognition

  • Conference paper
  • First Online:
Advanced Computer and Communication Engineering Technology

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 315))

  • 2783 Accesses

Abstract

Most of the speech communication applications viz. telephony, hands-free communication, voice recording, automatic speech recognition, interactive voice response system, human-machine interfaces, etc. that require at least one microphone, desired speech signal is usually contaminated by background noise and reverberation. As a result, the speech signal has to be “cleaned” with digital signal processing tools before it is played out, transmitted, or stored. The noise estimation and reduction techniques will help to clean and attenuate the noise component in speech data, known as Speech Enhancement. In this paper, we recorded the speech in different environmental conditions and estimated the noise signal/background noise distribution in speech. Now the speech is enhanced by using the compliment of Weiner-Hopf optimal filter. And this enhanced speech signal is given for training and testing the Automatic Speech Recognition (ASR) system, which will improve the word accuracy. The analysis of speech and results presented in this paper are produced using MATLAB.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Veisi, H., Sameti, H.: Hidden-Markov-model-based voice activity detector with high speech detection rate for speech enhancement. IET Sig. Process. 6(1), 54–63 (2012)

    Google Scholar 

  2. Kashiwagi, Y., Suzuki, M., Minematsu, N., Hirose, K.: Audio-Visual feature interation based on piecewise linear transformation for noise robust automatic speech recognition. In: ICASSP, pp. 149–152. IEEE (2010)

    Google Scholar 

  3. Ghaemmaghami, H., Dean, D., Sridharan, S., McCown, I.: Noise robust voice activity detection using normal testing and time-domain histogram analysis. In: ICASSP, pp. 4470–4473. IEEE (2010)

    Google Scholar 

  4. Dhananjaya, N., Yegnanarayana, B., Senior Member, IEEE: Voiced/nonvoiced detection based on robustness of voiced epochs. IEEE Sig. Process. Lett. 17(3) (2010)

    Google Scholar 

  5. Benesty, J., Chen, J., Huang, Y., Cohen, I.: Noise reduction in speech processing. Springer, Heidelberg (2009)

    Google Scholar 

  6. Kim, D.Y., Un, C.K., Kim. N.S.: Speech recognition in noisy environments using first-order vector Taylor series. Speech Commun. 24, 39–49 (1998)

    Google Scholar 

  7. Ramirez, J., Yelamos, P., Gorriz, J.M., Segura, J.C.: SVM-based speech endpoint detection using contextual speech features. Electron. Lett. 42(7), 426–428 (2006)

    Google Scholar 

  8. Sebastian Seung, H.: Wiener-Hopf equations. Convolution and correlation in continuous time, 9.29 Lecture 3: February 11, 2003 (2003)

    Google Scholar 

  9. Verma, A.R., Singh, R.K., Kumar, A., Ranjeet, K.: An improved method for speech enhancement based on 2D-DWT using hybrid weiner filtering. In: 2012 IEEE International Conference on Computational Intelligence and Computing Research (2012)

    Google Scholar 

  10. Samudravijaya, K., Barot, M.: A comparison of public domain software tools for speech recognition. Workshop on spoken language processing, pp. 125–131 (2003)

    Google Scholar 

  11. Fukane, A.R., Sahare, S.L.: Role of noise estimation in enhancement of noisy speech signals for hearing aids. In: Computational Intelligence and Communication Networks (CICN), pp.648–652. IEEE (2011)

    Google Scholar 

  12. Ling, G., Yamada, T., Makino, S., Kitawaki, N.: Performance estimation of noisy speech recognition using spectral distortion and snr of noise-reduced speech. In: TENCON 2013, IEEE (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to M. Rao Venkata .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Kishore, N.S.B., Venkata, M.R., Nagamani, M. (2015). Environmental Noise Analysis for Robust Automatic Speech Recognition. In: Sulaiman, H., Othman, M., Othman, M., Rahim, Y., Pee, N. (eds) Advanced Computer and Communication Engineering Technology. Lecture Notes in Electrical Engineering, vol 315. Springer, Cham. https://doi.org/10.1007/978-3-319-07674-4_75

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-07674-4_75

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07673-7

  • Online ISBN: 978-3-319-07674-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics