ISCA Archive Interspeech 2004
ISCA Archive Interspeech 2004

Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs

Akinobu Lee, Keisuke Nakamura, Ryuichi Nisimura, Hiroshi Saruwatari, Kiyohiro Shikano

To realize a robust spoken dialogue system for use in a real environment, the robust rejection of unintended inputs such as laughter, coughing, background speech and other noise based on GMM is implemented and examined on the basis of actual utterances. All the triggered inputs to a speech-oriented guidance system from 125 days of field tests in a public space are collected, and the occurrence of unintended inputs is investigated. GMM classifiers for voice categories (adult speech and child speech) and non-voice categories (laughter, coughing and other noises) are trained on the basis of the analysis result. The rejection performance of unintended speech was experimented on actual uncontrolled real inputs, and an EER of 3.32% was achieved by the 5-class GMM, which outperforms simple 2-class (voice / non-voice) GMM. The rejection of background speech using GMM is also investigated.


doi: 10.21437/Interspeech.2004-111

Cite as: Lee, A., Nakamura, K., Nisimura, R., Saruwatari, H., Shikano, K. (2004) Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs. Proc. Interspeech 2004, 173-176, doi: 10.21437/Interspeech.2004-111

@inproceedings{lee04f_interspeech,
  author={Akinobu Lee and Keisuke Nakamura and Ryuichi Nisimura and Hiroshi Saruwatari and Kiyohiro Shikano},
  title={{Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs}},
  year=2004,
  booktitle={Proc. Interspeech 2004},
  pages={173--176},
  doi={10.21437/Interspeech.2004-111}
}