ISCA Archive Interspeech 2007

Evaluation of real-time voice activity detection based on high order statistics

David Cournapeau, Tatsuya Kawahara

We have proposed a method for real-time, unsupervised voice activity detection (VAD). This paper addresses the problems of feature selection and the classification scheme. The feature is based on High Order Statistics (HOS), used to discriminate between close-talk and far-field speech, and is enhanced by a feature derived from the normalized autocorrelation. The comparative effectiveness of several HOS is shown. Classification is performed in real time with a recursive, online EM algorithm. The algorithm is evaluated on the CENSREC-1-C database, which is used for VAD evaluation for automatic speech recognition (ASR) [1], and the proposed method is confirmed to significantly outperform the baseline energy-based method.
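
As a rough illustration of the two kinds of per-frame quantities the abstract mentions, the sketch below computes an excess kurtosis (one common fourth-order statistic) and the peak of the normalized autocorrelation of a single frame. It is not the authors' feature: the abstract does not specify which statistics are used, on which signal they are computed, or how they are combined, so the function names `kurtosis` and `autocorr_peak`, the lag range, and the choice of excess kurtosis are all assumptions made for this example.

```python
import math


def kurtosis(frame):
    """Excess kurtosis of one frame (assumed here as an example HOS)."""
    n = len(frame)
    mean = sum(frame) / n
    centered = [x - mean for x in frame]
    var = sum(c * c for c in centered) / n
    if var == 0.0:
        return 0.0  # constant frame: no meaningful fourth-order statistic
    m4 = sum(c ** 4 for c in centered) / n
    return m4 / (var * var) - 3.0


def autocorr_peak(frame, min_lag, max_lag):
    """Largest normalized autocorrelation over lags min_lag..max_lag.

    The lag range is a hypothetical parameter; each value is divided by
    the zero-lag energy, so results lie in [-1, 1].
    """
    n = len(frame)
    mean = sum(frame) / n
    centered = [x - mean for x in frame]
    energy = sum(c * c for c in centered)
    if energy == 0.0:
        return 0.0
    best = -1.0
    for lag in range(min_lag, min(max_lag, n - 1) + 1):
        r = sum(centered[i] * centered[i + lag] for i in range(n - lag))
        best = max(best, r / energy)
    return best


# A periodic test frame: 400 samples of a sine with a 40-sample period.
sine = [math.sin(2 * math.pi * i / 40) for i in range(400)]
print(kurtosis(sine))
print(autocorr_peak(sine, 20, 100))
```

A strongly periodic frame such as the sine above gives a high autocorrelation peak at its period, while an impulsive frame gives a large positive kurtosis. In a detector these values would be passed on to a classifier, which in the paper is a recursive, online EM algorithm and is not sketched here.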


doi: 10.21437/Interspeech.2007-734

Cite as: Cournapeau, D., Kawahara, T. (2007) Evaluation of real-time voice activity detection based on high order statistics. Proc. Interspeech 2007, 2945-2948, doi: 10.21437/Interspeech.2007-734

@inproceedings{cournapeau07_interspeech,
  author={David Cournapeau and Tatsuya Kawahara},
  title={{Evaluation of real-time voice activity detection based on high order statistics}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={2945--2948},
  doi={10.21437/Interspeech.2007-734}
}