Speech enhancement by online non-negative spectrogram decomposition in nonstationary noise environments

Duan, Zhiyao; Mysore, Gautham J.; Smaragdis, Paris

doi:10.21437/Interspeech.2012-181

Speech enhancement by online non-negative spectrogram decomposition in nonstationary noise environments

Zhiyao Duan, Gautham J. Mysore, Paris Smaragdis

Classical single-channel speech enhancement algorithms have two convenient properties: they require pre-learning the noise model but not the speech model, and they work online. How- ever, they often have difficulties in dealing with non-stationary noise sources. Source separation algorithms based on non- negative spectrogram decompositions are capable of dealing with non-stationary noise, but do not possess the aforemen- tioned properties. In this paper we present a novel algorithm that combines the advantages of both classical algorithms and nonnegative spectrogram decomposition algorithms. Experi- ments show that it significantly outperforms four categories of classical algorithms in non-stationary noise environments.

doi: 10.21437/Interspeech.2012-181

Cite as: Duan, Z., Mysore, G.J., Smaragdis, P. (2012) Speech enhancement by online non-negative spectrogram decomposition in nonstationary noise environments. Proc. Interspeech 2012, 595-598, doi: 10.21437/Interspeech.2012-181

@inproceedings{duan12_interspeech,
  author={Zhiyao Duan and Gautham J. Mysore and Paris Smaragdis},
  title={{Speech enhancement by online non-negative spectrogram decomposition in nonstationary noise environments}},
  year=2012,
  booktitle={Proc. Interspeech 2012},
  pages={595--598},
  doi={10.21437/Interspeech.2012-181}
}