ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Bandwidth expansion of narrowband speech using non-negative matrix factorization

Dhananjay Bansal, Bhiksha Raj, Paris Smaragdis

In this paper, we present a novel technique for the estimation of the high frequency components (4-8kHz) of speech signals from narrow-band (0-4 kHz) signals using convolutive Non-Negative Matrix Factorisation (NMF). The proposed technique utilizes a brief recording of simultaneous broad band and narrow band signals from a target speaker to learn a set of broad-band non-negative "bases" for the speaker. The low-frequency components of these bases are used to determine how the high-frequency components must be combined in order to reconstruct the high-frequency components of new narrow-band signals from the speaker. Experiments reveal that the technique is able to reconstruct broadband speech that is perceptually virtually indistinguishable from true broadband recordings.


doi: 10.21437/Interspeech.2005-528

Cite as: Bansal, D., Raj, B., Smaragdis, P. (2005) Bandwidth expansion of narrowband speech using non-negative matrix factorization. Proc. Interspeech 2005, 1505-1508, doi: 10.21437/Interspeech.2005-528

@inproceedings{bansal05_interspeech,
  author={Dhananjay Bansal and Bhiksha Raj and Paris Smaragdis},
  title={{Bandwidth expansion of narrowband speech using non-negative matrix factorization}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={1505--1508},
  doi={10.21437/Interspeech.2005-528}
}