Abstract
In this paper, we propose a novel method for blind source separation (BSS) based on time-frequency sparseness (TF) that can estimate the number of sources and time-frequency masks, even if the spatial aliasing problem exists. Many previous approaches, such as degenerate unmixing estimation technique (DUET) or observation vector clustering (OVC), are limited to microphone arrays of small spatial extent to avoid spatial aliasing. We develop an offline and an online algorithm that can both deal with spatial aliasing by directly comparing observed and model phase differences using a distance metric that incorporates the phase indeterminacy of 2π and considering all frequency bins simultaneously. Separation is achieved using a linear blind beamformer approach, hence musical noise common to binary masking is avoided. Furthermore, the offline algorithm can estimate the number of sources. Both algorithms are evaluated in simulations and real-world scenarios and show good separation performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cermak, J., Araki, S., Sawada, H., Makino, S.: Blind source separation based on a beamformer array and time frequency binary masking. In: Proc. ICASSP (2007)
Araki, S., Sawada, H., Mukay, R., Makino, S.: Underdetermined blind sparse source separation of arbitrarily arranged multiple sensors. Signal Processing 87(8), 1833–1847 (2007)
Sawada, H., Araki, S., Mukay, R., Makino, S.: Grouping separated frequency components by estimating propagation model parameters in frequency-domain blind source separation. IEEE Transactions on Audio, Speech and Language Processing 15(5), 1592–1604 (2007)
Araki, S., Nakatani, T., Sawada, H., Makino, S.: Stereo source separation and source counting with MAP estimation with Dirichlet prior considering spatial aliasing problem. In: Adali, T., Jutten, C., Romano, J.M.T., Barros, A.K. (eds.) ICA 2009. LNCS, vol. 5441, pp. 742–750. Springer, Heidelberg (2009)
Nesta, F., Omologo, M., Svaizer, P.: A novel robust solution to the permutation problem based on a joint multiple TDOA estimation. In: Proc. International Workshop for Acoustic Echo and Noise Control, IWAENC (2008)
Chami, Z.E., Guerin, A., Pham, A., Serviere, C.: A phase-based dual microphone method to count and locate audio sources in reverberant rooms. In: Proc. IEEE Workshop on Applications of Signal processing to Audio and Acoustics, WASPAA (2009)
Rickard, S., Balan, R., Rosca, J.: Real-time time-frequency based blind source separation. In: Proc. ICA (2001)
Loesch, B., Yang, B.: Online blind source separation based on time-frequency sparseness. In: Proc. ICASSP (2009)
Cummins, F., Grimaldi, M., Leonard, T., Simko, J.: The CHAINS corpus (characterizing individual speakers) (2006), http://chains.ucd.ie/
Real World Computing Partnership, RWCP Sound Scene Database in Real Acoustic Environment (2001), http://tosa.mri.co.jp/sounddb/indexe.htm
Lehmann, E.: Image-source method for room impulse response simulation, room acoustics (2008), http://www.watri.org.au/~ericl/ism_code.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Loesch, B., Yang, B. (2010). Blind Source Separation Based on Time-Frequency Sparseness in the Presence of Spatial Aliasing. In: Vigneron, V., Zarzoso, V., Moreau, E., Gribonval, R., Vincent, E. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2010. Lecture Notes in Computer Science, vol 6365. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15995-4_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-15995-4_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15994-7
Online ISBN: 978-3-642-15995-4
eBook Packages: Computer ScienceComputer Science (R0)