Blind Source Separation Based on Time-Frequency Sparseness in the Presence of Spatial Aliasing

Loesch, Benedikt; Yang, Bin

doi:10.1007/978-3-642-15995-4_1

Benedikt Loesch²¹ &
Bin Yang²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6365))

Included in the following conference series:

International Conference on Latent Variable Analysis and Signal Separation

3234 Accesses
16 Citations

Abstract

In this paper, we propose a novel method for blind source separation (BSS) based on time-frequency sparseness (TF) that can estimate the number of sources and time-frequency masks, even if the spatial aliasing problem exists. Many previous approaches, such as degenerate unmixing estimation technique (DUET) or observation vector clustering (OVC), are limited to microphone arrays of small spatial extent to avoid spatial aliasing. We develop an offline and an online algorithm that can both deal with spatial aliasing by directly comparing observed and model phase differences using a distance metric that incorporates the phase indeterminacy of 2π and considering all frequency bins simultaneously. Separation is achieved using a linear blind beamformer approach, hence musical noise common to binary masking is avoided. Furthermore, the offline algorithm can estimate the number of sources. Both algorithms are evaluated in simulations and real-world scenarios and show good separation performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cermak, J., Araki, S., Sawada, H., Makino, S.: Blind source separation based on a beamformer array and time frequency binary masking. In: Proc. ICASSP (2007)
Google Scholar
Araki, S., Sawada, H., Mukay, R., Makino, S.: Underdetermined blind sparse source separation of arbitrarily arranged multiple sensors. Signal Processing 87(8), 1833–1847 (2007)
Article MATH Google Scholar
Sawada, H., Araki, S., Mukay, R., Makino, S.: Grouping separated frequency components by estimating propagation model parameters in frequency-domain blind source separation. IEEE Transactions on Audio, Speech and Language Processing 15(5), 1592–1604 (2007)
Article Google Scholar
Araki, S., Nakatani, T., Sawada, H., Makino, S.: Stereo source separation and source counting with MAP estimation with Dirichlet prior considering spatial aliasing problem. In: Adali, T., Jutten, C., Romano, J.M.T., Barros, A.K. (eds.) ICA 2009. LNCS, vol. 5441, pp. 742–750. Springer, Heidelberg (2009)
Chapter Google Scholar
Nesta, F., Omologo, M., Svaizer, P.: A novel robust solution to the permutation problem based on a joint multiple TDOA estimation. In: Proc. International Workshop for Acoustic Echo and Noise Control, IWAENC (2008)
Google Scholar
Chami, Z.E., Guerin, A., Pham, A., Serviere, C.: A phase-based dual microphone method to count and locate audio sources in reverberant rooms. In: Proc. IEEE Workshop on Applications of Signal processing to Audio and Acoustics, WASPAA (2009)
Google Scholar
Rickard, S., Balan, R., Rosca, J.: Real-time time-frequency based blind source separation. In: Proc. ICA (2001)
Google Scholar
Loesch, B., Yang, B.: Online blind source separation based on time-frequency sparseness. In: Proc. ICASSP (2009)
Google Scholar
Cummins, F., Grimaldi, M., Leonard, T., Simko, J.: The CHAINS corpus (characterizing individual speakers) (2006), http://chains.ucd.ie/
Real World Computing Partnership, RWCP Sound Scene Database in Real Acoustic Environment (2001), http://tosa.mri.co.jp/sounddb/indexe.htm
Lehmann, E.: Image-source method for room impulse response simulation, room acoustics (2008), http://www.watri.org.au/~ericl/ism_code.html

Download references

Author information

Authors and Affiliations

Chair of System Theory and Signal Processing, University of Stuttgart,
Benedikt Loesch & Bin Yang

Authors

Benedikt Loesch
View author publications
You can also search for this author in PubMed Google Scholar
Bin Yang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Electrical Engineering, Universitè d’Evry Val d’Essone, 40 rue du Pelvoux, 91020, Courcouronnes, France
Vincent Vigneron
Laboratoire I3S, Les Algorithmes - Euclide-B, BP 121, Université de Nice-Sophia Antipolis, 2000 Route des Lucioles, 06903, Sophia Antipolis Cedex, France
Vicente Zarzoso
School of Engineering, Dept. of Telecommunications, ISITSchool of Engineering, Dept. of Telecommunications, ISITV, Université de Toulon, Avenue George Pompidou, BP 56, La Valette du Var, Cedex, 83162, France
Eric Moreau
INRIA France, Equipe-projet METISS, Centre de Recherche INRIA Rennes-Bretagne Atlantique, Campus de Beaulieu, 35042, Rennes cedex, France
Rémi Gribonval
INRIA France, Equipe-projet METISS, Centre de Recherche INRIA Rennes-Bretagne Atlantique, Campus de Beaulieu, 35042, Rennes Cedex, France
Emmanuel Vincent

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Loesch, B., Yang, B. (2010). Blind Source Separation Based on Time-Frequency Sparseness in the Presence of Spatial Aliasing. In: Vigneron, V., Zarzoso, V., Moreau, E., Gribonval, R., Vincent, E. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2010. Lecture Notes in Computer Science, vol 6365. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15995-4_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-15995-4_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15994-7
Online ISBN: 978-3-642-15995-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics