DOI: 10.1145/1622176.1622193
research-article

User guided audio selection from complex sound mixtures

Published: 04 October 2009

ABSTRACT

In this paper we present a novel interface for selecting sounds in audio mixtures. Traditional interfaces in audio editors provide a graphical representation of sounds, either as a waveform or as some variation of a time/frequency transform. Although these representations may let a user visually identify elements of sounds in a mixture, they do not facilitate object-specific editing (e.g., selecting only the voice of a singer in a song). Our interface instead uses audio guidance from the user to select a target sound within a mixture. The user is asked to vocalize (or otherwise sonically represent) the desired target sound, and an automatic process identifies and isolates the elements of the mixture that best relate to the user's input. This way of pointing to specific parts of an audio stream allows a user to perform audio selections that would otherwise be infeasible.


Supplemental Material

p89-smaragdis.mp4 (mp4, 7.4 MB)


Published in

UIST '09: Proceedings of the 22nd annual ACM symposium on User interface software and technology
October 2009
278 pages
ISBN: 9781605587455
DOI: 10.1145/1622176

Copyright © 2009 ACM


Publisher

Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Overall Acceptance Rate: 842 of 3,967 submissions, 21%
