skip to main content
10.1145/1460096.1460117acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Searching musical audio datasets by a batch of multi-variant tracks

Authors Info & Claims
Published:30 October 2008Publication History

ABSTRACT

Multi-variant music tracks are those audio tracks of a particular song which are sung and recorded by different people (i.e., cover songs). As music social clubs grow on the Internet, more and more people like to upload music recordings onto such music social sites to share their own home-produced albums and participate in Internet singing contests. Therefore it is very important to explore a computer-assisted evaluation tool to detect these audio-based multi-variant tracks. In this paper we investigate such a task: the original track of a song is embedded in datasets, with a batch of multi-variant audio tracks of this song as input, our retrieval system returns an ordered list by similarity and indicates the position of relevant audio track. To help process multi-variant audio tracks, we suggest a semantic indexing framework and propose the Federated Features (FF) scheme to generate the semantic summarization of audio feature sequences. The conjunction of federated features with three typical similarity searching schemes, K-Nearest Neighbor (KNN), Locality Sensitive Hashing (LSH), and Exact Euclidian LSH (E2LSH), is evaluated. From these findings, a computer-assisted evaluation tool for searching multi-variant audio tracks was developed to search over large musical audio datasets.

References

  1. J. S. Downie. The Music Information Retrieval Evaluation eXchange (MIREX). In D-Lib Magazine 12, 2006. http://dlib.org/dlib/december06/downie/12downie.html.Google ScholarGoogle Scholar
  2. J. P. Bello. Audio-based Cover Song Retrieval Using Approximate Chord Sequences: Testing Shifts, Gaps, Swaps and Beats. ISMIR'07, pp.239--244, 2007.Google ScholarGoogle Scholar
  3. D. Ellis and G. Poliner. Identifying cover songs with chroma features and dynamic programming beat tracking. ICASSP'07, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  4. Y. Yu, K. Joe, and J. S. Downie. Efficient Query-by- Content Audio Retrieval by Locality Sensitive Hashing and Partial Sequence Comparison. IEICE Transaction on Information and System, Vol.E91-D, No.6, pp. 1730--1739, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Yu, J. S. Downie, and K. Joe. An Evaluation of Feature Extraction for Query-by-Content Audio Information Retrieval. Ninth IEEE International Symposium on Multimedia Workshops (ISMW), pp. 297--302, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Y. Yu, M. Takata, and K. Joe. Index-Based Similarity Searching with Partial Sequence Comparison for Query-by-Content Audio Retrieval. Workshop on Learning Semantics of Audio Signals (LSAS'06), pp.76--86, 2006.Google ScholarGoogle Scholar
  7. F. Moerchen, I. Mierswa, and A. Ultsch. Understandable Models of Music Collection based on Exhaustive Feature Generation with Temporal Statistics. KDD'06, pp.882--891, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Yang. Efficient Acoustic Index for Music Retrieval with Various Degrees of Similarity. ACM Multimedia, pp. 584--591, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. B. Cui, J. L. Shen, G. Cong, H. T. Shen, and C. Yu. Exploring Composite Acoustic Features for Efficient Music Similarity Query. ACM MM'06, pp.634--642, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. T. Pohle, M. Schedl, P. Knees, and G. Widmer. Automatically Adapting the Structure of Audio Similarity Spaces. Workshop on Learning Semantics of Audio Signals (LSAS'06), pp. 66--75, 2006.Google ScholarGoogle Scholar
  11. LSH Algorithm and Implementation (E2LSH) http://web.mit.edu/andoni/www/LSH/index.html.Google ScholarGoogle Scholar
  12. P. Indyk and N. Thaper. Fast color image retrieval via embeddings. Workshop on Statistical and Computational Theories of Vision (ICCV), 2003.Google ScholarGoogle Scholar
  13. S. Y. Hu. Efficient Video Retrieval by Locality Sensitive Hashing. ICASSP'05, pp.449--452, 2005.Google ScholarGoogle Scholar
  14. J. Reiss, J. J. Aucouturier, and M. Sandler. Efficient multi dimensional searching routines for music information retrieval. ISMIR'01, 2001.Google ScholarGoogle Scholar
  15. I. Karydis, A. Nanopoulos, A. N. Papadopoulos and Y. Manolopoulos. Audio Indexing for Efficient Music Information Retrieval. MMM'05, pp. 22--29, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Casey and M. Slaney. Song Intersection by Approximate Nearest Neighbor Search. ISMIR'06, pp. 144--149, 2006.Google ScholarGoogle Scholar
  17. M. Lesaffre and M. Leman. Using Fuzzy to Handle Semantic Descriptions of Music in a Content-based Retrieval System. Workshop on Learning Semantics of Audio Signals (LSAS'06), pp.43--5, 2006.Google ScholarGoogle Scholar
  18. G. Tzanetakis and P. Cook. Musical Genre Classification of Audio Signals. IEEE Transactions on Speech and Audio Processing, Vol.10, No.5, pp. 293--302, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  19. R. Miotto and N. Orio. A Methodology for the Segmentation and Identification of Music Works. ISMIR'07, pp.239--244, 2007.Google ScholarGoogle Scholar
  20. L. Rabiner and B.-H. Juang. Fundamentals of Speech Recognition. Prentice-Hall, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Searching musical audio datasets by a batch of multi-variant tracks

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        MIR '08: Proceedings of the 1st ACM international conference on Multimedia information retrieval
        October 2008
        506 pages
        ISBN:9781605583129
        DOI:10.1145/1460096

        Copyright © 2008 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 30 October 2008

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Upcoming Conference

        MM '24
        MM '24: The 32nd ACM International Conference on Multimedia
        October 28 - November 1, 2024
        Melbourne , VIC , Australia

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader