research-article

Searching musical audio datasets by a batch of multi-variant tracks

Authors:
Yi Yu (Nara Women's University, Nara, Japan)
J. Stephen Downie (University of Illinois at Urbana-Champaign, Champaign, IL, USA)
Lei Chen (Hong Kong University of Science and Technology, Hong Kong, China)
Vincent Oria (New Jersey Institute of Technology, Newark, NJ, USA)
Kazuki Joe (Nara Women's University, Nara, Japan)

MIR '08: Proceedings of the 1st ACM international conference on Multimedia information retrieval, October 2008, Pages 121–127
https://doi.org/10.1145/1460096.1460117

Published: 30 October 2008

ABSTRACT

Multi-variant music tracks are audio tracks of a particular song that are sung and recorded by different people (i.e., cover songs). As music social clubs grow on the Internet, more and more people upload recordings to such sites to share their home-produced albums and take part in Internet singing contests. A computer-assisted evaluation tool that detects these audio-based multi-variant tracks is therefore important. In this paper we investigate the following task: the original track of a song is embedded in a dataset; given a batch of multi-variant audio tracks of that song as input, our retrieval system returns a list ordered by similarity and indicates the position of the relevant audio track. To help process multi-variant audio tracks, we suggest a semantic indexing framework and propose the Federated Features (FF) scheme to generate a semantic summarization of audio feature sequences. We evaluate federated features in combination with three typical similarity searching schemes: K-Nearest Neighbor (KNN), Locality Sensitive Hashing (LSH), and Exact Euclidean LSH (E²LSH). Based on these findings, we developed a computer-assisted evaluation tool for searching multi-variant audio tracks over large musical audio datasets.
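The retrieval task described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not specify how Federated Features are computed, so `summarize` below uses a per-dimension mean and standard deviation purely as a stand-in, and all function names, the toy data, and the hash parameters (`k`, `w`) are assumptions made for illustration. The sketch shows an exact nearest-neighbor ranking, an E²LSH-style hash of the form floor((a·v + b) / w), and the reporting of the original track's position for each variant in a batch.

```python
import math
import random

def summarize(frames):
    """Collapse a sequence of frame-level feature vectors into one fixed-length
    vector (per-dimension mean, then standard deviation).
    Stand-in only: this is NOT the paper's Federated Features scheme."""
    n, dims = len(frames), len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dims)]
    stds = [math.sqrt(sum((f[d] - means[d]) ** 2 for f in frames) / n)
            for d in range(dims)]
    return means + stds

def euclidean(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def knn_rank(query, dataset):
    """Exact search: order every track id by distance to the query summary."""
    return sorted(dataset, key=lambda tid: euclidean(query, dataset[tid]))

def make_hash(dims, k, w, rng):
    """One E2LSH-style hash: k quantized random projections floor((a.v + b) / w)."""
    params = [([rng.gauss(0, 1) for _ in range(dims)], rng.uniform(0, w))
              for _ in range(k)]
    def h(v):
        return tuple(
            math.floor((sum(a_i * v_i for a_i, v_i in zip(a, v)) + b) / w)
            for a, b in params)
    return h

def lsh_rank(query, dataset, hashes):
    """Approximate search: rank only tracks that share a bucket with the
    query under at least one hash function."""
    candidates = {tid for tid, vec in dataset.items()
                  if any(h(vec) == h(query) for h in hashes)}
    return sorted(candidates, key=lambda tid: euclidean(query, dataset[tid]))

def batch_positions(variants, dataset, target_id):
    """For each variant in the batch, the 1-based rank of the original track."""
    return [knn_rank(summarize(v), dataset).index(target_id) + 1
            for v in variants]

# Toy data (hypothetical): two-dimensional frame features.
orig_frames = [[0.0, 1.0], [1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
other_frames = [[5.0, 5.0], [6.0, 4.0], [5.0, 5.0], [6.0, 4.0]]
variant1 = [[0.1, 0.9], [0.9, 0.2], [0.0, 1.1], [1.1, 0.0]]
variant2 = [[0.2, 1.0], [1.0, 0.1], [0.1, 0.8], [0.8, 0.1]]

dataset = {"original": summarize(orig_frames), "other": summarize(other_frames)}
positions = batch_positions([variant1, variant2], dataset, "original")

rng = random.Random(0)
hashes = [make_hash(dims=4, k=2, w=4.0, rng=rng) for _ in range(3)]
approx = lsh_rank(dataset["original"], dataset, hashes)
```

The exact ranking compares the query with every track, whereas the hash-based ranking compares it only with tracks that fall into the same bucket, which trades some recall for speed on large datasets.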


Index Terms

Applied computing → Arts and humanities → Sound and music computing
Information systems → Information retrieval → Specialized information retrieval → Multimedia and multimodal retrieval → Music retrieval


Published in
MIR '08: Proceedings of the 1st ACM international conference on Multimedia information retrieval
October 2008
506 pages
ISBN:9781605583129
DOI:10.1145/1460096
General Chair: Michael S. Lew (Leiden University, The Netherlands)
Program Chairs: Alberto del Bimbo (University of Florence, Italy), Erwin M. Bakker (Leiden University, The Netherlands)
Copyright © 2008 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
content-based audio retrieval
cover songs
hash-based indexing
musical audio sequences summarization