Raga Classification From Vocal Performances Using Multimodal Analysis
Description
Work on musical gesture and embodied cognition suggests a rich complementarity between audio and movement information in musical performance. Pose estimation algorithms now make it possible (in contrast to motion capture) to collect rich movement information from unconstrained performances of indefinite length. Vocal performances of Indian art music offer the opportunity to carry out multimodal analysis using this information, combining musicians' body movements (i.e. pose and gesture data) with audio features. In this work we investigate raga identification from 12 s excerpts of a dataset of 3 singers and 9 ragas, using a combination of audio and visual representations that are each semantically salient on their own. While gesture-based classification is relatively weak by itself, we show that combining latent representations from the pre-trained unimodal networks can surpass the already high performance obtained with audio features alone.
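The fusion step described above can be sketched as a late-fusion scheme: embeddings from the two pre-trained unimodal networks are normalised per modality and concatenated before classification. This is a minimal illustration, not the paper's implementation; the embedding dimensions, the random stand-in features, and the normalise-then-concatenate choice are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes -- the paper does not specify embedding dimensions.
N_CLASSES = 9      # ragas in the dataset
AUDIO_DIM = 128    # assumed audio embedding size
GESTURE_DIM = 64   # assumed pose/gesture embedding size
N = 180            # number of 12 s excerpts (illustrative)

# Stand-in latent representations; in the actual work these would come
# from the pre-trained audio and gesture networks.
labels = rng.integers(0, N_CLASSES, size=N)
audio_emb = rng.normal(size=(N, AUDIO_DIM))
gesture_emb = rng.normal(size=(N, GESTURE_DIM))

def fuse(audio: np.ndarray, gesture: np.ndarray) -> np.ndarray:
    """Late fusion: L2-normalise each modality, then concatenate.

    Normalising per modality keeps the (typically stronger) audio
    embedding from dominating the fused vector purely by scale.
    """
    a = audio / np.linalg.norm(audio, axis=1, keepdims=True)
    g = gesture / np.linalg.norm(gesture, axis=1, keepdims=True)
    return np.concatenate([a, g], axis=1)

fused = fuse(audio_emb, gesture_emb)
print(fused.shape)  # (180, 192)
```

The fused vectors would then be fed to a downstream classifier; per-modality normalisation means every fused row has the same norm (sqrt(2)), so neither modality outweighs the other by magnitude alone.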
Files
000033.pdf (345.7 kB)
md5:5a2379fb13780f390ad7207db452f393