ABSTRACT
We present a novel dataset for multi-view video and spatial audio. An ensemble of ten musicians from the BBC Philharmonic Orchestra performed in the orchestra's rehearsal studio in Salford, UK, on 25 March 2014. This setting allowed us to capture a dataset that can be used to simulate a large event while retaining control over the recording conditions and the performance. The dataset consists of hundreds of video and audio clips captured during 18 takes, using a broad range of professional- and consumer-grade equipment, including video at up to 4K resolution and high-end spatial microphones. In addition to the audiovisual essence, sensor metadata has been captured, and ground truth annotations have been created, in particular for temporal synchronization and spatial alignment. Part of the dataset has also been prepared for adaptive content streaming. The dataset is released under a Creative Commons Attribution Non-Commercial Share Alike license and is hosted on a specifically adapted content management platform.
Index Terms
- Multi-sensor concert recording dataset including professional and user-generated content