DOI: 10.1145/2713168.2713191
short-paper
Open Access

Multi-sensor concert recording dataset including professional and user-generated content

Published: 18 March 2015

ABSTRACT

We present a novel dataset for multi-view video and spatial audio. An ensemble of ten musicians from the BBC Philharmonic Orchestra performed in the orchestra's rehearsal studio in Salford, UK, on 25th March 2014. This presented a controlled environment in which to capture a dataset that could be used to simulate a large event, whilst allowing control over the conditions and performance. The dataset consists of hundreds of video and audio clips captured during 18 takes of performances, using a broad range of professional- and consumer-grade equipment, up to 4K video and high-end spatial microphones. In addition to the audiovisual essence, sensor metadata has been captured, and ground truth annotations, in particular for temporal synchronization and spatial alignment, have been created. A part of the dataset has also been prepared for adaptive content streaming. The dataset is released under a Creative Commons Attribution Non-Commercial Share Alike license and hosted on a specifically adapted content management platform.

      Published in

        MMSys '15: Proceedings of the 6th ACM Multimedia Systems Conference
        March 2015
        277 pages
        ISBN: 9781450333511
        DOI: 10.1145/2713168

        Copyright © 2015 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Acceptance Rates

        MMSys '15 Paper Acceptance Rate: 12 of 41 submissions, 29%. Overall Acceptance Rate: 176 of 530 submissions, 33%.
