skip to main content
10.1145/3339825.3391871acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article
Open Access

Software-based versatile video coding decoder parallelization

Published:27 May 2020Publication History

ABSTRACT

Versatile Video Coding (VVC) standard is currently being prepared as the latest video coding standard of the ITU-T and ISO/IEC. The primary goal of the VVC, expected to be finalized in 2020, is to further improve compression performance compared to its predecessor HEVC. The frame level, slice level or Wavefront parallel processing (WPP) existing in VTM (VVC Test Model) doesn't fully utilize the CPU capabilities available in today's multicore systems. Moreover, VTM decoder sequentially processes the decoding tasks. This design is not parallelization friendly. This paper proposes re-designed decoding tasks that parallelize the decoder using: 1. Load balanced task parallelization and 2. CTU (Coding Tree Unit) based data parallelization. The design overcomes the limitations of the existing parallelization techniques by fully utilizing the available CPU computation resource without compromising on the coding efficiency and the memory bandwidth. The parallelization of CABAC and the slice decoding tasks is based on a load sharing scheme, while parallelization of each sub-module of the slice decoding task uses CTU level data parallelization. The parallelization scheme may either remain restricted within an individual decoding task or utilize between task parallelization. Such parallelization techniques achieve real-time VVC decoding on multi-core CPUs, for bitstreams generated using VTM5.0 using Random-Access configuration. An overall average decoding time reduction of 88.97% (w.r.t. VTM5.0 decoder) is achieved for 4K sequences on a 10-core processor.

References

  1. High Efficiency Video Coding (HEVC), Rec. ITU-T H.265 and ISO/IEC 23008-2, January 2013 (and later editions).Google ScholarGoogle Scholar
  2. G. J. Sullivan, J.-R. Ohm, W.-J. Han, and T. Wiegand, "Overview of the High Efficiency Video Coding (HEVC) Standard", IEEE Trans. Circuits and Systems for Video Technology, Vol. 22, No. 12, pp. 1649--1668, December 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Chen, Y. Chen, M. Karczewicz, X. Li, H. Liu, L. Zhang, X. Zhao, "Coding tools investigation for next generation video coding", ITU-T SG16 Doc. COM16-C806, February 2015.Google ScholarGoogle Scholar
  4. E. Alshina, A. Alshin, J. - H. Min, K. Choi, A. Saxena, M. Budagavi, "Known tools performance investigation for next generation video coding", VCEGAZ05, June 2015.Google ScholarGoogle Scholar
  5. A. Segall, V. Baroncini, J. Boyce, J. Chen, T. Suzuki, "Joint call for proposals on video compression with capability beyond HEVC", JVET-H1002, October 2017, Macau, China.Google ScholarGoogle Scholar
  6. B. Bross, K. Andersson, M. Blaser, V. Drugeon, S-H Kim, J. Lainema, J. Li, S. Liu, J. R. Ohm, G. J. Sullivan, "General Video Coding Technology in Responses to the Joint Call for Proposals on Video Compression with Capability beyond HEVC", IEEE Trans. Circuits and Systems for Video Technology, special issue October 2019Google ScholarGoogle Scholar
  7. J. Le. Tanou, M. Blestel, "Analysis of Emerging Video Codecs: Coding Tools, Compression Efficiency and Complexity", SMPTE Motion Imaging Journal, 128 (10), pp. 14--24, November 2019Google ScholarGoogle ScholarCross RefCross Ref
  8. J. Chen, Y. Ye, S. H. Kim, "Algorithm description for Versatile Video Coding and Test Model 6 (VTM 6)", JVET-O2002, July 2019.Google ScholarGoogle Scholar
  9. I. Ahmad, Y. He and M. L. Liou, "Video compression with parallel processing," Parallel Computing, 28 (7-8): 1039-1078, August 2002.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. DF De Souza, N. Roma, L. Sousa, "Co-operative CPU+GPU deblocking filter parallelization for high performance HEVC video codecs," IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4993--4997, May 2014.Google ScholarGoogle Scholar
  11. DF De Souza, A. llic, N. Roma, L. Sousa, "GPU assisted HEVC intra decoder," Journal of Real-Time Image Processing, vol. 12, Issue 2, pp. 531--547, August 2016.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. DF De Souza, A. llic, L. Sousa, "HEVC in-loop GPU parallelization in embedded systems," International Conference on Embedded Computer System: Architectures, Modeling and Simulation (SAMOS XV), pp. 123--130, July 2015.Google ScholarGoogle Scholar
  13. W. Zhang, C. Guo, "Design and Implementation of Parallel Algorithms for Sample Adaptive Offset in HEVC Based on GPU", Sixth International Conference on Information Science and Technology, May 2016.Google ScholarGoogle Scholar
  14. Ittiam system private limited, "AHG16: Early Implementation of VVC software player and Demonstration on Mobile device", JVET-P0307, Geneva, CH, October 2019.Google ScholarGoogle Scholar
  15. A. M. Kotra, M. Raulet and O. Deforges, "Efficient Parallelization of Different HEVC Decoding Stages," 2013 Data Compression Conference, Snowbird, UT, pp. 502--502, March 2013.Google ScholarGoogle Scholar
  16. M. Alvarez-Mesa, C. C. Chi, B. Juurlink, V. George and T. Schierl, "Parallel video decoding in the emerging HEVC standard," 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Kyoto, pp. 1545--1548, March 2012.Google ScholarGoogle Scholar
  17. S. Gudumasu, Y. He, Y. Ye, Y. He, E-S. Ryu, J. Dong, X. Xiu, "Real-time SHVC software decoding with multi-threaded parallel processing", Proceedings of SPIE, Applications of Digital Image Processing, vol. 9217, September 2014.Google ScholarGoogle Scholar
  18. M. Koziri et al., "Adaptive Tile Parallelization for Fast Video Encoding in HEVC," 2016 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData), Chengdu, pp. 738--743, December 2016Google ScholarGoogle Scholar
  19. V. Sze, A. P. Chandrakasan, M. Budagavi and Minhua Zhou, "Parallel CABAC for low power video coding," 2008 15th IEEE International Conference on Image Processing, San Diego, CA, pp. 2096--2099, October 2008.Google ScholarGoogle Scholar
  20. Y. Huang, et.al., "A VVC Proposal with Quaternary Tree plus Binary-Ternary Tree Coding Block Structure and Advanced Coding Techniques," IEEE Transactions on Circuits and Systems for Video Technology, pp. 1--1, October 2019.Google ScholarGoogle Scholar
  21. M. Wang et al., "Extended Quad-Tree Partitioning for Future Video Coding," 2019 Data Compression Conference (DCC), Snowbird, UT, USA, pp. 300--309, May 2019.Google ScholarGoogle ScholarCross RefCross Ref
  22. J. Chen, M. Karczewicz, Y-W. Huang, K. Choi, J-R. Ohm, G. J. Sullivan, "The Joint Exploration Model (JEM) for Video Compression with Capability beyond HEVC", IEEE Transactions on Circuits and Systems for Video Technology, pp. 1--1, October 2019.Google ScholarGoogle Scholar
  23. H. Gao, S. Esenlik, Z. Zhao, E. Steinbach, J. Chen, "Decoder Side Motion Vector Refinement for Versatile Video Coding", IEEE 21st International Workshop on Multimedia Signal Processing (MMSP), pp. 27--29, September 2019.Google ScholarGoogle Scholar
  24. L. Zhang, K. Zhang, H. Liu, H. C. Chuang, Y. Wang, J. Xu, P. Zhao, D. Hong, "History-Based Motion Vector Prediction in Versatile Video Coding", IEEE Data Compression Conference (DCC), pp. 43--52, March 2019.Google ScholarGoogle Scholar
  25. C.-C. Chen, X. Xiu, Y. He, Y. Ye, "Generalized bi-prediction method for future video coding", Proc. IEEE Picture Coding Symp. (PCS), pp. 1--5, December 2016.Google ScholarGoogle ScholarCross RefCross Ref
  26. L. Zhao et al., "Wide Angular Intra Prediction for Versatile Video Coding," 2019 Data Compression Conference (DCC), Snowbird, UT, USA, pp. 53--62, May 2019.Google ScholarGoogle ScholarCross RefCross Ref
  27. Y. Chang et al., "Multiple Reference Line Coding for Most Probable Modes in Intra Prediction," 2019 Data Compression Conference (DCC), Snowbird, UT, USA, pp. 559--559, March 2019.Google ScholarGoogle ScholarCross RefCross Ref
  28. A. Segall, J. Zhao and I. Yamamoto, "Parallel intra prediction for video coding," 28th Picture Coding Symposium, Nagoya, pp. 310--313, December 2010.Google ScholarGoogle Scholar
  29. A. Kenneth, "Combined Intra Inter Prediction Coding Mode", Picture Coding Symposium ITU-T SG16/Q6 (VCEG) Doc. VCEGAD11, October 2006.Google ScholarGoogle Scholar
  30. Y. Wang, X. Guo, X. Fan, Y. Lu, D. Zhao and W. Gao, "Parallel In-Loop Filtering in HEVC Encoder on GPU," in IEEE Transactions on Consumer Electronics, vol. 64, no. 3, pp. 276--284, August 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. A. Norkin et al., "HEVC Deblocking Filter," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 12, pp. 1746--1754, December 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. C. Fu et al., "Sample Adaptive Offset in the HEVC Standard," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 12, pp. 1755--1764, December 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. VTM: the reference software for VVC development: https://vcgit.hhi.fraunhofer.de/jvet/VVCSoftware_VTMGoogle ScholarGoogle Scholar
  34. F. Bossen, J. Boyce, K. Suehring, X. Li and V. Seregin, "JVET common test conditions and software reference configurations", JVET-L1010, Macao, CN, October 2018.Google ScholarGoogle Scholar

Index Terms

  1. Software-based versatile video coding decoder parallelization

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        MMSys '20: Proceedings of the 11th ACM Multimedia Systems Conference
        May 2020
        403 pages
        ISBN:9781450368452
        DOI:10.1145/3339825

        Copyright © 2020 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 27 May 2020

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        MMSys '20 Paper Acceptance Rate18of55submissions,33%Overall Acceptance Rate176of530submissions,33%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader