skip to main content
research-article

Real-time H.264 video encoding in software with fast mode decision and dynamic complexity control

Published:22 February 2010Publication History
Skip Abstract Section

Abstract

This article presents a novel real-time algorithm for reducing and dynamically controlling the computational complexity of an H.264 video encoder implemented in software. A fast mode decision algorithm, based on a Pareto-optimal macroblock classification scheme, is combined with a dynamic complexity control algorithm that adjusts the MB class decisions such that a constant frame rate is achieved. The average coding efficiency of the proposed algorithm was found to be similar to that of conventional encoding operating at half the frame rate. The proposed algorithm was found to provide lower average bitrate and distortion than static complexity scaling.

References

  1. Ahhad, A., Khan, N., Masud, S., and Maud, A. 2004. Selection of variable block sizes in H.264. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'04). 3, 173--176.Google ScholarGoogle Scholar
  2. Akyol, E., Mukherjee, D., and Liu, Y. 2007. Complexity control for Real-time video coding. In Proceedings of the IEEE International Conference on Image Processing (ICIP'07). 1, 77--80.Google ScholarGoogle Scholar
  3. Ates, H. F. and Altunbasak, Y. 2008. Rate-Distortion and complexity optimized motion estimation for H.264 video coding. IEEE Trans. Circ. Syst. Video Technol. 18, 2, 159--171. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Berger, G., Goedeken, R., and Richardson, J. 2007. Motivation and implementation of a software H.264 real-time CIF encoder for mobile TV broadcast applications. IEEE Trans. Broadcast. 53, 2, 584--587.Google ScholarGoogle ScholarCross RefCross Ref
  5. Bjontegaard, G. 2001. Calculation of average PSNR differences between RD curves. In ITU-T VCEG Meeting, document VCEG-M33.Google ScholarGoogle Scholar
  6. Chang, A., Wong, P. H. W., Yeung, Y. M., and Au, O. C. 2004. Fast multi-block selection for H.264 video coding. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS'04). 3, 817--820.Google ScholarGoogle Scholar
  7. Deepak, S. and Chen, T. 2001. Classification based mode decisions for video over networks. IEEE Trans. Multimedia 3, 41--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Everett, H. 1963. Generalized Lagrange multiplier method for solving problems of optimum allocation of resources. Oper. Res. 11, 399--417.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Feng, B., Zhu, G., and Liu, W. 2006. Fast adaptive inter mode decision method for P slices in H.264. In Proceedings of the IEEE 3rd International Conference on Consumer Communications and Networking (CCNC'06). 2, 745--748.Google ScholarGoogle Scholar
  10. Goto, K., Hatabu, A., Nishizuka, H., Matsunaga, K., Nakamura, R., Mochizuki, Y., and Miyazaki, T. 2006. H.264 video encoder implementation on a low-power DSP with low and stable computational complexity. In Proceedings of the IEEE Workshop on Signal Processing Systems Design and Implementation (SIPS '06). 101--106.Google ScholarGoogle Scholar
  11. Han, K. and Lee, Y. 2004. Fast macroblock mode decision in H.264. In Proceedings of the IEEE Region 10 Conference (TENCON'04). 347--350.Google ScholarGoogle Scholar
  12. Hong, Z., Cheng-ke, W., Yang-Li, W., and Yong, F. 2005. Fast mode decision for H.264/AVC based on macroblock correlation. In Proceedings of the IEEE 19th International Conference on Advanced Information Networking and Applications (AINA'05). 775--780. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Hsu, K. W., Xiang, L., and Chopra, R. 2005. An IC design for real-time motion estimation for H.264 digital video. In Proceedings of the 48th Symposium on Circuits and Systems. 1489--1493.Google ScholarGoogle Scholar
  14. Hu, Y., Li, Q., Ma, S., and Kuo, C. C. J. 2006. Joint rate-distortion-complexity optimization for H.264 motion search. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'06). 1949--1952.Google ScholarGoogle Scholar
  15. ISO4964. 2002. Implementation studies group of ISO/IEC. Main results of the AVC complexity analysis. ISO/IEC JTC1/SC29/WG11, Klagenfurt.Google ScholarGoogle Scholar
  16. JM9.5. JVT reference software JM 9.5. available online at http://iphome.hhi.de/suehring/tml.Google ScholarGoogle Scholar
  17. JVT. 2003. Joint video team (JVT) of ISO/IEC MPEG and ITU-T VCEG. Draft ITU-T recommendation and final draft international standard of joint video specification (ITU-T Rec. H264 ISO/IEC 14496--10 AVC), document JVT-G050d35, 7th Meeting.Google ScholarGoogle Scholar
  18. Kaminsky, E., Grois, D., and Hadar, O. 2008. Dynamic computational complexity and bit allocation for optimizing H.264/AVC video compression. J. Vis. Comm. Image Represent. 19, 1, 56--74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Kannangara, C. S., Richardson, I. E. G., Bystrom, M., Solera, J. R., Zhao, Y., MacLennan, A., and Cooney, R. 2006. Low-Complexity skip prediction for H.264 through Lagrangian cost estimation. IEEE Trans. Circ. Syst. Video Techno. 16, 2, 202--208. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Kant, S., Mithun, U., and Gupta, P. S. S. B. K. 2006. Real-Time H.264 video encoder implementation on a programmable DSP processor for videophone applications. In The Digest of Technical Papers of the International Conference on Consumer Electronics (ICCE'06). 93--94.Google ScholarGoogle Scholar
  21. Kim, C. and Jay Kuo, C. C. 2005. A feature-based approach to fast H.264 intra/inter mode decision. In Proceedings of the IEEE International Symposium Circuits and Systems (ISCAS'05). 1, 308--311.Google ScholarGoogle Scholar
  22. Kim, Y., Choe, Y., and Choi, Y. 2006. Fast mode decision algorithm for H.264 using AZCB prediction. In Proceedings of the International Conference on Consumer Electronics (ICCE'06). 33--34.Google ScholarGoogle Scholar
  23. Kossentini, F., Lee, Y. W., Smith, M. J. T, and Ward, R. K. 1997. Predictive RD optimised motion estimation for very low bit-rate video coding. IEEE J. Select. Areas Comm. 15, 9, 1752--1763. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Kuo, T. Y. and Chan, C. H. 2004. Fast macroblock partition prediction for H.264/AVC. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'04). 675--678.Google ScholarGoogle Scholar
  25. Li, G. L., Chen, M. J., Li, H. J., and Hsu, C. T. 2005. Efficient motion search and mode prediction algorithms for motion estimation in H.264/AVC. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS'05). 5481--5484.Google ScholarGoogle Scholar
  26. Saponara, S., Blanch, C., Denolf, K., and Bormans, J. 2002. Data transfer and storage complexity analysis of the AVC codec on a tool-by-tool basis. ISO/IEC JCT1/SC29/WG11, M8547, Klagenfurt.Google ScholarGoogle Scholar
  27. Shen, J. S., Chen, C. H., Tsai, C. M., and Lin, C. W. 2007. Computation-Aware intra-mode decision for H. 264 coding and transcoding. In Proceedings of the 9th IEEE International Symposium on Multimedia Workshops (ISMW'07). 427--433. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Stottrup-Andersen, J., Forchhammer, S., and Aghito, S. M. 2004. Rate-Distortion-Complexity optimization of fast motion estimation in H.264/MPEG-4 AVC. In Proceedings of the International Conference on Image Processing (ICIP '04). 1, 111--114.Google ScholarGoogle Scholar
  29. Wang, C. N., Yang, S. W., Liu, C. M., and Chiang, T. 2004. A hierarchical N-queen decimation lattice and hardware architecture for motion estimation. IEEE Trans. Circ. Syst. Video Technol. 14, 4, 429--440. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Wang, H. and Zhu, Z. 2005. Fast mode decision and reduction of reference frames for H.264 encoder. In Proceedings of the International Conference on Control and Automation (ICCA'05). 1040--1043.Google ScholarGoogle Scholar
  31. We, Z. and Canhui, C. 2006. Realization and optimisation of DSP based H.264 encoder. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS'06). 4.Google ScholarGoogle Scholar
  32. Wiegand, T., Sullivan, J. S., Bjontegaard, G., and Luthra, A. 2003. Overview of the H.264/AVC video coding standard. IEEE Trans. Circ. Technol. 13, 7, 560--576. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Wu, M., Forchhammer, S., and Aghito, S. M. 2007. Complexity control of fast motion estimation in H.264/MPEG-4 AVC with rate-distortion-complexity optimization, Proceedings of SPIE, 6508, 650824.Google ScholarGoogle Scholar
  34. Xiph, org. Test media, available online at http://media.xiph.org/video/derf/.Google ScholarGoogle Scholar
  35. Yin, P., Tourapis, H. Y. C., Tourapis, A. M., and Boyce, J. Fast mode decision and motion estimation for JVT/H.264. In Proceedings of the IEEE International Conference on Image Processing (ICIP'03). 853--856.Google ScholarGoogle Scholar
  36. Yu, A. C. 2004. Efficient block-size selection algorithm for inter-frame coding in H.264/MPEG-4 AVC. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'04). 169--172.Google ScholarGoogle ScholarCross RefCross Ref
  37. Zhao, Y. and Richardson, I. 2003. Macroblock classification for complexity management of video encoders. Signal Process. Image Comm. 18, 801--811.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Real-time H.264 video encoding in software with fast mode decision and dynamic complexity control

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 6, Issue 1
      February 2010
      138 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/1671954
      Issue’s Table of Contents

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 22 February 2010
      • Revised: 1 December 2008
      • Accepted: 1 December 2008
      • Received: 1 April 2008
      Published in tomm Volume 6, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader