Abstract
This article presents a novel real-time algorithm for reducing and dynamically controlling the computational complexity of an H.264 video encoder implemented in software. A fast mode decision algorithm, based on a Pareto-optimal macroblock classification scheme, is combined with a dynamic complexity control algorithm that adjusts the MB class decisions such that a constant frame rate is achieved. The average coding efficiency of the proposed algorithm was found to be similar to that of conventional encoding operating at half the frame rate. The proposed algorithm was found to provide lower average bitrate and distortion than static complexity scaling.
- Ahhad, A., Khan, N., Masud, S., and Maud, A. 2004. Selection of variable block sizes in H.264. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'04). 3, 173--176.Google Scholar
- Akyol, E., Mukherjee, D., and Liu, Y. 2007. Complexity control for Real-time video coding. In Proceedings of the IEEE International Conference on Image Processing (ICIP'07). 1, 77--80.Google Scholar
- Ates, H. F. and Altunbasak, Y. 2008. Rate-Distortion and complexity optimized motion estimation for H.264 video coding. IEEE Trans. Circ. Syst. Video Technol. 18, 2, 159--171. Google ScholarDigital Library
- Berger, G., Goedeken, R., and Richardson, J. 2007. Motivation and implementation of a software H.264 real-time CIF encoder for mobile TV broadcast applications. IEEE Trans. Broadcast. 53, 2, 584--587.Google ScholarCross Ref
- Bjontegaard, G. 2001. Calculation of average PSNR differences between RD curves. In ITU-T VCEG Meeting, document VCEG-M33.Google Scholar
- Chang, A., Wong, P. H. W., Yeung, Y. M., and Au, O. C. 2004. Fast multi-block selection for H.264 video coding. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS'04). 3, 817--820.Google Scholar
- Deepak, S. and Chen, T. 2001. Classification based mode decisions for video over networks. IEEE Trans. Multimedia 3, 41--52. Google ScholarDigital Library
- Everett, H. 1963. Generalized Lagrange multiplier method for solving problems of optimum allocation of resources. Oper. Res. 11, 399--417.Google ScholarDigital Library
- Feng, B., Zhu, G., and Liu, W. 2006. Fast adaptive inter mode decision method for P slices in H.264. In Proceedings of the IEEE 3rd International Conference on Consumer Communications and Networking (CCNC'06). 2, 745--748.Google Scholar
- Goto, K., Hatabu, A., Nishizuka, H., Matsunaga, K., Nakamura, R., Mochizuki, Y., and Miyazaki, T. 2006. H.264 video encoder implementation on a low-power DSP with low and stable computational complexity. In Proceedings of the IEEE Workshop on Signal Processing Systems Design and Implementation (SIPS '06). 101--106.Google Scholar
- Han, K. and Lee, Y. 2004. Fast macroblock mode decision in H.264. In Proceedings of the IEEE Region 10 Conference (TENCON'04). 347--350.Google Scholar
- Hong, Z., Cheng-ke, W., Yang-Li, W., and Yong, F. 2005. Fast mode decision for H.264/AVC based on macroblock correlation. In Proceedings of the IEEE 19th International Conference on Advanced Information Networking and Applications (AINA'05). 775--780. Google ScholarDigital Library
- Hsu, K. W., Xiang, L., and Chopra, R. 2005. An IC design for real-time motion estimation for H.264 digital video. In Proceedings of the 48th Symposium on Circuits and Systems. 1489--1493.Google Scholar
- Hu, Y., Li, Q., Ma, S., and Kuo, C. C. J. 2006. Joint rate-distortion-complexity optimization for H.264 motion search. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'06). 1949--1952.Google Scholar
- ISO4964. 2002. Implementation studies group of ISO/IEC. Main results of the AVC complexity analysis. ISO/IEC JTC1/SC29/WG11, Klagenfurt.Google Scholar
- JM9.5. JVT reference software JM 9.5. available online at http://iphome.hhi.de/suehring/tml.Google Scholar
- JVT. 2003. Joint video team (JVT) of ISO/IEC MPEG and ITU-T VCEG. Draft ITU-T recommendation and final draft international standard of joint video specification (ITU-T Rec. H264 ISO/IEC 14496--10 AVC), document JVT-G050d35, 7th Meeting.Google Scholar
- Kaminsky, E., Grois, D., and Hadar, O. 2008. Dynamic computational complexity and bit allocation for optimizing H.264/AVC video compression. J. Vis. Comm. Image Represent. 19, 1, 56--74. Google ScholarDigital Library
- Kannangara, C. S., Richardson, I. E. G., Bystrom, M., Solera, J. R., Zhao, Y., MacLennan, A., and Cooney, R. 2006. Low-Complexity skip prediction for H.264 through Lagrangian cost estimation. IEEE Trans. Circ. Syst. Video Techno. 16, 2, 202--208. Google ScholarDigital Library
- Kant, S., Mithun, U., and Gupta, P. S. S. B. K. 2006. Real-Time H.264 video encoder implementation on a programmable DSP processor for videophone applications. In The Digest of Technical Papers of the International Conference on Consumer Electronics (ICCE'06). 93--94.Google Scholar
- Kim, C. and Jay Kuo, C. C. 2005. A feature-based approach to fast H.264 intra/inter mode decision. In Proceedings of the IEEE International Symposium Circuits and Systems (ISCAS'05). 1, 308--311.Google Scholar
- Kim, Y., Choe, Y., and Choi, Y. 2006. Fast mode decision algorithm for H.264 using AZCB prediction. In Proceedings of the International Conference on Consumer Electronics (ICCE'06). 33--34.Google Scholar
- Kossentini, F., Lee, Y. W., Smith, M. J. T, and Ward, R. K. 1997. Predictive RD optimised motion estimation for very low bit-rate video coding. IEEE J. Select. Areas Comm. 15, 9, 1752--1763. Google ScholarDigital Library
- Kuo, T. Y. and Chan, C. H. 2004. Fast macroblock partition prediction for H.264/AVC. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'04). 675--678.Google Scholar
- Li, G. L., Chen, M. J., Li, H. J., and Hsu, C. T. 2005. Efficient motion search and mode prediction algorithms for motion estimation in H.264/AVC. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS'05). 5481--5484.Google Scholar
- Saponara, S., Blanch, C., Denolf, K., and Bormans, J. 2002. Data transfer and storage complexity analysis of the AVC codec on a tool-by-tool basis. ISO/IEC JCT1/SC29/WG11, M8547, Klagenfurt.Google Scholar
- Shen, J. S., Chen, C. H., Tsai, C. M., and Lin, C. W. 2007. Computation-Aware intra-mode decision for H. 264 coding and transcoding. In Proceedings of the 9th IEEE International Symposium on Multimedia Workshops (ISMW'07). 427--433. Google ScholarDigital Library
- Stottrup-Andersen, J., Forchhammer, S., and Aghito, S. M. 2004. Rate-Distortion-Complexity optimization of fast motion estimation in H.264/MPEG-4 AVC. In Proceedings of the International Conference on Image Processing (ICIP '04). 1, 111--114.Google Scholar
- Wang, C. N., Yang, S. W., Liu, C. M., and Chiang, T. 2004. A hierarchical N-queen decimation lattice and hardware architecture for motion estimation. IEEE Trans. Circ. Syst. Video Technol. 14, 4, 429--440. Google ScholarDigital Library
- Wang, H. and Zhu, Z. 2005. Fast mode decision and reduction of reference frames for H.264 encoder. In Proceedings of the International Conference on Control and Automation (ICCA'05). 1040--1043.Google Scholar
- We, Z. and Canhui, C. 2006. Realization and optimisation of DSP based H.264 encoder. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS'06). 4.Google Scholar
- Wiegand, T., Sullivan, J. S., Bjontegaard, G., and Luthra, A. 2003. Overview of the H.264/AVC video coding standard. IEEE Trans. Circ. Technol. 13, 7, 560--576. Google ScholarDigital Library
- Wu, M., Forchhammer, S., and Aghito, S. M. 2007. Complexity control of fast motion estimation in H.264/MPEG-4 AVC with rate-distortion-complexity optimization, Proceedings of SPIE, 6508, 650824.Google Scholar
- Xiph, org. Test media, available online at http://media.xiph.org/video/derf/.Google Scholar
- Yin, P., Tourapis, H. Y. C., Tourapis, A. M., and Boyce, J. Fast mode decision and motion estimation for JVT/H.264. In Proceedings of the IEEE International Conference on Image Processing (ICIP'03). 853--856.Google Scholar
- Yu, A. C. 2004. Efficient block-size selection algorithm for inter-frame coding in H.264/MPEG-4 AVC. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'04). 169--172.Google ScholarCross Ref
- Zhao, Y. and Richardson, I. 2003. Macroblock classification for complexity management of video encoders. Signal Process. Image Comm. 18, 801--811.Google ScholarCross Ref
Index Terms
- Real-time H.264 video encoding in software with fast mode decision and dynamic complexity control
Recommendations
Dynamic complexity scaling for real-time H.264/AVC video encoding
MM '07: Proceedings of the 15th ACM international conference on MultimediaThe H.264 video encoding standard can achieve high coding efficiency at the expense of high computational complexity. Typically, real-time software implementation requires omission of most optional encoding tools leading to significantly reduced coding ...
Fast Bit Rate Estimation for Mode Decision of H.264/AVC
To achieve the highest coding efficiency, H.264/AVC uses rate-distortion optimization technique. This means that the encoder has to code the video by exhaustively trying all the mode combinations including the different intra- and inter-prediction ...
Macroblock mode decision scheme for fast encoding in H.264/AVC
PCM'06: Proceedings of the 7th Pacific Rim conference on Advances in Multimedia Information ProcessingTo improve coding efficiency, the H.264/AVC video coding standard uses new coding tools, such as variable block size, quarter-pixel-accuracy motion estimation, multiple reference frames, intra prediction and a loop filter. Using these coding tools, ...
Comments