skip to main content
article

Perception-motivated high dynamic range video encoding

Published:01 August 2004Publication History
Skip Abstract Section

Abstract

Due to rapid technological progress in high dynamic range (HDR) video capture and display, the efficient storage and transmission of such data is crucial for the completeness of any HDR imaging pipeline. We propose a new approach for inter-frame encoding of HDR video, which is embedded in the well-established MPEG-4 video compression standard. The key component of our technique is luminance quantization that is optimized for the contrast threshold perception in the human visual system. The quantization scheme requires only 10--11 bits to encode 12 orders of magnitude of visible luminance range and does not lead to perceivable contouring artifacts. Besides video encoding, the proposed quantization provides perceptually-optimized luminance sampling for fast implementation of any global tone mapping operator using a lookup table. To improve the quality of synthetic video sequences, we introduce a coding scheme for discrete cosine transform (DCT) blocks with high contrast. We demonstrate the capabilities of HDR video in a player, which enables decoding, tone mapping, and applying post-processing effects in real-time. The tone mapping algorithm as well as its parameters can be changed interactively while the video is playing. We can simulate post-processing effects such as glare, night vision, and motion blur, which appear very realistic due to the usage of HDR data.

Skip Supplemental Material Section

Supplemental Material

References

  1. BOGART, R., KAINZ, F., AND HESS, D. 2003. OpenEXR image file format. In ACM SIGGRAPH 2003, Sketches & Applications.Google ScholarGoogle Scholar
  2. BORDER, P., AND GUILLOTEL, P. 2000. Perceptually adapted MPEG video encoding. In IS&T/SPIE Conf. on Hum. Vis. and Electronic Imaging V, Proc. of SPIE, volume 3959, 168--175.Google ScholarGoogle Scholar
  3. BURT, P., AND KOLCZYNSKI, R. 1993. Enhanced image capture through fusion. In Proc. of International Conference on Computer Vision (ICCV), 173--182.Google ScholarGoogle ScholarCross RefCross Ref
  4. CIE. 1981. An Analytical Model for Describing the Influence of Lighting Parameters Upon Visual Performance, vol. 1. Technical Foundations, CIE 19/2.1. International Organization for Standardization.Google ScholarGoogle Scholar
  5. DALY, S. 1993. The Visible Differences Predictor: An algorithm for the assessment of image fidelity. In Digital Image and Human Vision, Cambridge, MA: MIT Press, A. Watson, Ed., 179--206. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. DEBEVEC, P., AND MALIK, J. 1997. Recovering high dynamic range radiance maps from photographs. In Proceedings of SIGGRAPH 97, Computer Graphics Proceedings, Annual Conference Series, 369--378. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. DEVLIN, K., CHALMERS, A., WILKIE, A., AND PURGATHOFER, W. 2002. Tone Reproduction and Physically Based Spectral Rendering. In Eurographics 2002: State of the Art Reports, Eurographics, 101--123.Google ScholarGoogle Scholar
  8. DRAGO, F., MYSZKOWSKI, K., ANNEN, T., AND CHIBA, N. 2003. Adaptive logarithmic mapping for displaying high contrast scenes. Computer Graphics Forum, proceedings of Eurographics 2003 22, 3, 419--426.Google ScholarGoogle Scholar
  9. DURAND, F., AND DORSEY, J. 2000. Interactive tone mapping. In Rendering Techniques 2000: 11th Eurographics Workshop on Rendering, 219--230. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. FERWERDA, J., PATTANAIK, S., SHIRLEY, P., AND GREENBERG, D. 1996. A model of visual adaptation for realistic image synthesis. In Proceedings of SIGGRAPH 96, Computer Graphics Proceedings, Annual Conference Series, 249--258. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. GOODNIGHT, N., WANG, R., WOOLLEY, C., AND HUMPHREYS, G. 2003. Interactive time-dependent tone mapping using programmable graphics hardware. In Rendering Techniques 2003: 14th Eurographics Symposium on Rendering, 26--37. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. HOOD, D., AND FINKELSTEIN, M. 1986. Sensitivity to light. In Handbook of Perception and Human Performance: 1. Sensory Processes and Perception, Wiley, New York, K. Boff, L. Kaufman, and J. Thomas, Eds., vol. 1.Google ScholarGoogle Scholar
  13. HUNT, R. 1995. The Reproduction of Colour in Photography, Printing and Television: 5th Edition. Fountain Press.Google ScholarGoogle Scholar
  14. ISO-IEC-14496-2. 1999. Information technology: Coding of audio-visual objects, Part 2: Visual. International Organization for Standardization, Geneva, Switzerland.Google ScholarGoogle Scholar
  15. KANG, S., UYTTENDAELE, M., WINDER, S., AND SZELISKI, R. 2003. High dynamic range video. ACM Transactions on Graphics 22, 3, 319--325. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. LUBIN, J., AND PICA, A. 1991. A non-uniform quantizer matched to the human visual performance. Society of Information Display Int. Symposium Technical Digest of Papers, 22, 619--622.Google ScholarGoogle Scholar
  17. NADENAU, M. 2000. Integration of Human color vision Models into High Quality Image Compression. PhD thesis, École Polytechnique Fédéral Lausane.Google ScholarGoogle Scholar
  18. NAYAR, S., AND BRANZOI, V. 2003. Adaptive dynamic range imaging: Optical control of pixel exposures over space and time. In Proc. of IEEE International Conference on Computer Vision (ICCV 2003), 1168--1175. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. NAYAR, S., AND MITSUNAGA, T. 2000. High dynamic range imaging: Spatially varying pixel exposures. In Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, 472--479.Google ScholarGoogle ScholarCross RefCross Ref
  20. PATTANAIK, S., TUMBLIN, J., YEE, H., AND GREENBERG, D. 2000. Time-dependent visual adaption for realistic image display. In Proceedings of ACM SIGGRAPH 2000, Computer Graphics Proceedings, Annual Conference Series, 47--54. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. PRESS, W., TEUKOLSKY, S., VETTERLING, W., AND FLANNERY, B. 1993. Numerical Recipes in C. Cambridge Univ. Press.Google ScholarGoogle Scholar
  22. REINHARD, E., STARK, M., SHIRLEY, P., AND FERWERDA, J. 2002. Photographic tone reproduction for digital images. ACM Transactions on Graphics 21, 3, 267--276. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. SAITO, K. 1995. Electronic image pickup device. Japanese Patent 07-254965.Google ScholarGoogle Scholar
  24. SEETZEN, H., HEIDRICH, W., STUERZLINGER, W., WARD, G., WHITE-HEAD, L., TRENTACOSTE, M., GHOSH, A., AND VOROZCOVS, A. 2004. High dynamic range display systems. ACM Transactions on Graphics 23, 3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. SEZAN, M., YIP, K., AND DALY, S. 1987. Uniform perceptual quantization: Applications to digital radiography. IEEE Transactions on Systems, Man, and Cybernetics 17, 4, 622--634.Google ScholarGoogle ScholarCross RefCross Ref
  26. SHAPLEY, R., AND ENROTH-CUGELL, C. 1984. Visual adaption and retinal gain controls. In Progress in Retinal Research, Oxford: Pergamon Press, vol. 3, 263--346.Google ScholarGoogle Scholar
  27. SHEN, K., AND DELP, E. 1999. Wavelet based rate scalable video compression. IEEE Transactions on Circuits and Systems for Video Technology 9, 1, 109--122. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. SPENCER, G., SHIRLEY, P., ZIMMERMAN, K., AND GREENBERG, D. 1995. Physically-based glare effects for digital images. In Proceedings of ACM SIGGRAPH 95, 325--334. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. THOMSPON, W. B., SHIRLEY, P., AND FERWERDA, J. A. 2002. A spatial post-processing algorithm for images of night scenes. Journal of Graphics Tools 7, 1, 1--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. VAN NES, F., AND BOUMAN, M. 1967. Spatial modulation transfer in the human eye. Journal of the Optical Society of America 57, 401--406.Google ScholarGoogle ScholarCross RefCross Ref
  31. WANG, Z., AND BOVIK, A. 2002. A universal image quality index. IEEE Signal Processing Letters 9, 3, 81--84.Google ScholarGoogle ScholarCross RefCross Ref
  32. WARD LARSON, G., RUSHMEIER, H., AND PIATKO, C. 1997. A visibility matching tone reproduction operator for high dynamic range scenes. IEEE Transactions on Visualization and Computer Graphics 3, 4, 291--306. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. WARD LARSON, G. 1998. Logluv encoding for full-gamut, high-dynamic range images. Journal of Graphics Tools 3, 1, 815--30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. WARD, G. 1991. Real pixels. In Graphics Gems II, J. Arvo, Ed. Academic Press, 80--83.Google ScholarGoogle Scholar

Index Terms

  1. Perception-motivated high dynamic range video encoding

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Graphics
          ACM Transactions on Graphics  Volume 23, Issue 3
          August 2004
          684 pages
          ISSN:0730-0301
          EISSN:1557-7368
          DOI:10.1145/1015706
          Issue’s Table of Contents

          Copyright © 2004 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 1 August 2004
          Published in tog Volume 23, Issue 3

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader