Paper
4 March 2015 Perceptual vector quantization for video coding
Jean-Marc Valin, Timothy B. Terriberry
Author Affiliations +
Proceedings Volume 9410, Visual Information Processing and Communication VI; 941009 (2015) https://doi.org/10.1117/12.2080529
Event: SPIE/IS&T Electronic Imaging, 2015, San Francisco, California, United States
Abstract
This paper applies energy conservation principles to the Daala video codec using gain-shape vector quantization to encode a vector of AC coefficients as a length (gain) and direction (shape). The technique originates from the CELT mode of the Opus audio codec, where it is used to conserve the spectral envelope of an audio signal. Conserving energy in video has the potential to preserve textures rather than low-passing them. Explicitly quantizing a gain allows a simple contrast masking model with no signaling cost. Vector quantizing the shape keeps the number of degrees of freedom the same as scalar quantization, avoiding redundancy in the representation. We demonstrate how to predict the vector by transforming the space it is encoded in, rather than subtracting off the predictor, which would make energy conservation impossible. We also derive an encoding of the vector-quantized codewords that takes advantage of their non-uniform distribution. We show that the resulting technique outperforms scalar quantization by an average of 0.90 dB on still images, equivalent to a 24.8% reduction in bitrate at equal quality, while for videos, the improvement averages 0.83 dB, equivalent to a 13.7% reduction in bitrate.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jean-Marc Valin and Timothy B. Terriberry "Perceptual vector quantization for video coding", Proc. SPIE 9410, Visual Information Processing and Communication VI, 941009 (4 March 2015); https://doi.org/10.1117/12.2080529
Lens.org Logo
CITATIONS
Cited by 13 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Quantization

Video

Computer programming

Video coding

Distortion

Radon

Contrast sensitivity

RELATED CONTENT


Back to Top