Abstract
The ever-growing video streaming services require accurate quality assessment with often no reference to the original media. One primary challenge in developing no-reference (NR) video quality metrics is achieving real-timeliness while retaining the accuracy. A real-time no-reference video quality assessment (VQA) method is proposed for videos encoded by H.264/AVC codec. Temporal and spatial features are extracted from the encoded bit-stream and pixel values to train and validate a fully connected neural network. The hand-crafted features and network dynamics are designed in a manner to ensure a high correlation with human judgment of quality as well as minimizing the computational complexities. Proof-of-concept experiments are conducted via comparison with: 1) video sequences rated by a full-reference quality metric, and 2) H.264-encoded sequences from the LIVE video dataset which are subjectively evaluated through differential mean opinion scores (DMOS). The performance of the proposed method is verified by correlation measurements with the aforementioned objective and subjective scores. The framework achieves real-time execution while outperforming state-of-art full-reference and no-reference video quality assessment methods.
Similar content being viewed by others
References
Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M, Ghemawat S, Goodfellow I, Harp A, Irving G, Isard M, Jia Y, Jozefowicz R, Kaiser L, Kudlur M, Levenberg J, Mané D, Monga R, Moore S, Murray D, Olah C, Schuster M, Shlens J, Steiner B, Sutskever I, Talwar K, Tucker P, Vanhoucke V, Vasudevan V, Viégas F, Vinyals O, Warden P, Wattenberg M, Wicke M, Yu Y, Zheng X (2015) TensorFlow: large-scale machine learning on heterogeneous systems. https://www.tensorflow.org/
Ahn S, Lee S (2018) No-reference video quality assessment based on convolutional neural network and human temporal behavior. In: Asia-Pacific signal and information processing association annual summit and conference (APSIPA ASC), pp 1513–1517
Born RT, Bradley DC (2005) Structure and function of visual area MT. Ann Rev Neurosci 28(1):157–189. https://doi.org/10.1146/annurev.neuro.26.041002.131052
Caruana R, Lawrence S, Giles CL (2001) Overfitting in neural nets: backpropagation, conjugate gradient, and early stopping. In: Advances in neural information processing systems, pp 402–408
Fang R, Al-Bayaty R, Wu D (2017) BNB Method for no-reference image quality assessment. IEEE Transactions on Circuits and Systems for Video Technology 27(7):1381–1391
Fazliani Y, Andrade E, Shirani S (2019) Learning based hybrid no-reference video quality assessment of compressed videos. In: 2019 IEEE international symposium on circuits and systems (ISCAS), pp 1–5
Gu J, Meng G, Redi JA, Xiang S, Pan C (2018) Blind image quality assessment via vector regression and object oriented pooling. IEEE Transactions on Multimedia 20(5):1140–1153
Gunawan IP, Ghanbari M (2008) Reduced-reference video quality assessment using discriminative local harmonic strength with motion consideration. IEEE Transactions on Circuits and Systems for Video Technology 18(1):71–83
Hou W, Gao X, Tao D, Li X (2015) Blind image quality assessment via deep learning. IEEE Transactions on Neural Networks and Learning Systems 26(6):1275–1286
Huynh-Thu Q, Ghanbari M (2008) Temporal aspect of perceived quality in mobile video broadcasting. IEEE Trans Broadcast 54(3):641–651
Keimel C, Klimpke M, Habigt J, Diepold K (2011) No-reference video quality metric for hdtv based on h.264/avc bitstream features. In: IEEE international conference on image processing, pp 3325–3328
Kim J, Lee S (2017) Fully deep blind image quality predictor. IEEE Journal of Selected Topics in Signal Processing 11(1):206–220
Kim J, Zeng H, Ghadiyaram D, Lee S, Zhang L, Bovik AC (2017) Deep convolutional neural models for picture-quality prediction: challenges and solutions to data-driven image quality assessment. IEEE Signal Proc Mag 34(6):130–141
Larson EC, Chandler DM (2010) Most apparent distortion: full-reference image quality assessment and the role of strategy. Journal of Electronic Imaging 19(1):011006:1–011006:21
Lay D (2012) Linear algebra and its applications. Addison-Wesley
Liu W, Duanmu Z, Wang Z (2018) End-to-end blind quality assessment of compressed videos using deep neural networks. In: ACM international conference on multimedia, ACM, MM ’18, pp 546–554
Mittal A, Moorthy AK, Bovik AC (2012) No-reference image quality assessment in the spatial domain. IEEE Trans Image Process 21(12):4695–4708
Mittal A, Soundararajan R, Bovik AC (2013) Making a completely blind image quality analyzer. IEEE Signal Processing Letters 20(3):209–212
Mocanu DC, Liotta A, Ricci A, Vega MT, Exarchakos G (2014) When does lower bitrate give higher quality in modern video services?. In: IEEE network operations and management symposium (NOMS), pp 1–5
Ninassi A, Meur OL, Callet PL, Barba D (2009) Considering temporal variations of spatial visual distortions in video quality assessment. IEEE Journal of Selected Topics in Signal Processing 3(2):253– 265
Péchard S, Carnec M, Callet PL, Barba D (2006) From sd to hd television: effects of h.264 distortions versus display size on quality of experience. International Conference on Image Processing: 409–412
Pinson MH, Wolf S (2004) A new standardized method for objectively measuring video quality. IEEE Trans Broadcast 50(3):312–322
Pitrey Y, Barkowsky M, Callet PL, Pépion R (2010) Subjective quality evaluation of h.264 high-definition video coding versus spatial up-scaling and interlacing. Euro ITV
Pitrey Y, Barkowsky M, Pépion R, Callet PL, Hlavacs H (2012) Influence of the source content and encoding configuration on the perceived quality for scalable video coding. SPIE Proc 8291:1–6
Richardson IE (2010) The H.264 advanced video compression standard. Wiley
Saad MA, Bovik AC, Charrier C (2014) Blind prediction of natural video quality. IEEE Trans Image Process 23(3):1352–1365
Seshadrinathan K, Bovik AC (2010) Motion tuned spatio-temporal quality assessment of natural videos. IEEE Trans Image Process 19(2):335–350
Seshadrinathan K, Soundararajan R, Bovik AC, Cormack LK (2010) Study of subjective and objective quality assessment of video. IEEE Trans Image Process 19(6):1427–1441
Shahid M, Panasiuk J, Wallendael GV, Barkowsky M, Lövström B (2015) Predicting full-reference video quality measures using hevc bitstream-based no-reference features. In: International workshop on quality of multimedia experience (QoMEX), pp 1–2
Soundararajan R, Bovik AC (2012) RRED indices: reduced reference entropic differencing for image quality assessment. IEEE Trans Image Process 21 (2):517–526
Søgaard J, Forchhammer S, Korhonen J (2015) No-reference video quality assessment using codec analysis. IEEE Transactions on Circuits and Systems for Video Technology 25(10):1637–1650
Usman MA, Shin SY, Shahid M, Lövström B (2016) A no reference video quality metric based on jerkiness estimation focusing on multiple frame freezing in video streaming. IETE Tech Rev 34(3):309–320
Vega MT, Liotta A (2016) LIMP video quality database. https://www.tue.nl/index.php?id=53688
Vega MT, Mocanu DC, Famaey J, Stavrou S, Liotta A (2017) Deep learning for quality assessment in live video streaming. IEEE Signal Processing Letters 24(6):736–740
Video Quality Experts Group (VQEG) (2000) Final report from the video quality experts group on the validation of objective models of video quality assessment, phase I. https://www.tue.nl/index.php?id=53688, http://www.its.bldrdoc.gov/vqeg/projects/frtv_phaseI
Vink JP, de Haan G (2011) No-reference metric design with machine learning for local video compression artifact level. IEEE Journal of Selected Topics in Signal Processing 5(2):297–308
Vu PV, Vu CT, Chandler DM (2011) A spatiotemporal most-apparent-distortion model for video quality assessment. In: IEEE international conference on image processing, pp 2505–2508
Wang Z, Bovik AC, Evan BL (2000) Blind measurement of blocking artifacts in images. In: International conference on image processing, vol 3, pp 981–984
Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13(4):600–612
Wang Z, Sheikh HR, Bovik AC (2002) No-reference perceptual quality assessment of jpeg compressed images. In: International conference on image processing, vol 1, pp I–477–I–480
Weiss Y, Freeman WT (2007) What makes a good model of natural images?. In: IEEE conference on computer vision and pattern recognition, pp 1–8
Xu B, Pan X, Zhou Y, Li Y, Yang D, Chen Z (2017) CNN-based rate-distortion modeling for H.265/HEVC. In: 2017 IEEE visual communications and image processing (VCIP), pp 1–4
Xu J, Ye P, Liu Y, Doermann D (2014) No-reference video quality assessment via feature learning. In: IEEE international conference on image processing (ICIP), pp 491–495
Ye P, Kumar J, Kang L, Doermann D (2012) Unsupervised feature learning framework for no-reference image quality assessment. In: IEEE conference on computer vision and pattern recognition, pp 1098–1105
Zeng K, Wang Z (2010) Temporal motion smoothness measurement for reduced-reference video quality assessment. In: IEEE international conference on acoustics, speech and signal processing, pp 1010–1013
Zhu K, Hirakawa K, Asari V, Saupe D (2013) A no-reference video quality assessment based on laplacian pyramids. In: IEEE international conference on image processing, pp 49–53
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Fazliani, Y., Andrade, E. & Shirani, S. Neural network solution for a real-time no-reference video quality assessment of H.264/AVC video bitstreams. Multimed Tools Appl 81, 2409–2427 (2022). https://doi.org/10.1007/s11042-021-10654-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-10654-0