Please wait a minute...
浙江大学学报(理学版)  2017, Vol. 44 Issue (1): 57-63    DOI: 10.3785/j.issn.1008-9497.2017.01.009
电子科学     
基于失真传递的时域自适应量化算法
殷海兵, 王鸿奎, 王忠霄
中国计量大学 信息工程学院, 浙江 杭州 310018
A temporally adaptive quantization algorithm with constrained distortion propagation in video coding
YIN Haibing, WANG Hongkui, WANG Zhongxiao
College of Information Engineering, China Jiliang University, Hangzhou 310018, China
 全文: PDF(3335 KB)   HTML  
摘要: 码率控制是视频编码器中的关键模块,其算法直接决定编码器率失真性能.视频编码帧间预测导致的编码失真会在时域产生传递效应,考虑该传递效应是优化码率控制算法性能的关键.宏块树码率控制是一种典型的时域量化控制算法,核心是根据编码单元失真传递量(相对传递代价ρ)自适应地调整量化参数(偏移量δ),合适的δ-ρ映射关系是宏块树量化控制算法的核心.宏块树算法采用基于经验的δ-ρ模型,对不同视频序列的普适性有待改进,模型准确度和精度也需进一步优化.针对上述问题,将竞争决策方法用于探索最优δ-ρ映射关系,提出了一种率失真性能优化的失真时域传递自适应量化δ-ρ模型,以改进时域自适应量化算法.实验结果表明,信噪比BD-PSNR较原模型提升了0.14 dB以上,SSIM性能提升了0.29 dB.算法能更好地控制码率时域分配,降低失真时域传递恶化.
关键词: 视频编码码率控制率失真优化失真传递竞争决策    
Abstract: Rate control is crucial to rate distortion performance optimization in video coding design. In video coder, temporal prediction bring about distortion propagation along adjacent frames, and it is an efficient way to further improve the video coding efficiency by taking the temporal distortion dependency into consideration. The MBTree rate control is a typical temporal quantization control algorithm, in which the quantization parameter offset δ is employed for quantization adjustment according to the distortion propagation amount, i.e. the relative propagation cost ρ. An appropriate δ-ρ model is therefore the key for the MBTree-like adaptive quantization algorithm. Nevertheless, the current δ-ρ model in MBTree algorithm is designed in an empirical way with rough accuracy. This model has unsatisfactory universality to different video sequences, thus there is still room left to be improved. This paper focuses on this problem and applies the competitive decision mechanism in exploring the optimized δ-ρ model, and then proposes an improved δ-ρ model with rate distortion optimization. The simulation results show that the improved MBTree algorithm based on the proposed model can achieve up to 0.14 dB BD-PSNR improvement and 0.29 dB SSIM improvement. The proposed algorithm can also implement better bit allocation in temporal domain and reduce the temporal distortion fluctuation, achieving adaptive quantization control.
Key words: video coding    rate control    rate distortion optimization    distortion propagation    competitive decision
收稿日期: 2016-01-03 出版日期: 2017-01-22
CLC:  TN919  
基金资助: 国家自然科学基金资助项目(61572449);浙江省自然科学基金资助项目(LY15F020022,LY13H180011)
作者简介: 殷海兵(1974-),ORCID:http://orcid.org/0000-0002-3025-0938,男,硕士,教授,主要从事数字视频编解码研究,E-mail:haibingyin@163.com.
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章  
殷海兵
王鸿奎
王忠霄

引用本文:

殷海兵, 王鸿奎, 王忠霄. 基于失真传递的时域自适应量化算法[J]. 浙江大学学报(理学版), 2017, 44(1): 57-63.

YIN Haibing, WANG Hongkui, WANG Zhongxiao. A temporally adaptive quantization algorithm with constrained distortion propagation in video coding. Journal of ZheJIang University(Science Edition), 2017, 44(1): 57-63.

链接本文:

https://www.zjujournals.com/sci/CN/10.3785/j.issn.1008-9497.2017.01.009        https://www.zjujournals.com/sci/CN/Y2017/V44/I1/57

[1] LI B, LI H Q, LI L, et al. λ domain rate control algorithm for high efficiency video coding[J]. IEEE Transactions on Image Processing, 2014,23(9):23-50.
[2] LEE B, KIM M, NGUYEN T Q. A frame-level rate control scheme based on texture and nontexture rate models for high efficiency video coding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2014,24(3):465-479.
[3] YIN H B, YANG E H, YU X. Fast soft decision quantization with adaptive preselection and dynamic trellis graph[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2015,25(8):1362-1375.
[4] RAMCHANDRAN K, ORTEGA A, VETTERLI M. Bit allocation for dependent quantization with applications to multi resolution and MPEG video coders[J]. IEEE Transactions on Image Processing, 1994,3(5):533-545.
[5] LIU J Y, CHO Y, GUO Z M, et al. Bit allocation for spatial scalability coding of H.264/SVC with dependent rate-distortion analysis[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2010,20(7):967-981.
[6] 陈杰,虞露.视频编码中考虑参考帧质量的重建图像失真模型[D].杭州:浙江大学,2012. CHEN J,YU L. The Distortion Model of the Reconstructed Picture Considering Reference Frame in Video Coding[D]. Hangzhou:Zhejiang University, 2012.
[7] PANG C, AU O C, ZOU F, et al. An analytic framework for frame-level dependent bit allocation in hybrid video coding[J]. IEEE Transactions on Circuits and Systems for Video Technology,2013,23(6):990-1002.
[8] WANG S S, MA S W, WANG S Q, et al. Rate-GOP based rate control for high efficiency video coding[J]. IEEE Journal of Selected Topics in Signal Processing, 2013,7(6):1101-1111.
[9] 朱策,周益民,李帅,等,基于信源失真时域传播的视频编码率失真优化(AVS-M3406)[C]//The 49th Meeting of AVS. 大连:大连理工大学,2014. ZHU C, ZHOU Y M, LI S, et al. Rate distortion optimization for video coding based on source-end distortion propagation Chain (AVS-M3406)[C]//The 49th Meeting of AVS. Dalian:Dalian University of Technology,2014.
[10] ORTEGA A, RAMCHANDRAN K. Rate-distortion methods for image and video compression[J]. IEEE Signal Processing Magazine, 1998,15(6):23-50.
[11] LEE J S, EBRAHIMI T. Perceptual video compression:A survey[J]. IEEE Journal of Selected Topics in Signal Processing, 2012,6(6):684-697.
[12] 蒋刚毅,朱亚培,郁梅,等.基于感知的视频编码方法综述[J].电子与信息学报,2013,35(2):474-483.JIANG G Y, ZHU Y P, YU M, et al. Perceptual video coding:A survey[J]. Journal of Electronics & Information Technology, 2013,35(2):474-483.
[13] 崔子冠,朱秀昌.基于结构相似的H.264主观率失真性能改进机制[J].电子与信息学报,2012,34(2):433-439. CUI Z G, ZHU X C. Subjective rate-distortion performance improvement scheme for H.264 based on SSIM[J]. Journal of Electronics & Information Technology,2012,34(2):433-439.
[14] 郑明魁,苏凯雄,王卫星,等.基于视觉感知的高效视频编码标准帧内量化矩阵优化方法[J].电子与信息学报,2014,36(12):2861-2868. ZHENG M K, SU K X, WANG W X, et al. An improved intra quantization matrix for high efficiency video coding based on visual perception[J]. Journal of Electronics & Information Technology, 2014,36(12):2861-2868.
[15] JASON G. A novel macroblock-tree algorithm for high-performance optimization of dependent video coding in H.264/AVC[EB]. http://x264.nl/developers/Dark_Shikari/MBtree%20paper.pdf.
[16] FUSS I G, NAVARRO D J. Open parallel cooperative and competitive decision processes:A potential provenance for quantum probability decision models[J]. Topics in Cognitive Science, 2013,5(4):818-843.
[1] 王秀敏, 洪芳菲, 殷海兵, 李正权, 肖丙刚. 基于LDPC/Turbo双模译码器的自适应迭代译码算法研究[J]. 浙江大学学报(理学版), 2016, 43(5): 573-579.