视频编码中指导整数Karhunen-Loève变换设计的可逆-增益模型

doi:10.1631/FITEE.1500071

Front. Inform. Technol. Electron. Eng.

2015, Vol. 16

Issue (10): 883-891 DOI: 10.1631/FITEE.1500071

视频编码中指导整数Karhunen-Loève变换设计的可逆-增益模型

Xing-guo Zhu, Lu Yu

Zhejiang Provincial Key Laboratory of Information Network Technology, Institute of Information and Communication Engineering, Zhejiang University, Hangzhou 310027, China

A reversibility-gain model for integer Karhunen-Loève transform design in video coding

Xing-guo Zhu, Lu Yu

Zhejiang Provincial Key Laboratory of Information Network Technology, Institute of Information and Communication Engineering, Zhejiang University, Hangzhou 310027, China

全文: PDF

摘要： 目的：Karhunen-Loève变换（KLT）核矩阵含有无理数而需要整数化。但整数化过程通常会削弱KLT对视频信号去相关能力，同时整数KLT的非正交性也会带来失真。因而需要一个评价模型来指导整数KLT的设计。
创新点：综合考虑KLT在视频压缩中所起作用，分别对整数KLT矩阵的可逆程度和其变换编码增益（TCG）进行建模，并形成可逆-增益模型。用该模型进行KLT的整数化设计，得到的整数KLT矩阵在视频压缩效率上都高于其他整数化方法得到的矩阵。
方法：首先，充分考虑KLT的正交性，分析无量化情形下整数变换编码过程中失真的来源及其与整数变换核矩阵的解析关系，并利用此解析关系为整数KLT矩阵的可逆程度进行建模（式15）。然后，由于KLT可以最大化TCG，我们对KLT在整数化过程中的TCG损失率进行建模（式16），并分析整数余弦变换的TCG损失率（表1），以作为参考。最后，联合变换的可逆程度和TCG损失率，形成一个指导整数KLT设计的可逆-增益模型（式18）：在给定TCG损失率约束下，具有最佳可逆程度的整数KLT即是对给定KLT进行整数化的最优结果。
结论：在视频压缩中，给定任意KLT矩阵和倍乘因子下，利用本文提出的可逆-增益模型指导该KLT的整数化，能得到在压缩效率上最优的整数KLT矩阵。

关键词： 整数变换; KLT; 变换编码; 视频编码

Abstract: Karhunen-Loève transform (KLT) is the optimal transform that minimizes distortion at a given bit allocation for Gaussian source. As a KLT matrix usually contains non-integers, integer-KLT design is a classical problem. In this paper, a joint reversibility-gain (R-G) model is proposed for integer-KLT design in video coding. Specifically, the ‘reversibility’ is modeled according to distortion analysis in using forward and inverse integer transform without quantization. It not only measures how invertible a transform is, but also bounds the distortion introduced by the non-orthonormal integer transform process. The ‘gain’ means transform coding gain (TCG), which is a widely used criterion for transform design in video coding. Since KLT maximizes the TCG under some assumptions, here we define the TCG loss ratio (LR) to measure how much coding gain an integer-KLT loses when compared with the original KLT. Thus, the R-G model can be explained as follows: subject to a certain TCG LR, an integer-KLT with the best reversibility is the optimal integer transform for a given non-integer-KLT. Experimental results show that the R-G model can guide the design of integer-KLTs with good performance.

Key words: Integer transform Karhunen-Loève transform (KLT) Integer-KLT Transform coding Video coding

收稿日期: 2015-03-09 出版日期: 2015-10-08

CLC:

TN919.8

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	Xing-guo Zhu
	Lu Yu

引用本文:

Xing-guo Zhu, Lu Yu. A reversibility-gain model for integer Karhunen-Loève transform design in video coding. Front. Inform. Technol. Electron. Eng., 2015, 16(10): 883-891.

链接本文:

http://www.zjujournals.com/xueshu/fitee/CN/10.1631/FITEE.1500071 或 http://www.zjujournals.com/xueshu/fitee/CN/Y2015/V16/I10/883

[1]	En-zhong Yang, Lin-kai Zhang, Zhen Yao, Jian Yang. 软件定义网络中采用可伸缩视频组播的视频会议系统[J]. Front. Inform. Technol. Electron. Eng., 2016, 17(7): 672-681.
[2]	Kai Huang, De Ma, Rong-jie Yan, Hai-tong Ge, Xiao-lang Yan. High throughput VLSI architecture for H.264/AVC context-based adaptive binary arithmetic coding (CABAC) decoding[J]. Front. Inform. Technol. Electron. Eng., 2013, 14(6): 449-463.
[3]	Yi-xiong Zhang, Jiang-hong Shi, Wei-dong Wang. Video coding using geometry based block partitioning and reordering discrete cosine transform[J]. Front. Inform. Technol. Electron. Eng., 2012, 13(1): 71-82.
[4]	Liang Wei, Dan-dan Ding, Juan Du, Bin-bin Yu, Lu Yu. An efficient hardware design for HDTV H.264/AVC encoder[J]. Front. Inform. Technol. Electron. Eng., 2011, 12(6): 499-506.
[5]	Xin-hao Chen, Lu Yu. Distributed video coding with adaptive selection of hash functions[J]. Front. Inform. Technol. Electron. Eng., 2011, 12(5): 387-396.
[6]	Xin-hao Chen, Xing-guo Zhu, Xiao-lin Shen, Lu Yu. Hash signature saving in distributed video coding[J]. Front. Inform. Technol. Electron. Eng., 2011, 12(2): 163-170.
[7]	Cong-dao Han, Ji-lin Liu, Zhi-yu Xiang. An adaptive fast search algorithm for block motion estimation in H.264[J]. Front. Inform. Technol. Electron. Eng., 2010, 11(8): 637-644.
[8]	Wen-yi Wang, Yao-wu Chen. Is playing-as-downloading feasible in an eMule P2P file sharing system?[J]. Front. Inform. Technol. Electron. Eng., 2010, 11(6): 465-475.
[9]	Vahid BASTANI, Mohammad Sadegh HELFROUSH, Keyvan KASIRI. Image compression based on spatial redundancy removal and image inpainting[J]. Front. Inform. Technol. Electron. Eng., 2010, 11(2): 92-100.
[10]	Lu YU, Jian-peng WANG. Review of the current and future technologies for video compression[J]. Front. Inform. Technol. Electron. Eng., 2010, 11(1): 1-13.

Viewed

Full text

Abstract

Cited

Shared

Discussed