基于HRTF频谱特征优化MDCT域滤波

doi:10.3785/j.issn.1008-973X.2010.09.017

2010, Vol. 44

Issue (9): 1730-1737 DOI: 10.3785/j.issn.1008-973X.2010.09.017

无线电电子学、电信技术

基于HRTF频谱特征优化MDCT域滤波

朱梦尧, 李东晓, 张明

1. 浙江大学信息与电子工程学系, 浙江杭州 310027; 2.上海大学通信与信息工程学院, 上海 200072

Filtering optimization in MDCT domain base on spectrum character of HRTF

ZHU Meng-yao1,2, LI Dong-xiao1, ZHANG Ming1

1.Department of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China;
2.School of Communication and Information Engineering, Shanghai University, Shanghai 200072, China

全文: PDF HTML

摘要：

面向音频多声道虚拟环绕处理的应用,提出一种改进离散余弦变换(MDCT)域的头相关传输函数(HRTF)高效滤波算法.通过MDCT的多相滤波结构分解,得到MDCT域滤波矩阵,再根据矩阵稀疏表征的思想,以及HRTF的频谱动态范围大的特点,对MDCT滤波矩阵进行动态邻域优化,有效地提高了MDCT域滤波的效率.通过大量的实验对比表明：该方法较以往的MDCT域滤波方法大大降低了运算复杂度,且保持滤波结果的一致性；该动态邻域算法与传统时域﹑频域处理方法相比,主观质量差异较小；该方法大大降低了虚拟环绕处理的算法复杂度,尤其适合采用MDCT编码的压缩音频格式.

Abstract:

An efficient filtering algorithm of multichannel virtual surround processing in modified discrete cosine transform (MDCT) was proposed. The MDCT filter bank can be represented in a polyphase structure and further derived to a high order matrix representation. By using the property of sparse matrix representation and the large dynamic range of headrelated transfer function (HRTF), an adaptive neighborhood algorithm was presented for fast filtering with HRTF in the MDCT domain. The width of a neighborhood can be adjusted according to the spectrum of HRTF in MDCT domain. Experimental results show that the proposed algorithm greatly reduced the implementation cost, and maintained same filtering quality. In order to compare with different multichannel virtual surround processing methods, a subjective listening test was conducted. The listening results reveal that the proposed approach can achieve basically same subjective quality as the traditional filtering methods. The complexity analysis of the virtual surround processing methods indicates that the proposed filtering algorithm of HRTF consumes minimal calculation, which is particularly suitable for audio compressed in MDCT format.

出版日期: 2010-09-01

TN 912

基金资助:

国家自然科学基金资助项目(60802013, 60872115, 60873130, 61001161); 上海大学创新基金资助项目(A10-0107-09-006); 上海市教委“电路与系统”重点学科建设资助项目(J50104).

通讯作者: 李东晓, 男, 副教授. E-mail: lidx@zju.edu.cn

作者简介: 朱梦尧(1982-), 男, 陕西西安人, 讲师, 从事多媒体信号处理与集成电路设计的研究. E-mail: zhumengyao@shu.edu.cn

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章

引用本文:

朱梦尧, 李东晓, 张明. 基于HRTF频谱特征优化MDCT域滤波[J]. J4, 2010, 44(9): 1730-1737.

SHU Meng-Yao, LI Dong-Xiao, ZHANG Meng. Filtering optimization in MDCT domain base on spectrum character of HRTF. J4, 2010, 44(9): 1730-1737.

链接本文:

http://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2010.09.017 或 http://www.zjujournals.com/eng/CN/Y2010/V44/I9/1730

［1］ BREEBAART J, SCHUIJERS E. Phantom materialization: a novel method to enhance stereo audio reproduction on headphones ［J］. IEEE Transactions on Audio Speech and Language Processing, 2008, 16 (8): 15031511．
［2］ LIN W, FULIANG Y, ZHE C. An "out of head" sound field enhancement system for headphone ［C］∥Neural Networks and Signal Processing, 2008 International Conference on.［S.l.］: ［s. n.］, 2008: 517521．
［3］ PANAHPOUR T M, NIWA K, FUKUSHIMA N, et al. 3DAV integrated system featuring arbitrary listeningpoint and viewpoint generation ［C］∥Multimedia Signal Processing, 2008 IEEE 10th Workshop on.［S.l.］: ［s. n.］, 2008: 855860．
［4］程佩青. 数字信号处理教程［M］. 北京: 清华大学出版社, 2001:177183．
［5］ ISO/IEC 144963. Information technology Coding of audiovisual objects Part 3: Audio ［S］. ［S.l.］: ［s. n.］, 2005．
［6］ LANCIANI C A, SCHAFER R W. Subbanddomain filtering of MPEG audio signals ［C］∥Acoustics, Speech, and Signal Processing, 1999. ICASSP’99. Proceedings, 1999 IEEE International Conference on.［S.l.］: IEEE, 1999, 2: 917920．
［7］ TOUIMI A B. A generic framework for filtering in subbanddomain ［C］∥IEEE 9th Workshop on Digital Signal Processing. ［S.l.］: IEEE, 2000．
［8］ YU R, ROBINSON C, CHENG C. Lowcomplexity binaural decoding using time/frequency domain HRTF equalization ［M］ ∥ Advances in multimedia modeling. ［S.l.］: ［s. n.］, 2006:545556．
［9］ SURESH K, SREENIVAS T V. Linear filtering in DCT IV/DST IV and MDCT/MDST domain ［J］. Signal Processing, 2009, 89 (6): 10811089．
［10］ HENRIQUE S M. Signal processing with lapped transforms ［M］. ［S. l.］: Artech House, Inc., 1992．
［11］ SMITH J O. Spectral audio signal processing ［EB/OL］. ［20100802］. http:∥ccrma.stanford.edu/～jos/sasp/, March 2007 Draft．
［12］ LEVINE S N. Audio representations for data compression and compressed domain processing ［D］. Stanford: Stanford University, 1999．
［13］ MENGYAO Z, WEI Z, DONGXIAO L, et al. An accurate low complexity algorithm for frequency estimation in MDCT domain ［J］. Consumer Electronics, IEEE Transactions on, 2008, 54 (3): 10221028．
［14］ ITUR BS.7751. Multichannel stereophonic sound system with and without accompanying picture ［S］. ［S.l.］: ［s. n.］, 2006．
［15］ GARDNER B, MARTIN K. HRTF measurements of a KEMAR dummyhead microphone ［EB/OL］. \[20090602\]. http:∥sound.media.mit.edu/resources/KEMAR.html．
［16］ KAWANO S, TAIRA M, MATSUDAIRA M, et al. Development of the virtual sound algorithm ［J］. Consumer Electronics, IEEE Transactions on, 1998, 44 (3): 11891194．
［17］ Rec. ITUR BS.15341. Method for the subjective assessment of intermediate quality level of coding systems ［S］. ［S.l.］: ［s. n.］, 2003．
［18］ GOODWIN M M, WOLTERS M, SRIDHARAN R. Postprocessing and computation in parametric and transform audio coders ［C］∥ 22nd International Conference: Virtual, Synthetic, and Entertainment Audio. ［S.l.］: ［s. n.］, 2002.

No related articles found!

Viewed

Full text

Abstract

Cited

Shared

Discussed