Computer Technology, Telecommunications Technology |
Emotional speaker recognition based on similar neighbor phenomenon |
CHEN Li, YANG Ying-chun |
College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China |