基于数字水印的人脸与声纹融合识别算法

doi:10.3785/j.issn.1008-973X.2015.01.002

浙江大学学报(工学版)

自动化技术、信息技术

基于数字水印的人脸与声纹融合识别算法

王骕1,胡浩基1,于慧敏1,DAMPER R I2

1.浙江大学信息与电子工程学系,浙江杭州 310027； 2.南安普顿大学电子与计算机科学系,英国

Augmenting remote multimodal person verification by embedding voice characteristics into face images

WANG Su1, HU Hao-ji1, YU Hui-min1, DAMPER R I 2

1. Department of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China; 2. Department of Electronics and Computer Science, University of Southampton, SO17 1BJ, UK

全文: PDF(1678 KB) HTML

摘要：

提出远程多模态的生物特征数字水印算法,将声音特征作为水印加入到人脸图像中.运用文献［1］提出的改进型量化索引调制（QIM）方法,算法加入一个脆弱型的水印用于篡改检测,同时加入一个鲁棒型水印用于隐藏声音的高斯混合模型（GMM）参数.利用人脸、声纹和多模态识别算法,提出的方法能够实现对篡改的检测,对常见的攻击,例如图片缩放、高斯噪声、模糊化、伽马校正和JPEG压缩等具有鲁棒性.在由295人组成的XM2VTS数据库上,该多模态系统能够获得95.93%的识别率,同时获得3.19%的等错误率.

Abstract:

A novel biometric watermarking algorithm was proposed to augment remote multimodal recognition by embedding voice characteristics into face images. Using the modified quantization index modulation（QIM） scheme proposed by reference ［1］, the algorithm embedded both a fragile watermark for tampering detection, and a robust watermark to represent the Gaussian mixture model (GMM) parameters extracted from voice. Using face, voice and multimodal recognition algorithms, the proposed watermarking scheme can detect tampering, and is robust to watermarking attacks such as resizing, Gaussian noise, blurring, Gamma correction and JPEG compression. On the XM2VTS database consisting of 295 persons, the multimodal system can obtain recognition rate of 95.93% for identification, and equal error rate of 3.19% for verification.Key words: face recognition| speaker recognition| digital watermarking| quantization index modulation（QIM）

出版日期: 2018-06-06

TP 391

基金资助:

国家自然科学基金资助项目（61202400)

通讯作者: 胡浩基, 男, 副教授 E-mail: haoji_hu@zju.edu.cn

作者简介: 王骕（1991-），男，博士生，从事模式识别的研究.E-mail:su_wang@zju.edu.cn

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章

引用本文:

王骕,胡浩基,于慧敏,DAMPER R I. 基于数字水印的人脸与声纹融合识别算法[J]. 浙江大学学报(工学版), 10.3785/j.issn.1008-973X.2015.01.002.

WANG Su, HU Hao-ji, YU Hui-min, DAMPER R I. Augmenting remote multimodal person verification by embedding voice characteristics into face images. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 10.3785/j.issn.1008-973X.2015.01.002.

链接本文:

http://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2015.01.002 或 http://www.zjujournals.com/eng/CN/Y2015/V49/I1/6

［1］ MILLER B. Vital signs of identity biometrics ［J］. Spectrum, IEEE, 1994, 31(2): 22-30．
［2］ DONG J, TAN T. Effects of watermarking on iris recognition performance ［C］∥Proceedings of the 10th International Conference on Control, Automation, Robotics and Vision. Hanoi, Vietnam: ［s.n.］, 2008: 1156-1161.
［3］ JAIN A K, ULUDAG U. Hiding biometric data ［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2003, 25 (11): 1494-1498.
［4］ SATONAKA T. Biometric watermark authentication with multiple verification rule ［C］∥Proceedings of the 12th IEEE Workshop on Neural Networks in Signal Processing. Martigny Valais: IEEE, 2002: 597-606.
［5］ LI C, WANG Y, MA B, et al. Tamper detection and self-recovery of biometric images using salient region-based authentication watermarking scheme ［J］. Computer Standards and Interfaces, 2012, 34(4): 367-379．
［6］ ULUDAG U, PANKANTI S, PRABHAKAR S, et al. Biometric cryptosystems: issues and challenges ［J］. Proceedings of the IEEE, 2004, 92(6): 948-960．
［7］ RATHGEB C, UHL A. A survey on biometric cryptosystems and cancelable biometrics ［J］. EURASIP Journal on Information Security, 2011, 2011(1): 125．
［8］ REYNOLDS D A, ROSE R C. Robust text-independent speaker identification using Gaussian mixture speaker models ［J］. IEEE Transactions on Speech and Audio Processing, 1995, 3(1): 72-83.
［9］ LI Q, COX I J. Using perceptual models to improve fidelity and provide resistance to valumetric scaling for quantization index modulation watermarking ［J］. IEEE Transactions on Information Forensics and Security, 2007, 2(2): 127-139.
［10］ CHEN B, WORNELL G W. Quantization index modulation: a class of provably good methods for digital watermarking and information embedding ［J］. IEEE Transactions on Information Theory, 2001, 47(4): 1423-1443．
［11］ COX I J, MILLER M L, BLOOM J A. Digital watermarking and steganography ［M］. Burlington: Morgan Kaufmann Publishers, 2008.
［12］ LOWE D G. Object recognition from local scale-invariant features ［C］∥Proceedings of the International Conference on Computer Vision. Corfu: ［s. n.］, 1999: 1150-1157.
［13］ LOWE D G. Distinctive image features from scale invariant key points ［J］. International Journal of Computer Vision, 2004, 60(2): 91-110．
［14］ MESSER K, MATAS J, KITTLER J, et al. XM2VTSDB: the extended M2VTS database ［C］∥Proceedings of the 2nd International Conference on Audio and Video-based Biometric Person Authentication. Washington, DC: ［s. n.］, 1999: 72-77.
［15］ WANG Z, BOVIK A C. Image quality assessment: from error visibility to structural similarity ［J］. IEEE Transactions on Image Processing, 2004, 13(4): 600-612．
［16］ FAWCETT T. An introduction to ROC analysis ［J］. Pattern Recognition Letters, 2006, 27(8): 861-874.

[1]	何雪军, 王进, 陆国栋, 刘振宇, 陈立, 金晶. 基于三角网切片及碰撞检测的工业机器人三维头像雕刻[J]. 浙江大学学报(工学版), 2017, 51(6): 1104-1110.
[2]	王桦, 韩同阳, 周可. 公安情报中基于关键图谱的群体发现算法[J]. 浙江大学学报(工学版), 2017, 51(6): 1173-1180.
[3]	尤海辉, 马增益, 唐义军, 王月兰, 郑林, 俞钟, 吉澄军. 循环流化床入炉垃圾热值软测量[J]. 浙江大学学报(工学版), 2017, 51(6): 1163-1172.
[4]	毕晓君, 王佳荟. 基于混合学习策略的教与学优化算法[J]. 浙江大学学报(工学版), 2017, 51(5): 1024-1031.
[5]	王亮, 於志文, 郭斌. 基于双层多粒度知识发现的移动轨迹预测模型[J]. 浙江大学学报(工学版), 2017, 51(4): 669-674.
[6]	廖苗, 赵于前, 曾业战, 黄忠朝, 张丙奎, 邹北骥. 基于支持向量机和椭圆拟合的细胞图像自动分割[J]. 浙江大学学报(工学版), 2017, 51(4): 722-728.
[7]	黄正宇, 蒋鑫龙, 刘军发, 陈益强, 谷洋. 基于融合特征的半监督流形约束定位方法[J]. 浙江大学学报(工学版), 2017, 51(4): 655-662.
[8]	蒋鑫龙, 陈益强, 刘军发, 忽丽莎, 沈建飞. 面向自闭症患者社交距离认知的可穿戴系统[J]. 浙江大学学报(工学版), 2017, 51(4): 637-647.
[9]	穆晶晶, 赵昕玥, 何再兴, 张树有. 基于凹凸变换与圆周拟合的重叠气泡轮廓重构[J]. 浙江大学学报(工学版), 2017, 51(4): 714-721.
[10]	戴彩艳, 陈崚, 李斌, 陈伯伦. 复杂网络中的抽样链接预测[J]. 浙江大学学报(工学版), 2017, 51(3): 554-561.
[11]	刘磊, 杨鹏, 刘作军. 采用多核相关向量机的人体步态识别[J]. 浙江大学学报(工学版), 2017, 51(3): 562-571.
[12]	郭梦丽, 达飞鹏, 邓星, 盖绍彦. 基于关键点和局部特征的三维人脸识别[J]. 浙江大学学报(工学版), 2017, 51(3): 584-589.
[13]	王海军, 葛红娟, 张圣燕. 基于核协同表示的快速目标跟踪算法[J]. 浙江大学学报(工学版), 2017, 51(2): 399-407.
[14]	张亚楠, 陈德运, 王莹洁, 刘宇鹏. 基于增量图形模式匹配的动态冷启动推荐方法[J]. 浙江大学学报(工学版), 2017, 51(2): 408-415.
[15]	刘宇鹏, 乔秀明, 赵石磊, 马春光. 统计机器翻译中大规模特征的深度融合[J]. 浙江大学学报(工学版), 2017, 51(1): 46-56.

Viewed

Full text

Abstract

Cited

Shared

Discussed