结合年龄监督和人脸先验的语音-人脸图像重建

结合年龄监督和人脸先验的语音-人脸图像重建

何立,庞善民

Face reconstruction from voice based on age-supervised learning and face prior information

Li HE,Shan-min PANG

表 1 本研究方法与主流方法实验结果对比

Tab.1 Experimental results of proposed method compared with popular methods

模型	距离度量	ResNet-50				VGG-16			FID
模型	距离度量	Top-1/%	Top-5/%	Top-10/%		Top-1/%	Top-5/%	Top-10/%	FID
random	−	0.53	1.30	2.17		0.53	1.30	2.17	−
Speech2Face^[13]	L1	0.61	2.59	4.44		0.58	2.96	5.45	233.92
Speech2Face^[13]	cos	0.56	2.59	4.60		0.69	3.31	5.93	233.92
Voice2Face^[15]	L1	1.88	5.21	8.47		1.30	6.06	10.79	51.45
Voice2Face^[15]	cos	1.98	5.58	8.33		1.32	5.66	10.90	51.45
仅生成模块	L1	2.30	5.71	8.33		1.38	6.46	11.08	38.60
仅生成模块	cos	2.25	5.77	8.49		1.69	6.48	11.53	38.60
本研究方法	L1	2.59	5.98	9.20		1.75	6.60	11.60	40.32
本研究方法	cos	2.32	5.81	9.17		1.71	6.56	11.58	40.32