结合年龄监督和人脸先验的语音-人脸图像重建
|
何立,庞善民
|
Face reconstruction from voice based on age-supervised learning and face prior information
|
Li HE,Shan-min PANG
|
|
表 1 本研究方法与主流方法实验结果对比 |
Tab.1 Experimental results of proposed method compared with popular methods |
|
模型 | 距离度量 | ResNet-50 | | VGG-16 | FID | Top-1/% | Top-5/% | Top-10/% | Top-1/% | Top-5/% | Top-10/% | random | − | 0.53 | 1.30 | 2.17 | | 0.53 | 1.30 | 2.17 | − | Speech2Face[13] | L1 | 0.61 | 2.59 | 4.44 | | 0.58 | 2.96 | 5.45 | 233.92 | cos | 0.56 | 2.59 | 4.60 | | 0.69 | 3.31 | 5.93 | Voice2Face[15] | L1 | 1.88 | 5.21 | 8.47 | | 1.30 | 6.06 | 10.79 | 51.45 | cos | 1.98 | 5.58 | 8.33 | | 1.32 | 5.66 | 10.90 | 仅生成模块 | L1 | 2.30 | 5.71 | 8.33 | | 1.38 | 6.46 | 11.08 | 38.60 | cos | 2.25 | 5.77 | 8.49 | | 1.69 | 6.48 | 11.53 | 本研究方法 | L1 | 2.59 | 5.98 | 9.20 | | 1.75 | 6.60 | 11.60 | 40.32 | cos | 2.32 | 5.81 | 9.17 | | 1.71 | 6.56 | 11.58 |
|
|
|