Low-resource speech synthesis using a ConvNeXt decoder and fundamental frequency prediction
Meng WANG, Jian YANG

Tab.4 Experimental results of the module-ablation study on the improved speech synthesis model
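| Model | Vietnamese MOS (↑) | Vietnamese MCD (↓) | Burmese MOS (↑) | Burmese MCD (↓) | Thai MOS (↑) | Thai MCD (↓) |
|---|---|---|---|---|---|---|
| Ground-truth audio | 4.79 | - | 4.46 | - | 4.65 | - |
| Improved model | 4.45 | 4.96 | 3.44 | 7.99 | 4.10 | 4.66 |
| 1) | 3.78 | 5.21 | 3.21 | 9.01 | 3.35 | 5.32 |
| 2) | 4.03 | 5.18 | 3.32 | 8.47 | 3.68 | 4.92 |
| 3) | 4.05 | 5.14 | 3.35 | 8.26 | 3.72 | 4.82 |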
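
For readers unfamiliar with the MCD column, the following is a minimal sketch of the commonly used mel-cepstral distortion formula, MCD = (10 / ln 10) · sqrt(2 · Σ_d (c_d − c'_d)²), averaged over frames. It is an illustration only. The function name `mel_cepstral_distortion`, the exclusion of the 0th (energy) coefficient, and the assumption that the two frame sequences are already time-aligned are choices made here; this section does not state the cepstral order or alignment procedure behind the reported values.

```python
import math


def mel_cepstral_distortion(ref_frames, syn_frames):
    """Mean mel-cepstral distortion in dB over time-aligned frames.

    ref_frames, syn_frames: equally long sequences of per-frame
    mel-cepstral coefficient lists. Index 0 (energy) is skipped,
    which is a common convention and an assumption of this sketch.
    """
    if len(ref_frames) != len(syn_frames) or len(ref_frames) == 0:
        raise ValueError("frame sequences must be non-empty and equally long")

    scale = 10.0 / math.log(10.0)
    total = 0.0
    for ref, syn in zip(ref_frames, syn_frames):
        # Squared Euclidean distance over coefficients 1..D
        squared = sum((r - s) ** 2 for r, s in zip(ref[1:], syn[1:]))
        total += scale * math.sqrt(2.0 * squared)
    return total / len(ref_frames)


if __name__ == "__main__":
    # Toy, made-up coefficients purely to exercise the function
    reference = [[0.0, 1.0, 0.5], [0.0, 0.8, 0.4]]
    synthesized = [[0.0, 0.9, 0.6], [0.0, 0.7, 0.2]]
    print(mel_cepstral_distortion(reference, synthesized))
```

In practice the synthesized and reference utterances differ in length, so an alignment step such as dynamic time warping is usually applied before a frame-wise comparison like this one.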