基于卷积和门控注意的两阶段视听语音增强算法
王盼蓉,贾海蓉,段淑斐

Two-stage audio-visual speech enhancement algorithm based on convolution and gated attention
Panrong WANG,Hairong JIA,Shufei DUAN
表 5 闹钟噪声5种信噪比下各模型的语音增强效果对比
Tab.5 Comparison of speech enhancement effect of various model under five signal-to-noise ratios of alarm noise
模型PESQSNR/dB
−10 dB−5 dB0 dB5 dB10 dB−10 dB−5 dB0 dB5 dB10 dB
Noisy1.701.922.252.422.64−10−50510
AV-ConvTasnet2.352.322.552.602.654.394.504.504.294.32
MuSE2.352.552.652.692.736.276.806.916.476.38
AV-Sepformer2.672.812.862.922.936.776.796.826.626.76
本文模型2.883.083.253.353.4512.8913.5714.3314.1214.39