Two-stage audio-visual speech enhancement algorithm based on convolution and gated attention
Panrong WANG, Hairong JIA, Shufei DUAN
Table 4. Comparison of the speech enhancement performance of each model under ambulance noise at five signal-to-noise ratios
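| Model | PESQ (−10 dB) | PESQ (−5 dB) | PESQ (0 dB) | PESQ (5 dB) | PESQ (10 dB) | SNR/dB (−10 dB) | SNR/dB (−5 dB) | SNR/dB (0 dB) | SNR/dB (5 dB) | SNR/dB (10 dB) |
|---|---|---|---|---|---|---|---|---|---|---|
| Noisy | 1.76 | 2.09 | 2.28 | 2.37 | 2.55 | −10 | −5 | 0 | 5 | 10 |
| AV-ConvTasnet | 2.27 | 2.33 | 2.42 | 2.47 | 2.52 | 3.97 | 3.96 | 4.20 | 4.04 | 4.06 |
| MuSE | 2.32 | 2.42 | 2.54 | 2.62 | 2.67 | 5.92 | 6.13 | 6.47 | 6.24 | 6.17 |
| AV-Sepformer | 2.69 | 2.74 | 2.84 | 2.90 | 2.93 | 6.75 | 6.58 | 6.89 | 6.77 | 6.81 |
| Proposed model | 2.96 | 3.10 | 3.28 | 3.39 | 3.47 | 13.93 | 14.07 | 14.80 | 14.58 | 14.68 |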
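
To make the comparison in the table above easier to read at a glance, the following minimal sketch averages each model's PESQ gain over the noisy input and its output SNR across the five input SNRs. The numbers are copied from the table; the names `PESQ`, `SNR_DB`, `mean_pesq_gain`, and `mean_output_snr` are illustrative assumptions and not part of the paper, and averaging across input SNRs is only one possible summary, not a metric the authors report.

```python
# Values copied from Table 4 (ambulance noise).
# Each list is ordered by input SNR: -10, -5, 0, 5, 10 dB.
# "Proposed" stands for the paper's own model; all names here are illustrative.
PESQ = {
    "Noisy":         [1.76, 2.09, 2.28, 2.37, 2.55],
    "AV-ConvTasnet": [2.27, 2.33, 2.42, 2.47, 2.52],
    "MuSE":          [2.32, 2.42, 2.54, 2.62, 2.67],
    "AV-Sepformer":  [2.69, 2.74, 2.84, 2.90, 2.93],
    "Proposed":      [2.96, 3.10, 3.28, 3.39, 3.47],
}
SNR_DB = {
    "Noisy":         [-10, -5, 0, 5, 10],
    "AV-ConvTasnet": [3.97, 3.96, 4.20, 4.04, 4.06],
    "MuSE":          [5.92, 6.13, 6.47, 6.24, 6.17],
    "AV-Sepformer":  [6.75, 6.58, 6.89, 6.77, 6.81],
    "Proposed":      [13.93, 14.07, 14.80, 14.58, 14.68],
}


def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


def mean_pesq_gain(model):
    """Average PESQ improvement of `model` over the unprocessed noisy input."""
    gains = [p - n for p, n in zip(PESQ[model], PESQ["Noisy"])]
    return mean(gains)


def mean_output_snr(model):
    """Average output SNR (dB) of `model` across the five input SNRs."""
    return mean(SNR_DB[model])


if __name__ == "__main__":
    for model in PESQ:
        if model == "Noisy":
            continue
        print(f"{model:14s} mean PESQ gain = {mean_pesq_gain(model):.2f}, "
              f"mean output SNR = {mean_output_snr(model):.2f} dB")
```

Under this summary, the proposed model's average PESQ gain over the noisy input is about 1.03, against about 0.61 for AV-Sepformer, the strongest baseline in the table.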