基于对比学习的声源定位引导视听分割模型
黄文湖,赵邢,谢亮,梁浩然,梁荣华

Contrastive learning-based sound source localization-guided audio-visual segmentation model
Wenhu HUANG,Xing ZHAO,Liang XIE,Haoran LIANG,Ronghua LIANG
表 3 去除池化层引起的特征尺寸变化对实验结果的影响
Tab.3 Effect of characteristic size change caused by removal of pool layer on experimental results
去除/保留
最大池
化层
AVSegFormerSSL2AVS
S4MS3S4MS3
$ {M_{\text{J}}} $/%$ {M_{\text{F}}} $/%$ {M_{\text{J}}} $/%$ {M_{\text{F}}} $/%$ {M_{\text{J}}} $/%$ {M_{\text{F}}} $/%$ {M_{\text{J}}} $/%$ {M_{\text{F}}} $/%
去除76.1186.443.4158.077.1686.856.1866.9
保留76.4585.949.5362.876.8786.552.4963.3