Contrastive learning-based sound source localization-guided audio-visual segmentation model
Wenhu HUANG, Xing ZHAO, Liang XIE, Haoran LIANG, Ronghua LIANG

Fig. 3 Comparison of audio-visual segmentation (AVS) results with and without the pooling layer, pretraining, and ACT activation