Contrastive learning-based sound source localization-guided audio-visual segmentation model

Wenhu HUANG (黄文湖), Xing ZHAO (赵邢), Liang XIE (谢亮), Haoran LIANG (梁浩然), Ronghua LIANG (梁荣华)

Fig. 3 Comparison of audio-visual segmentation (AVS) results with and without pooling-layer removal, pretraining, and ACT activation