基于对比学习的声源定位引导视听分割模型
黄文湖,赵邢,谢亮,梁浩然,梁荣华

Contrastive learning-based sound source localization-guided audio-visual segmentation model
Wenhu HUANG,Xing ZHAO,Liang XIE,Haoran LIANG,Ronghua LIANG
表 7 特征增强模块对模型性能的影响
Tab.7 Impact of feature enhancement module on model performance
特征增强图像编码器$N_{\mathrm{p}} / 10^6 $S4MS3
$ {M_{\mathrm{J}}} $/%$ {M_{\mathrm{F}}} $/%$ {M_{\mathrm{J}}} $/%$ {M_{\mathrm{F}}} $/%
ResNet-50120.7577.5987.553.3764.0
PVT v2177.9884.2491.459.9671.6
ResNet-50136.4578.6888.059.5069.8
PVT v2179.7284.4391.665.1675.6