Multimodal image retrieval model based on semantic-enhanced feature fusion
Fan YANG, Bo NING, Huai-qing LI, Xin ZHOU, Guan-yu LI
Tab. 4 Comparison of recall results from the ablation experiments on the MIT-States dataset

| Model | R@1 (%) | R@5 (%) | R@10 (%) |
|---|---|---|---|
| SEFM (without text semantic enhancement) | 13.4±0.7 | 35.2±0.8 | 47.6±1.0 |
| SEFM (without image semantic enhancement) | 14.6±0.8 | 34.5±0.9 | 47.7±0.8 |
| SEFM (Lbase) | 14.7±0.7 | 35.7±0.5 | 46.2±0.7 |
| SEFM (Lbase+LRI) | 14.7±0.6 | 34.9±0.5 | 46.8±0.7 |
| SEFM (Lbase+LRT) | 14.9±0.6 | 36.2±0.5 | 47.5±0.7 |
| SEFM | 15.5±0.8 | 37.7±1.0 | 49.6±1.0 |