基于语义增强特征融合的多模态图像检索模型
|
杨帆,宁博,李怀清,周新,李冠宇
|
Multimodal image retrieval model based on semantic-enhanced feature fusion
|
Fan YANG,Bo NING,Huai-qing LI,Xin ZHOU,Guan-yu LI
|
|
表 4 MIT-States 数据集上消融实验召回率结果对比 |
Tab.4 Comparison of ablation recall results on MIT-States dataset |
|
模型 | R@1 | R@5 | R@10 | % | SEFM (without-text semantic enhancement) | 13.4±0.7 | 35.2±0.8 | 47.6±1.0 | SEFM (without-image semantic enhancement) | 14.6±0.8 | 34.5±0.9 | 47.7±0.8 | SEFM(Lbase) | 14.7±0.7 | 35.7±0.5 | 46.2±0.7 | SEFM(Lbase+LRI) | 14.7±0.6 | 34.9±0.5 | 46.8±0.7 | SEFM(Lbase+LRT) | 14.9±0.6 | 36.2±0.5 | 47.5±0.7 | SEFM | 15.5±0.8 | 37.7±1.0 | 49.6±1.0 |
|
|
|