基于语义增强特征融合的多模态图像检索模型
|
杨帆,宁博,李怀清,周新,李冠宇
|
Multimodal image retrieval model based on semantic-enhanced feature fusion
|
Fan YANG,Bo NING,Huai-qing LI,Xin ZHOU,Guan-yu LI
|
|
表 5 Fashion IQ 数据集上消融实验结果召回率结果对比 |
Tab.5 Comparison of ablation recall results on Fashion IQ dataset |
|
模型 | R@10 | dress | shirt | top&tee | % | SEFM (without-text semantic enhancement) | 10.2±0.4 | 10.2±0.2 | 11.0±0.2 | SEFM(without-image semantic enhancement) | 10.8±0.5 | 9.1±0.5 | 11.5±0.5 | SEFM(Lbase) | 11.2±0.3 | 10.7±0.3 | 11.6±0.3 | SEFM(Lbase+LRI) | 11.3±0.3 | 11.0±0.2 | 11.5±0.3 | SEFM(Lbase+LRT) | 11.6±0.4 | 11.3±0.3 | 11.7±0.4 | SEFM | 11.9±0.3 | 11.2±0.5 | 11.7±0.3 |
|
|
|