基于视觉关系推理与上下文门控机制的图像描述
陈巧红,裴皓磊,孙麒
Image caption based on relational reasoning and context gate mechanism
Qiao-hong CHEN,Hao-lei PEI,Qi SUN
表 4
Flickr30k数据集实验性能对比
Tab.4
Comparison of experimental results on Flickr30k caption dateset
模型
BLEU1
BLEU4
METEOR
CIDEr
Hard-Attention
66.9
19.9
18.5
—
GL-Att
68.1
25.7
18.9
—
LRCA
69.8
27.7
21.5
57.4
Adaptive
67.7
25.1
20.4
53.1
NBT
69.0
27.1
21.7
57.5
本研究(XE)
73.6
30.1
23.8
60.2