基于视觉关系推理与上下文门控机制的图像描述
陈巧红,裴皓磊,孙麒

Image caption based on relational reasoning and context gate mechanism
Qiao-hong CHEN,Hao-lei PEI,Qi SUN
表 4 Flickr30k数据集实验性能对比
Tab.4 Comparison of experimental results on Flickr30k caption dateset
模型 BLEU1 BLEU4 METEOR CIDEr
Hard-Attention 66.9 19.9 18.5
GL-Att 68.1 25.7 18.9
LRCA 69.8 27.7 21.5 57.4
Adaptive 67.7 25.1 20.4 53.1
NBT 69.0 27.1 21.7 57.5
本研究(XE) 73.6 30.1 23.8 60.2