基于注意力机制和编码-解码架构的施工场景图像描述方法
农元君,王俊杰,陈红,孙文涵,耿慧,李书悦

A image caption method of construction scene based on attention mechanism and encoding-decoding architecture
Yuan-jun NONG,Jun-jie WANG,Hong CHEN,Wen-han SUN,Hui GENG,Shu-yue LI
表 2 不同方法在施工图像描述数据集中的实验结果
Tab.2 Experiment results of different methods in image caption data set of construction scene
方法 主干网络 BLEU-1 BLEU-2 BLEU-3 BLEU-4 METEOR ROUGE_L CIDEr
NIC[17] VGG-16 0.725 0.542 0.386 0.295 0.248 0.531 0.854
Adaptive[18] VGG-16 0.738 0.556 0.403 0.319 0.259 0.545 0.887
Self-critic[19] ResNet-101 0.751 0.573 0.437 0.332 0.266 0.558 0.913
Up-down[20] ResNet-101 0.764 0.587 0.455 0.344 0.271 0.572 0.946
本研究方法 ResNet-101 0.783 0.608 0.469 0.357 0.293 0.586 0.962