Worker behavior recognition based on temporal and spatial self-attention of vision Transformer
Yu-xiang LU, Guan-hua XU, Bo TANG
Tab.2 Recognition accuracy of proposed model for UCF101 video categories (unit: %)

Video category                  Acc_min  Acc_max  Acc_avg
Human-object interaction        92.62    93.70    93.16
Body motion only                92.19    92.26    92.28
Human-human interaction         96.89    96.96    96.93
Playing musical instruments     97.76    98.50    98.13
Sports                          92.64    93.53    93.09