Multi branch Siamese network target tracking based on double attention mechanism

doi:10.3785/j.issn.1008-973X.2023.07.005

Journal of ZheJiang University (Engineering Science)

2023, Vol. 57

Issue (7): 1307-1316 DOI: 10.3785/j.issn.1008-973X.2023.07.005

Multi branch Siamese network target tracking based on double attention mechanism

Xiao-yan LI(

),Peng WANG*(

),Jia GUO,Xue LI,Meng-yu SUN

School of Electronic and Information Engineering, Xi’an Technological University, Xi’an 710021, China

Download:

HTML

PDF(2692KB) HTML
Export: BibTeX | EndNote (RIS)

Abstract

A multi branch Siamese network target tracking algorithm based on dual attention mechanism was proposed in order to solve the problem of inaccurate positioning in the SiamRPN++ single target tracking algorithm when the target was briefly occluded and the appearance drastically changed. SiamRPN++ with lightweight backbone network was adopted as the basic algorithm. The algorithm was combined with lightweight channel and spatial attention mechanism in order to improve the anti-interference ability when dealing with occlusion challenges during the tracking process. A template branch was added from the previous frame, and the appearance changes of the target were dynamically updated. The ability to distinguish between foreground and background was enhanced during the tracking process using triplet loss. Local expansion search was conducted based on the speed of the target’s movement in order to enable timely and accurate tracking of the target even after short-term occlusion. The experimental results showed that the improved algorithm improved the success rate and precision of the OTB100 dataset by 2.4% and 1.6%, respectively, compared to the original algorithm. The average center position error decreased by 28.97 pixels, and the average overlap rate increased by 14.5%.

Key words： Siamese network attention mechanism template update local enlargement

Received: 20 June 2022 Published: 17 July 2023

CLC:

TP 391

Fund: 国家自然基金资助项目（62171360）；陕西省科技厅重点研发计划资助项目（2022GY-110）；2023年陕西省高校工程研究中心资助项目；2022年度陕西高校青年创新团队资助项目；山东省智慧交通重点实验室（筹）

Corresponding Authors: Peng WANG E-mail: 76469715@qq.com;wang_peng@xatu.edu.cn

	Service
	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	Xiao-yan LI
	Peng WANG
	Jia GUO
	Xue LI
	Meng-yu SUN

Cite this article:

Xiao-yan LI,Peng WANG,Jia GUO,Xue LI,Meng-yu SUN. Multi branch Siamese network target tracking based on double attention mechanism. Journal of ZheJiang University (Engineering Science), 2023, 57(7): 1307-1316.

URL:

https://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2023.07.005 OR https://www.zjujournals.com/eng/Y2023/V57/I7/1307

基于双注意力机制的多分支孪生网络目标跟踪

为了解决SiamRPN++单目标跟踪算法在目标被短时遮挡及外观剧烈变化时定位不准确的问题，提出基于双注意力机制的多分支孪生网络目标跟踪算法. 采用具有轻量化主干网络的SiamRPN++为基础算法，结合轻量化的通道和空间注意力机制，提升跟踪过程中应对遮挡挑战时的抗干扰能力. 新增上一帧模板分支，动态更新目标外观变化，利用三元组损失增强跟踪过程中前景与背景的判别能力. 根据目标的移动速度进行局部扩大搜索，使目标被短时遮挡后仍可以及时、准确地跟踪到目标. 实验结果表明，改进后的算法在OTB100数据集的成功率和精确度较原算法分别提高了2.4%和1.6%，平均中心位置误差降低了28.97个像素，平均重叠率提高了14.5%.

关键词： 孪生网络, 注意力机制, 模板更新, 局部扩大

Fig.1 Overall structure diagram of proposed algorithm model

Fig.2 RPN structure diagram combined with dual attention

Fig.3 Structure diagram of channel attention model

Fig.4 Structure diagram of global context attention module

Fig.5 Schematic diagram of template update

Tab.1 Tracking video sequence and attributes

Fig.6 Results of qualitative analysis

Fig.7 Curve chart of center position error results

Fig.8 Overlap rate result curve

Tab.2 Average center position error and overlap rate for 10 tracking video sequences

Fig.9 Curve of success rate and precision results under OTB100 data set

Fig.10 Overall success rate result curve under OTB100 dataset including occlusion and deformation attributes

Fig.11 Overall precision result curve under OTB100 dataset including occlusion and deformation attributes


[1]	BERTINETTO L, VALMADRE J, HENRIQUE J F, et al. Fully-convolutional siamese networks for object tracking [C]// European Conference on Computer Vision. Berlin: Springer, 2016: 850-865.

[2]	VALMADRE J, BERTINETTO L, HENRIQUES J, et al. End-to-end representation learning for correlation filter based tracking [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Hawaii: IEEE, 2017: 2805-2813.

[3]	HE A, LUO C, TIAN X, et al. A twofold Siamese network for real-time object tracking [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 4834-4843.

[4]	LI B, WU W, WANG Q, et al. SiamRPN++: evolution of siamese visual tracking with very deep networks [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019: 4282-4291.

[5]	HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 770-778.

[6]	WANG Q, ZHANG L, BERTINETTO L, et al. Fast online object tracking and segmentation: a unifying approach [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019: 1328-1338.

[7]	XU Y, WANG Z, LI Z, et al. SiamFC++: towards robust and accurate visual tracking with target estimation guidelines [C]// Proceedings of the AAAI Conference on Artificial Intelligence. New York: AAAI, 2020: 12549-12556.

[8]	VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need [EB/OL]. [2022-06-15].https://xueshu.baidu.com/usercenter/paper/show?paperid=93f237b1172b174c55f3bdfd91d2f2d2&site=xueshu_se&apm;hitarticle=1, 2022.6.10.

[9]	WANG Q L, WU B G, ZHU P F, et al. ECA-Net: efficient channel attention for deep convolutional neural networks [C]// Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2020: 11531-11539.

[10]	HU J, SHEN L, SUN G. Squeeze-and-excitation networks [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 7132-7141.

[11]	CAO Y, XU J, LIN S, et al. GCNet: non-local networks meet squeeze-excitation networks and beyond [C]// Proceedings of the IEEE International Conference on Computer Vision Workshops. Seoul: IEEE, 2019.

[1]	Jun HAN,Xiao-ping YUAN,Zhun WANG,Ye CHEN. UAV dense small target detection algorithm based on YOLOv5s[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(6): 1224-1233.

[2]	Xue-yong XIANG,Li WANG,Wen-peng ZONG,Guang-yun LI. Point cloud instance segmentation based on attention mechanism KNN and ASIS module[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(5): 875-882.

[3]	Yu-ting SU,Rong-xuan LU,Wei ZHANG. Vehicle re-identification algorithm based on attention mechanism and adaptive weight[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(4): 712-718.

[4]	Bai-cheng BIAN,Tian CHEN,Ru-jun WU,Jun LIU. Improved YOLOv3-based defect detection algorithm for printed circuit board[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(4): 735-743.

[5]	Yan-fen CHENG,Jia-jun WU,Fan HE. Aspect level sentiment analysis based on relation gated graph convolutional network[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(3): 437-445.

[6]	Fan YANG,Bo NING,Huai-qing LI,Xin ZHOU,Guan-yu LI. Multimodal image retrieval model based on semantic-enhanced feature fusion[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(2): 252-258.

[7]	Chao LIU,Bing KONG,Guo-wang DU,Li-hua ZHOU,Hong-mei CHEN,Chong-ming BAO. Deep clustering via high-order mutual information maximization and pseudo-label guidance[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(2): 299-309.

[8]	Lin-tao WANG,Qi MAO. Position measurement method for tunnel segment grabbing based on RGB and depth information fusion[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(1): 47-54.

[9]	Li-zhou FENG,Yang YANG,You-wei WANG,Gui-jun YANG. New method for news recommendation based on Transformer and knowledge graph[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(1): 133-143.

[10]	Kun HAO,Kuo WANG,Bei-bei WANG. Lightweight underwater biological detection algorithm based on improved Mobilenet-YOLOv3[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(8): 1622-1632.

[11]	Ren-peng MO,Xiao-sheng SI,Tian-mei LI,Xu ZHU. Bearing life prediction based on multi-scale features and attention mechanism[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(7): 1447-1456.

[12]	You-wei WANG,Shuang TONG,Li-zhou FENG,Jian-ming ZHU,Yang LI,Fu CHEN. New inductive microblog rumor detection method based on graph convolutional network[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(5): 956-966.

[13]	Xiao-chen JU,Xin-xin ZHAO,Sheng-sheng QIAN. Self-attention mechanism based bridge bolt detection algorithm[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(5): 901-908.

[14]	Xue-qin ZHANG,Tian-ren LI. Breast cancer pathological image classification based on Cycle-GAN and improved DPN network[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(4): 727-735.

[15]	Meng XU,Dan WANG,Zhi-yuan LI,Yuan-fang CHEN. IncepA-EEGNet: P300 signal detection method based on fusion of Inception network and attention mechanism[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(4): 745-753, 782.

Viewed

Full text

Abstract

Cited

Shared

Discussed