Visual object tracking based on policy gradient

doi:10.3785/j.issn.1008-973X.2020.10.008

Journal of ZheJiang University (Engineering Science)

2020, Vol. 54

Issue (10): 1923-1928 DOI: 10.3785/j.issn.1008-973X.2020.10.008

Visual object tracking based on policy gradient

Kang-hao WANG(

),Hai-bing YIN*(

),Xiao-feng HUANG

College of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China

Download:

HTML

PDF(847KB) HTML
Export: BibTeX | EndNote (RIS)

Abstract

An object tracking method based on policy gradient was proposed aiming at the problems of occlusion, deformation and fast motion in the process of object tracking. The policy gradient algorithm was used to train the policy network. The policy network can make action decisions founded on the reliability of current tracking results to avoid the incorrect template update or re-detect the missing targets. During the decision-making process, the robustness and accuracy of the current tracking result were both analyzed by calculating the weighted confidence margin, which helped the policy network to evaluate the tracking results more accurately. During the re-detection process, an efficient re-detection method was proposed to filter a large number of searching areas, which greatly improved the search efficiency. The decision-making module was utilized to examine the re-detected result, which ensured the accuracy of the re-detected results. The proposed algorithm was evaluated on OTB dataset and LaSOT dataset. The experimental results show that the proposed tracking algorithm improves performance by 2.5%-4.0% based on the original algorithm.

Key words： visual object tracking decision making policy gradient re-detection template update

Received: 05 September 2019 Published: 28 October 2020

CLC:

TP 391

Corresponding Authors: Hai-bing YIN E-mail: wangkh@hdu.edu.cn;yhb@hdu.edu.cn

	Service
	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	Kang-hao WANG
	Hai-bing YIN
	Xiao-feng HUANG

Cite this article:

Kang-hao WANG,Hai-bing YIN,Xiao-feng HUANG. Visual object tracking based on policy gradient. Journal of ZheJiang University (Engineering Science), 2020, 54(10): 1923-1928.

URL:

http://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2020.10.008 OR http://www.zjujournals.com/eng/Y2020/V54/I10/1923

基于策略梯度的目标跟踪方法

针对目标跟踪过程中的遮挡、形变和快速运动等问题，提出基于策略梯度的目标跟踪方法. 该方法利用策略梯度算法训练策略网络. 该策略网络能够根据当前跟踪结果的可靠性进行动作决策，以避免错误的模板更新或者重新检测丢失的目标. 在决策过程中，通过计算加权置信度差值分析当前跟踪结果的鲁棒性和准确性，使得策略网络能够更准确地评估跟踪结果. 在重检测过程中，提出有效的重检测方法，对大量的搜索区域进行过滤，大大提高了搜索效率，利用决策模块检验重检测结果，确保重检测结果的准确性. 利用提出的算法在OTB数据集及LaSOT数据集上进行评估. 实验结果表明，提出的跟踪算法在原算法的基础上提高了2.5%~4.0%的性能.

关键词： 目标跟踪, 决策, 策略梯度, 重检测, 模板更新

Fig.1 Overall framework of proposed algorithm

Fig.2 Markov decision process

Fig.3 OPE result comparison on OTB benchmark dataset

Tab.1 Performance of proposed algorithm with different settings

Fig.4 Tracking results of video sequences


[1]	熊昌镇, 王润玲, 邹建成基于多高斯相关滤波的实时跟踪算法[J]. 浙江大学学报: 工学版, 2019, 53 (8): 1488- 1495 XIONG Chang-zhen, WANG Run-ling, ZOU Jian-cheng Real time tracking algorithm based on multi Gaussian correlation filtering[J]. Journal of Zhejiang University: Engineering Science, 2019, 53 (8): 1488- 1495

[2]	WANG N, ZHOU W, LI H Reliable re-detection for long-term tracking[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2019, 29 (3): 730- 743 doi: 10.1109/TCSVT.2018.2816570

[3]	MA C, YANG X, ZHANG C, et al. Long-term correlation tracking [C]// Proceedings of CVPR. Boston: IEEE, 2015: 5388-5396.

[4]	BOLME D, BEVERIDGE J, DRAPER B, et al. Visual object tracking using adaptive correlation filters [C]// Proceedings of CVPR. San Francisco: IEEE, 2010: 2544-2550.

[5]	WANG M, LIU Y, HUANG Z. Large margin object tracking with circulant feature maps [C]// Proceedings of CVPR. Hawaii: IEEE, 2017: 4021-4029.

[6]	HUANG C, LUCEY S, RAMANAN D. Learning policies for adaptive tracking with deep feature cascades [C]// Proceedings of ICCV. Venice: IEEE, 2017: 105-114.

[7]	CHOI J, KWON J, LEE K Real-time visual tracking by deep reinforced decision making[J]. Computer Vision and Image Understanding, 2018, 171 (2): 10- 19

[8]	SUPANCIC J, RAMANAN D. Tracking as online decision-making: learning a policy from streaming videos with reinforce-ment learning [C]// Proceedings of ICCV. Venice: IEEE, 2017: 322-331.

[9]	BERTINETTO L, VALMADRE J, HENRIQUES J F, et al. Fully-convolutional Siamese networks for object tracking [C]// Proceedings of ECCV. Amsterdam: Springer, 2016: 850–865.

[10]	HAUSKNECHT M, STONE P. Deep recurrent Q-learning for partially observable MDPs [C]// Proceedings of AAAI. Austin: Springer, 2015: 29-37.

[11]	BHAT G, JOHNANDER J, DANELLJAN M, et al. Unveiling the power of deep tracking [C]// Proceedings of ECCV. Munich: Springer, 2018: 483-498.

[12]	江宝安, 卢焕章粒子滤波器及其在目标跟踪中的应用[J]. 雷达科学与技术, 2003, (3): 170- 174 JIANG Bao-an, LU Huan-zhang Particle filter and its application in object tracking[J]. Radar Science and Technology, 2003, (3): 170- 174 doi: 10.3969/j.issn.1672-2337.2003.03.010

[13]	FAN H, LIN L, YANG F, et al. LaSOT: a high-quality benchmark for large-scale single object tracking [C]// Proceedings of CVPR. Long Beach: IEEE, 2019: 5374-5383.

[14]	WU Y, LIM J, YANG M. Online object tracking: a benchmark [C]// Proceedings of CVPR. Portland: IEEE, 2013: 2411-2418.

[15]	HENRIQUES J, CASEIRO R, MARTINS P, et al High speed tracking with kernelized correlation filters[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37 (3): 583- 596 doi: 10.1109/TPAMI.2014.2345390

[16]	DANELLJAN M, HAGER G, KHAN F, et al. Accurate scale estimation for robust visual tracking [C]// Proceedings of British Machine Vision Conference. Nottingham: BMVA, 2014: 1–11.

[17]	LI Y, ZHU J. A scale adaptive kernel correlation filter tracker with feature integration [C]// Proceedings of ECCV. Heidelberg: Springer, 2014: 254–265.

[1]	Peng SONG,De-dong YANG,Chang LI,Chang GUO. An adaptive siamese network tracking algorithm based on global feature channel recognition[J]. Journal of ZheJiang University (Engineering Science), 2021, 55(5): 966-975.

[2]	YU Yang, XIA Chun-he, HU Xiao-yun. Defense scheme generation method using mixed path attack graph[J]. Journal of ZheJiang University (Engineering Science), 2017, 51(9): 1745-1759.

[3]	WANG Ren chao, DENG Jian, CAO Ting ting. Selection of construction project schedule alternatives based on improved VIKOR method[J]. Journal of ZheJiang University (Engineering Science), 2016, 50(10): 2025-2030.

[4]	PEI Zhi, LU Jian-sha, ZHENG Li. Yard allocation decision making in container terminals with generalized intuitionistic fuzzy numbers[J]. Journal of ZheJiang University (Engineering Science), 2012, 46(12): 2274-2279.

Viewed

Full text

Abstract

Cited

Shared

Discussed