复杂背景下的小目标检测算法

doi:10.3785/j.issn.1008-973X.2020.09.014

浙江大学学报(工学版)

2020, Vol. 54

Issue (9): 1777-1784 DOI: 10.3785/j.issn.1008-973X.2020.09.014

计算机技术

复杂背景下的小目标检测算法

郑浦1(

),白宏阳1,*(

),李伟2,郭宏伟1

1. 南京理工大学能源与动力工程学院，江苏南京 210094
2. 中国人民解放军96037部队，陕西宝鸡 721000

Small target detection algorithm in complex background

Pu ZHENG1(

),Hong-yang BAI1,*(

),Wei LI2,Hong-wei GUO1

1. School of Energy and Power Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
2. 96037 PLA Troops, Baoji 721000, China

全文: PDF(1443 KB) HTML

摘要：

提出一种改进的多类别单阶检测器（SSD）算法. 借鉴特征金字塔算法的思想，将Conv4-3层的特征与Conv7、Conv3-3层的特征进行融合，同时增加融合后特征图每个位置对应的默认框数量. 在网络结构中增加裁剪-权重分配网络（SENet），对每层的特征通道进行权重分配，提升有用的特征权重并抑制无效的特征权重. 为了增强网络的泛化能力，对训练数据集进行一系列增强处理. 实验结果表明，改进后的算法在VOC数据集（07+12）上的检测效果良好，平均精度均值为80.4%，比改进前的算法提高了2.7%；在COCO数据集（2017）上的平均精度均值为42.5%，比改进前的算法提高了2.3%. 所提算法能够准确检测出不小于16×16像素的目标.

关键词： 深度学习; 目标检测; 多类别单阶检测器（SSD）算法; 特征融合; 特征增强

Abstract:

An improved single-shot-multibox-detector (SSD) algorithm was proposed. Referring to the feature pyramid networks (FPN) algorithm, the features of the Conv4-3 layer were merged with the features of Conv7 and Conv3-3 layers, and the number of default boxes at each location in merged feature map was increased. The squeeze-and-excitation networks (SENet) was added to the network structure; the feature channels of each layer were weighted, in order to enhance the useful feature weights and suppress the invalid feature weights. A series of enhancements were performed on the training data to enhance the generalization performance of the network. The experimental results show that the improved algorithm has a better performance on the VOC (07+12) dataset; the mean average precision (mAP) value of the improved algorithm is 80.4%, which is 2.7% higher than that of the original algorithm; the mAP value of the improved algorithm on COCO dataset (2017) is 42.5%, which is 2.3% higher than that of the original algorithm. Thus, the proposed algorithm can accurately detect the target with a size of at least 16×16 pixels.

Key words: deep learning target detection single-shot-multibox-detector (SSD) algorithm feature fusion feature enhancement

收稿日期: 2019-08-28 出版日期: 2020-09-22

CLC:

TP 391

基金资助: 国家自然科学基金资助项目（61603189）

通讯作者: 白宏阳 E-mail: 117108022106@njust.edu.cn;hongyang@njust.edu.cn

作者简介: 郑浦（1995—），男，硕士生，从事图像处理研究. orcid.org/0000-0002-3128-5219. E-mail： 117108022106@njust.edu.cn

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	作者相关文章
	郑浦
	白宏阳
	李伟
	郭宏伟

引用本文:

郑浦,白宏阳,李伟,郭宏伟. 复杂背景下的小目标检测算法[J]. 浙江大学学报(工学版), 2020, 54(9): 1777-1784.

Pu ZHENG,Hong-yang BAI,Wei LI,Hong-wei GUO. Small target detection algorithm in complex background. Journal of ZheJiang University (Engineering Science), 2020, 54(9): 1777-1784.

链接本文:

http://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2020.09.014 或 http://www.zjujournals.com/eng/CN/Y2020/V54/I9/1777

图 1 多类别单阶检测器（SSD）网络模型示意图

图 2 SSD网络不同层的特征图输出比较

图 3 SSD网络不同通道的特征热力图

图 4 特征融合示意图

图 5 SE网络结构示意图

图 6 改进后的SSD网络结构

图 7 3种算法在不同数据集、不同类别上（瓶子、人、汽车、鸟）的准确率-召回率曲线对比

图 8 F_SE_SSD与SSD算法在VOC数据集上的检测结果对比

图 9 改进后算法（F_SE_SSD）在复杂背景下的检测效果

图 10 改进后算法（F_SE_SSD）检测小目标的效果

1	YILMAZ A, JAVED O, SHAH M Object tracking: a survey[J]. ACM Computing Surveys, 2006, 38 (4): 1- 29
2	李旭冬, 叶茂, 李涛基于卷积神经网络的目标检测研究综述[J]. 计算机应用研究, 2017, 34 (10): 2881- 2886 LI Xu-dong, YE Mao, LI Tao Review of object detection based on convolutional neural networks[J]. Application Research of Computers, 2017, 34 (10): 2881- 2886 doi: 10.3969/j.issn.1001-3695.2017.10.001
3	周晓彦, 王珂, 李凌燕基于深度学习的目标检测算法综述[J]. 电子测量技术, 2017, 40 (11): 89- 93 ZHOU Xiao-yan, WANG Ke, LI Ling-yan Review of object detection based on deep learning[J]. Electronic Measurement Technology, 2017, 40 (11): 89- 93 doi: 10.3969/j.issn.1002-7300.2017.11.020
4	GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation [C] // IEEE Conference on Computer Vision and Pattern Recognition. Columbus: IEEE, 2014: 580-587.
5	GIRSHICK R. Fast R-CNN [C] // IEEE Conference on Computer Vision and Pattern Recognition. Santiago: IEEE, 2015: 1440-1448.
6	REN S Q, HE K M, GIRSHICK R, et al Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 39 (6): 1137- 1149
7	REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection [C] // IEEE Conference on Computer Vision and Pattern Recognition. Washington: IEEE, 2016: 779-788.
8	LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multibox detector [C] // Proceedings of European Conference on Computer Vision. Amsterdam: ECCV, 2016: 21-37.
9	张焕龙, 胡士强, 杨国胜基于外观模型学习的视频目标跟踪方法综述[J]. 计算机研究与发展, 2015, 52 (1): 177- 190 ZHANG Huan-long, HU Shi-qiang, YANG Guo-sheng Video object tracking based on appearance models learning[J]. Journal of Computer Research and Development, 2015, 52 (1): 177- 190 doi: 10.7544/issn1000-1239.2015.20130995
10	尹宏鹏, 陈波, 柴毅, 等基于视觉的目标检测与跟踪综述[J]. 自动化学报, 2016, 42 (10): 1466- 1489 YIN Hong-peng, CHEN Bo, CHAI Yi, et al Vision-based object detection and tracking: a review[J]. Acta Automatica Sinica, 2016, 42 (10): 1466- 1489
11	葛宝义, 左宪章, 胡永江视觉目标跟踪方法研究综述[J]. 中国图象图形学报, 2018, 23 (08): 1091- 1107 GE Bao-yi, ZUO Xian-zhang, HU Yong-jiang Review of visual object tracking technology[J]. Journal of Image and Graphics, 2018, 23 (08): 1091- 1107
12	方路平, 何杭江, 周国民目标检测算法研究综述[J]. 计算机工程与应用, 2018, 54 (13): 11- 18 FANG Lu-ping, HE Hang-jiang, ZHOU Guo-min Research overview of object detection methods[J]. Computer Engineering and Applications, 2018, 54 (13): 11- 18 doi: 10.3778/j.issn.1002-8331.1804-0167
13	朱明明, 许悦雷, 马时平, 等基于特征融合与软判决的遥感图像飞机检测[J]. 光学学报, 2019, 39 (2): 71- 77 ZHU Ming-ming, XU Yue-lei, et al Airplane detection based on feature fusion and soft decision in remote sensing images[J]. Acta Optica Sinica, 2019, 39 (2): 71- 77
14	辛鹏, 许悦雷, 唐红, 等全卷积网络多层特征融合的飞机快速检测[J]. 光学学报, 2018, 38 (3): 344- 350 XIN Peng, XU Yue-lei, TANG Hong, et al Fast airplane detection based on multi-layer feature fusion of fully convolutional networks[J]. Acta Optica Sinica, 2018, 38 (3): 344- 350
15	朱敏超, 冯涛, 张钰基于FD-SSD的遥感图像多目标检测方法[J]. 计算机应用与软件, 2019, 36 (1): 232- 238 ZHU Min-chao, FENG Tao, ZHANG Yu Remote sensing image multi-target detection method based on FD-SSD[J]. Computer Applications and Software, 2019, 36 (1): 232- 238 doi: 10.3969/j.issn.1000-386x.2019.01.042
16	LIN T Y, DOLLAR P, GIRSHICK R, et al. Feature pyramid networks for object detection [C] // IEEE Conference on Computer Vision and Pattern Recognition. Hawaii: IEEE, 2017: 936-944.
17	陈幻杰, 王琦琦, 杨国威, 等多尺度卷积特征融合的SSD目标检测算法[J]. 计算机科学与探索, 2019, 13 (6): 1049- 1061 CHEN Huan-jie, WANG Qi-qi, YANG Guo-wei, et al SSD object detection algorithm with multi-scale convolution feature fusion[J]. Journal of Frontiers of Computer Science and Technology, 2019, 13 (6): 1049- 1061
18	ZEILER M D, FERGUS R. Visualizing and understanding convolutional networks [C] // European Conference on Computer Vision. Zurich: ECCV, 2014: 818-833.
19	王俊强, 李建胜, 周学文, 等改进的SSD算法及其对遥感影像小目标检测性能的分析[J]. 光学学报, 2019, 39 (6): 373- 382 WANG Jun-qiang, LI Jians-heng, ZHOU Xue-wen, et al Improved SSD algorithm and its performance analysis of small target detection in remote sensing images[J]. Acta Optica Sinica, 2019, 39 (6): 373- 382
20	张焯林, 赵建伟, 曹飞龙构建带空洞卷积的深度神经网络重建高分辨率图像[J]. 模式识别与人工智能, 2019, 32 (3): 259- 267 ZHANG Zhuo-lin, ZHAO Jian-wei, CAO Fei-long Building deep neural networks with dilated convolutions to reconstruct high-resolution image[J]. Pattern Recognition and Artificial Intelligence, 2019, 32 (3): 259- 267
21	LONG J, SHELHAMER E, DARRELL T Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 39 (4): 640- 651
22	HU J, SHEN L, SAMUEL A, et al. Squeeze-and-excitation networks [J]. arXiv Preprint arXiv: 1709.01507, 2017.

[1]	许佳辉,王敬昌,陈岭,吴勇. 基于图神经网络的地表水水质预测模型[J]. 浙江大学学报(工学版), 2021, 55(4): 601-607.
[2]	王虹力,郭斌,刘思聪,刘佳琪,仵允港,於志文. 边端融合的终端情境自适应深度感知模型[J]. 浙江大学学报(工学版), 2021, 55(4): 626-638.
[3]	张腾,蒋鑫龙,陈益强,陈前,米涛免,陈彪. 基于腕部姿态的帕金森病用药后开-关期检测[J]. 浙江大学学报(工学版), 2021, 55(4): 639-647.
[4]	徐利锋,黄海帆,丁维龙,范玉雷. 基于改进DenseNet的水果小目标检测[J]. 浙江大学学报(工学版), 2021, 55(2): 377-385.
[5]	许豪灿,李基拓,陆国栋. 由LeNet-5从单张着装图像重建三维人体[J]. 浙江大学学报(工学版), 2021, 55(1): 153-161.
[6]	黄毅鹏,胡冀苏,钱旭升,周志勇,赵文露,马麒,沈钧康,戴亚康. SE-Mask-RCNN：多参数MRI前列腺癌分割方法[J]. 浙江大学学报(工学版), 2021, 55(1): 203-212.
[7]	陈巧红,陈翊,李文书,贾宇波. 多尺度SE-Xception服装图像分类[J]. 浙江大学学报(工学版), 2020, 54(9): 1727-1735.
[8]	周登文,田金月,马路遥,孙秀秀. 基于多级特征并联的轻量级图像语义分割[J]. 浙江大学学报(工学版), 2020, 54(8): 1516-1524.
[9]	明涛,王丹,郭继昌,李锵. 基于多尺度通道重校准的乳腺癌病理图像分类[J]. 浙江大学学报(工学版), 2020, 54(7): 1289-1297.
[10]	张峻宁,苏群星,刘鹏远,王正军,谷宏强. 基于空间约束的自适应单目3D物体检测算法[J]. 浙江大学学报(工学版), 2020, 54(6): 1138-1146.
[11]	闫旭,范晓亮,郑传潘,臧彧,王程,程明,陈龙彪. 基于图卷积神经网络的城市交通态势预测算法[J]. 浙江大学学报(工学版), 2020, 54(6): 1147-1155.
[12]	汪周飞,袁伟娜. 基于深度学习的多载波系统信道估计与检测[J]. 浙江大学学报(工学版), 2020, 54(4): 732-738.
[13]	郑晨斌,张勇,胡杭,吴颖睿,黄广靖. 目标检测强化上下文模型[J]. 浙江大学学报(工学版), 2020, 54(3): 529-539.
[14]	杨冰,莫文博,姚金良. 融合局部特征与深度学习的三维掌纹识别[J]. 浙江大学学报(工学版), 2020, 54(3): 540-545.
[15]	洪炎佳,孟铁豹,黎浩江,刘立志,李立,徐硕瑀,郭圣文. 多模态多维信息融合的鼻咽癌MR图像肿瘤深度分割方法[J]. 浙江大学学报(工学版), 2020, 54(3): 566-573.

Viewed

Full text

Abstract

Cited

Shared

Discussed