用于图像分类的卷积神经网络中激活函数的设计

doi:10.3785/j.issn.1008-973X.2019.07.016

浙江大学学报(工学版)

2019, Vol. 53

Issue (7): 1363-1373 DOI: 10.3785/j.issn.1008-973X.2019.07.016

自动化技术、计算机技术

用于图像分类的卷积神经网络中激活函数的设计

王红霞(

),周家奇,辜承昊,林泓*(

)

武汉理工大学计算机科学与技术学院，湖北武汉 430063

Design of activation function in CNN for image classification

Hong-xia WANG(

),Jia-qi ZHOU,Cheng-hao GU,Hong LIN*(

)

School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430063, China

全文: PDF(1107 KB) HTML

摘要：

为了提高图像分类效果，针对卷积神经网络中常用激活函数relu在x负半轴的导数恒为零，导致训练过程中容易造成神经元“坏死”以及现有组合激活函数relu-softplus在模型收敛情况下学习率过小导致收敛速度慢的问题，提出新的组合激活函数relu-softsign. 分析激活函数在训练过程中的作用，给出激活函数在设计时需要考虑的要点；根据这些要点，将relu和softsign函数于x轴正、负半轴进行分段组合，使其x负半轴导数不再恒为零；分别在MNIST、PI100、CIFAR-100和Caltech256数据集上，与单一的激活函数和relu-softplus组合激活函数进行对比实验. 实验结果表明，使用relu-softsign组合激活函数提高了模型分类准确率，简单有效地缓解了神经元不可逆“坏死”现象；加快了模型的收敛速度，在复杂数据集上该组合函数的收敛性能更好.

关键词： 图像分类; 卷积神经网络; 激活函数; relu; 神经元坏死; 组合激活函数

Abstract:

A new combinatorial activation function called relu-softsign was proposed aiming at the problem that the derivative of the commonly used activation function relu in the convolutional neural network is constant to zero at the x negative axis, which makes it easy to cause neuron necrosis during training, and the existing combinatorial activation function relu-softplus can only use the small learning rate in the case of model convergence, which leads to slow convergence. The image classification effect was improved. The role of the activation function during training was analyzed, and the key points that need to be considered in the design of the activation function were given. The relu and softsign functions were combined piecewise in the positive and negative semi axis of the x axis according to these points, so that the derivative of x negative semi axis was no longer constant to zero. Then comparision with the single activation function and relu-softplus combination activation function was conducted on the MNIST, PI100, CIFAR-100 and Caltech256 datasets. The experimental results show that the combinatorial activation function relu-softsign improves the model classification accuracy, simply and effectively mitigates the irreversible " necrosis” phenomenon of neurons. The convergence speed of the model is accelerated, especially on complex data sets.

Key words: image classification convolutional neural network activation function relu neurons necrosis combinatorial activation function

收稿日期: 2018-10-31 出版日期: 2019-06-25

CLC:

TP 391

通讯作者: 林泓 E-mail: 99575522@qq.com;linhong@whut.edu.cn

作者简介: 王红霞（1977—），女，副教授，博士，从事模式识别、图像分类研究. orcid.org/0000-0003-1213-7760. E-mail： 99575522@qq.com

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	作者相关文章
	王红霞
	周家奇
	辜承昊
	林泓

引用本文:

王红霞,周家奇,辜承昊,林泓. 用于图像分类的卷积神经网络中激活函数的设计[J]. 浙江大学学报(工学版), 2019, 53(7): 1363-1373.

Hong-xia WANG,Jia-qi ZHOU,Cheng-hao GU,Hong LIN. Design of activation function in CNN for image classification. Journal of ZheJiang University (Engineering Science), 2019, 53(7): 1363-1373.

链接本文:

http://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2019.07.016 或 http://www.zjujournals.com/eng/CN/Y2019/V53/I7/1363

图 1 sigmoid函数与tanh函数曲线图

图 2 softsign函数与tanh函数曲线图

图 3 relu函数曲线图

图 4 relu-softplus组合激活函数曲线图

图 5 softplus函数和softsign函数的曲线图

图 6 softplus和softsign的导数曲线对比图

图 7 relu-softsign函数曲线图

表 1 MNIST上的实验结果

图 8 不同激活函数在MNIST上的losst

表 2 PI100上的实验结果

图 9 不同激活函数在PI100上的losst

表 3 CIFAR-100上的实验结果

图 10 不同激活函数在CIFAR-100上的losst

表 4 Caltech256上的实验结果

图 11 不同激活函数在Caltech256上的losst

表 5 MNIST和PI100上不同学习率下的收敛情况对比

表 6 CIFAR-100和Caltech256上不同学习率下的收敛情况对比

图 12 Caltech256上relu-softsign和relu-softplus的losst

1	黄凯奇, 任伟强, 谭铁牛图像物体分类与检测算法综述[J]. 计算机学报, 2014, 36 (6): 1225- 1240 HUANG Kai-qi, REN Wei-qiang, TAN Tie-niu A review on image object classification and detection[J]. Chinese Journal of Computers, 2014, 36 (6): 1225- 1240
2	常亮, 邓小明, 周明全, 等图像理解中的卷积神经网络[J]. 自动化学报, 2016, 42 (9): 1300- 1312 CHANG Liang, DENG Xiao-ming, ZHOU Ming-quan, et al Convolution neural network in image understanding[J]. Acta Automatica Sinica, 2016, 42 (9): 1300- 1312
3	吴正文. 卷积神经网络在图像分类中的应用研究[D]. 成都: 电子科技大学, 2015. WU Zheng-wen. Application of convolution neural network in image classification [D]. Chengdu: University of Electronic Science and Technology of China, 2015.
4	KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks [C] // International Conference on Neural Information Processing Systems. Lake Tahoe: Springer, 2012: 1097-1105.
5	NAIR V, HINTON G E. Rectified linear units improve restricted Boltzmann machines [C] // Proceedings of the 27th International Conference on Machine Learning (ICML-10). Haifa: Omnipress, 2010: 807-814.
6	DOLEZEL P, SKRABANEK P, GAGO L Weight initialization possibilities for feedforward neural network with linear saturated activation functions[J]. IFAC-PapersOnLine, 2016, 49 (25): 49- 54 doi: 10.1016/j.ifacol.2016.12.009
7	MAAS A L, HANNUN A Y, NG A Y. Rectifier nonlinearities improve neural network acoustic models [C] // Proceedings of the 30th International Conference on Machine Learning. Atlanta: ACM, 2013: 456-462.
8	CLEVERT D A, UNTERTHINER T, HOCHREITER S Fast and accurate deep network learning by exponential linear units (ELUs)[J]. Computer Science, 2015, 5 (2): 716- 730
9	HE K, ZHANG X, REN S, et al. Delving deep into rectifiers: surpassing human-level performance on ImageNet classification [C] // Proceedings of the IEEE international conference on computer vision. Santiago: IEEE, 2015: 1026-1034.
10	石琪. 基于卷积神经网络图像分类优化算法的研究与验证[D]. 北京: 北京交通大学, 2017. SHI Qi. Research and verification of image classification optimization algorithm based on convolutional neural network [D]. Beijing: Beijing Jiaotong University, 2017.
11	LECUN Y, BOTTOU L, BENGIO Y, et al Gradient-based learning applied to document recognition[J]. Proceedings of the IEEE, 1998, 86 (11): 2278- 2324 doi: 10.1109/5.726791
12	Microsoft Research: product image categorization data set (PI 100)[DB/OL]. [2010-11-01]. http://research.microsoft.com/en-us/people/xingx/pi100.aspx.
13	FERRARI V, JURIE F, SCHMID C From images to shape models for object detection[J]. International Journal of Computer Vision, 2010, 87 (3): 284- 303 doi: 10.1007/s11263-009-0270-9
14	GRIFFIN G, HOULUB A, PERONA P. The Caltech-256. Technical report [R]. Pasadena: California Institute of Technology, 2007.
15	李明威. 图像分类中的卷积神经网络方法研究 [D]. 南京: 南京邮电大学, 2016. LI Ming-wei. Research of convolutional neural network in image classification [D]. Nanjing: Nanjing University of Posts and Telecommunications, 2016.
16	DUDA R O, HART P E, STORK D G. Pattern classification [M]. [S.l.]: Wiley, 2004.
17	贾世杰. 基于内容的商品图像分类方法研究[D]. 大连: 大连理工大学, 2013. JIA Shi-jie. Research on content based classification of commodity image [D]. Dalian: Dalian University of Technology, 2013.

[1]	郑英杰,吴松荣,韦若禹,涂振威,廖进,刘东. 基于目标图像FCM算法的地铁定位点匹配及误报排除方法[J]. 浙江大学学报(工学版), 2021, 55(3): 586-593.
[2]	陈巧红,陈翊,李文书,贾宇波. 多尺度SE-Xception服装图像分类[J]. 浙江大学学报(工学版), 2020, 54(9): 1727-1735.
[3]	沈麒,赵琰,周晓炜,袁晓冉. 结合结构与梯度的图像哈希算法[J]. 浙江大学学报(工学版), 2020, 54(8): 1525-1533.
[4]	周登文,田金月,马路遥,孙秀秀. 基于多级特征并联的轻量级图像语义分割[J]. 浙江大学学报(工学版), 2020, 54(8): 1516-1524.
[5]	张彦楠,黄小红,马严,丛群. 基于深度学习的录音文本分类方法[J]. 浙江大学学报(工学版), 2020, 54(7): 1264-1271.
[6]	康庄,杨杰,郭濠奇. 基于机器视觉的垃圾自动分类系统设计[J]. 浙江大学学报(工学版), 2020, 54(7): 1272-1280.
[7]	明涛,王丹,郭继昌,李锵. 基于多尺度通道重校准的乳腺癌病理图像分类[J]. 浙江大学学报(工学版), 2020, 54(7): 1289-1297.
[8]	闫旭,范晓亮,郑传潘,臧彧,王程,程明,陈龙彪. 基于图卷积神经网络的城市交通态势预测算法[J]. 浙江大学学报(工学版), 2020, 54(6): 1147-1155.
[9]	沈宗礼,余建波. 基于迁移学习与深度森林的晶圆图缺陷识别[J]. 浙江大学学报(工学版), 2020, 54(6): 1228-1239.
[10]	汪海晋,尹宗宇,柯臻铮,郭英杰,董辉跃. 基于一维卷积神经网络的螺旋铣刀具磨损监测[J]. 浙江大学学报(工学版), 2020, 54(5): 931-939.
[11]	杨萍,王丹,康子健,李童,付利华,余悦任. 基于模式识别和集成CNN-LSTM的阵发性房颤预测模型[J]. 浙江大学学报(工学版), 2020, 54(5): 1039-1048.
[12]	金列俊,詹建明,陈俊华,王涛. 基于一维卷积神经网络的钻杆故障诊断[J]. 浙江大学学报(工学版), 2020, 54(3): 467-474.
[13]	付晓峰,牛力,胡卓群,李建军,吴卿. 基于过渡帧概念训练的微表情检测深度网络[J]. 浙江大学学报(工学版), 2020, 54(11): 2128-2137.
[14]	叶刚,李毅波,马逐曦,成杰. 基于ViBe的端到端铝带表面缺陷检测识别方法[J]. 浙江大学学报(工学版), 2020, 54(10): 1906-1914.
[15]	贾子钰,林友芳,张宏钧,王晶. 基于深度卷积神经网络的睡眠分期模型[J]. 浙江大学学报(工学版), 2020, 54(10): 1899-1905.

Viewed

Full text

Abstract

Cited

Shared

Discussed