Please wait a minute...
浙江大学学报(工学版)
自动化技术、控制技术     
基于主动学习和加权支持向量机的工业故障识别
朱东阳, 沈静逸, 黄炜平, 梁军
浙江大学 控制科学与工程学院,浙江 杭州 310027
Fault classification based on modified active learning and weighted SVM
ZHU Dong-yang, SHEN Jing-yi, HUANG Wei-ping, LIANG Jun
College of Control Science and Engineering,Zhejiang University,Hangzhou 310027,China
 全文: PDF(1277 KB)   HTML
摘要:

针对流程工业过程中有标签故障样本少,样本标注代价昂贵,样本集存在类不平衡以及样本孤点问题,研究基于最优次优标号(BvSB)和加权支持向量机(WSVM)的工业故障分类方法.通过综合考虑样本的信息度和代表性以及样本中可能存在的孤立点,提出改进的主动学习算法,用于挖掘那些对当前分类器模型最有价值的样本进行标注.在支持向量机训练学习中,对不同样本采用不同的权重系数,不同类别赋予不同的惩罚因子,减少了样本分布不平衡时对主动学习和分类精度的影响,充分考虑样本点在特征空间的分布情况,提出新的惩罚系数选取方法.以TE过程为例,实验结果证明,提出的方法能够在获得较高故障分类准确率的情况下减少标注负担.

Abstract:

A new method based on modified Best versus Second-Best (BvSB) active learning and weighted support vector machine for fault classification in real-world industrial process was presented in order to solve the problems that large-scale labeled fault samples are not easy to acquire, labeling cost is expensive, datasets are usually imbalanced and contaminated with outliers. An improved BvSB selection method was proposed to iteratively select the most valuable data and query their labels by comprehensively measuring the informativeness and representativeness of unlabeled instances and reducing the impact of outliers. WeightedSVM was introduced to tackle the impact of imbalanced class distribution on active learning and classification accuracy, using different weight factors for classes and individual samples. A new efficient method of determining the penalty coefficient was presented. Case study on TE process verifies that the proposed approach can achieve superior classification accuracy while reducing the labeling cost.

出版日期: 2017-04-25
CLC:  TP 181  
基金资助:

国家自然科学基金资助项目(61174114,U1509203);教育部高校博士点专项科研基金资助项目(20120101130016);浙江省公益性技术应用研究计划资助项目(2014C31019).

通讯作者: 梁军,男,教授,博导. ORCID: 0000-0003-1115-0824.     E-mail: jliang@iipc.zju.edu.cn
作者简介: 朱东阳(1992—),女,硕士生,从事模式识别、故障诊断等研究. ORCID: 0000-0002-1900-3477. E-mail: dyzhu@iipc.zju.edu.cn
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
作者相关文章  

引用本文:

朱东阳, 沈静逸, 黄炜平, 梁军. 基于主动学习和加权支持向量机的工业故障识别[J]. 浙江大学学报(工学版), 10.3785/j.issn.1008-973X.2017.04.009.

ZHU Dong-yang, SHEN Jing-yi, HUANG Wei-ping, LIANG Jun. Fault classification based on modified active learning and weighted SVM. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 10.3785/j.issn.1008-973X.2017.04.009.

[1] JAIN P, KAPOOR A. Active learning for large multi-class problems\[C\]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Miami: IEEE, 2009.
[2] DASGUPTA S. Coarse sample complexity bounds for active learning [J]. Neural Information Processing Systems, 2005, 18: 235-242.
[3] 唐明珠,阳春华,桂卫华.基于改进的QBC和CS-SVM的故障检测[J].控制与决策,2012, 27(10): 1489-1493.
TANG Ming-zhu, YANG Chun-hua, GUI Wei-hua. Fault detection based on modified QBC and CS-SVM \[J\]. Control and Decision, 2012, 27(10): 1489-1493.
[4] YAN Z, CHEN J. Enhancing quality of statistic monitoring models by training set eesign with active learning approach [J]. Chemometrics and Intelligent Laboratory Systems, 2015, 151(2016): 201-218.
[5] TONG S, KOLLER D. Support vector machine active learning with applications to text classification [J]. Journal of Machine Learning Research, 2012, 2(1):45-66.
[6] WANG R, KWONG S, CHEN D. Inconsistency-based active learning for support vector machines [J]. Pattern Recognition, 2012, 45(10): 3751-3767.
[7] 徐海龙,别晓峰,冯卉,等.一种基于QBC的SVM主动学习算法[J].系统工程与电子技术,2015, 37(12): 2865-2871.
XU Hai-long, BIE Xiao-feng, FENG Hui, et al. Active learning algorithm for SVM based on QBC \[J\]. System Engineering and Electronics, 2015, 37(12): 2865-2871.
[8] COHN D A, GHAHRAMANI Z, JORDAN M I. Active learning with statistical models [J]. Journal of Artificial Intelligence Research, 1996, 4(1): 705-712.
[9] 陈荣,曹永锋,孙洪.基于主动学习和半监督学习的多类图像分类 [J]. 自动化学报, 2011, 37(8): 954-962.
CHEN Rong, CAC Yong-feng, SUN Hong. Multiclass image classification with active learning and semi-supervised learning \[J\]. Acta Automatica Sinica, 2011,37(8): 954-962.
[10] JOSHI A J, PORIKLI F, PAPANIKOLOPOULOS N. Multi-class active learning for image classification\[C\]∥IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Miami: IEEE, 2009.
[11] PERRY T, BADER-EL-DEN M, COOPER S. Imbalanced classification using genetically optimized cost sensitive classifiers\[C\]∥2015 IEEE Congress on Evolutionary Computation. Sendai: IEEE, 2015.
[12] BURNAEV E, EROFEEV P, PAPANOV A. Influence of resampling on accuracy of imbalanced classification \[C\]∥Proceedings of the 8th International Conference on Machine Vision. Barcelona: \[s.n.\], 2015.
[13] YANG Y, MA Z, NIE F, et al. Multi-class active learning by uncertainty sampling with diversity maximization [J]. International Journal of Computer Vision, 2015, 113(2): 113-127.
[14] 施化吉,周书勇,李星毅,等.基于平均密度的孤立点检测研究[J].电子科技大学学报,2007, 36(6): 1286-1288.
SHI Hua-ji, ZHOU Shu-yong, LI Xing-yi, et al. Average density-based outliers detection \[J\]. Journal of University of Electronic Science and Technology of China, 2007, 36(6): 1286-1288.
[15] 金鑫,李玉鑑.面向非平衡数据处理的样例惩罚支持向量机 [J]. 武汉大学学报:理学版, 2012, 58(2): 139-143.
JIN Xin, LI Yu-jian. Support vector machine with example dependent costs for dealing with imbalanced data \[J\]. Journal of Wuhan University: Natural Science Edition, 2012, 58(2): 139-143.
[16] GE Z, SONG Z, GAO F. Review of recent research on data-based process monitoring [J]. Industrial and Engineering Chemistry Research, 2013, 52(10):3543-3562.
[17] LIU Y, LIAN J, BARTOLACCI M R, et al. Density-based penalty parameter optimization on C-SVM [J]. The Scientific World Journal, 2014(2014):851814.
[18] 金鑫,李玉鑑.不平衡支持向量机的惩罚因子选择方法[J].计算机工程与应用,2011,47(33): 129-133.
JIN Xin, LI Yu-jian. Error-cost selection for biased support vector machines \[J\]. Computer Engineering and Applications, 2011, 47(33): 129-133.
[19] TAX D M, DUIN R P. Support vector data description [J]. Machine Learning, 2004, 54(1): 45-66.
[20] DOWNS J J, VOGEL E F. A plant-wide industrial process control problem [J]. Computers and Chemical Engineering, 1993, 17(3): 245-255.
[21] 石向荣.面向过程监控的非线性特征提取方法研究[D].杭州:浙江大学,2014.
SHI Xiang-rong. Study on nonlinear feature extraction for process monitoring \[D\]. Hangzhou: Zhejiang University, 2014.

[1] 刘晓伟,陈赟,张思,陈康. FDM型增材制造中送丝机构动态监测与识别[J]. 浙江大学学报(工学版), 2021, 55(3): 548-554.
[2] 陈巧红,陈翊,李文书,贾宇波. 多尺度SE-Xception服装图像分类[J]. 浙江大学学报(工学版), 2020, 54(9): 1727-1735.
[3] 李文书,邹涛涛,王洪雁,黄海. 基于双尺度长短期记忆网络的交通事故量预测模型[J]. 浙江大学学报(工学版), 2020, 54(8): 1613-1619.
[4] 胡天磊,王皓波,尹文栋. 基于深度双向分类器链的多标签新闻分类算法[J]. 浙江大学学报(工学版), 2019, 53(11): 2110-2117.
[5] 徐兵,刘潇,汪子扬,刘飞虎,梁军. 采用梯度提升决策树的车辆换道融合决策模型[J]. 浙江大学学报(工学版), 2019, 53(6): 1171-1181.
[6] 王硕朋,杨鹏,孙昊,刘迈. 两级参考点匹配位置指纹声源定位方法[J]. 浙江大学学报(工学版), 2019, 53(6): 1198-1204.
[7] 丰小月, 梁艳春, 林希珣, 管仁初. 永恒语言学习研究与发展[J]. 浙江大学学报(工学版), 2017, 51(1): 82-88.
[8] 裘日辉, 刘康玲, 谭海龙, 梁军. 基于极限学习机的分类算法及在故障识别中的应用[J]. 浙江大学学报(工学版), 2016, 50(10): 1965-1972.
[9] 居斌, 钱沄涛, 叶敏超. 基于结构投影非负矩阵分解的协同过滤算法[J]. 浙江大学学报(工学版), 2015, 49(7): 1319-1325.
[10] 谭海龙, 刘康玲, 金鑫, 石向荣, 梁军. 基于μσ-DWC特征和树结构M-SVM的多维时间序列分类[J]. 浙江大学学报(工学版), 2015, 49(6): 1061-1069.
[11] 林亦宁, 韦巍, 戴渊明. 半监督Hough Forest跟踪算法[J]. J4, 2013, 47(6): 977-983.
[12] 李侃,黄文雄,黄忠华. 基于支持向量机的多传感器探测目标分类方法[J]. J4, 2013, 47(1): 15-22.
[13] 姚伏天, 钱沄涛, 李吉明. 空间约束半监督高斯过程下的高光谱图像分类[J]. J4, 2012, 46(7): 1295-1300.
[14] 戴兴虎,钱沄涛,唐凤仙,居斌. 基于图表标题信息的在线生物文献MRI图像检测[J]. J4, 2012, 46(7): 1307-1313.
[15] 王洪波, 赵光宙, 齐冬莲, 卢达. 一类支持向量机的快速增量学习方法[J]. J4, 2012, 46(7): 1327-1332.