Please wait a minute...
浙江大学学报(工学版)
计算机技术﹑电信技术     
基于μσ-DWC特征和树结构M-SVM的多维时间序列分类
谭海龙1, 刘康玲1, 金鑫1, 石向荣2, 梁 军1
1.浙江大学 控制科学与工程学系,浙江 杭州 310027;2.浙江财经大学 信息管理系 浙江 杭州 310018
Multivariate time series classification based on μσ-DWC feature and tree-structured M-SVM
TAN Hailong1, LIU Kangling1, JIN Xin1, SHI Xiangrong2, LIANG Jun1
1. Department of Control Science and Engineering, Zhejiang University, Hangzhou 310027, China; 2. Department of Information Management, Zhejiang University of Finance and Economics, Hangzhou 310018, China
 全文: PDF(1116 KB)   HTML
摘要:

为了实现多维时间序列的分类,提出基于统计量-小波系数(μσ-DWC)的序列特征提取方法和新型树结构多分类支持向量机M-SVM模型.分类算法的实现过程如下:利用该特征提取方法将原始多维时间序列映射到特征空间,获得原始序列的压缩表示,即特征向量;得到训练集的特征向量表示之后,训练和构建树结构M-SVM模型;提取未知序列的特征向量并输入已训练完成的树结构M-SVM模型,得到未知序列的类标号,完成分类.实验结果表明:该算法比传统的分类方法具有更高的分类准确率和预测速度,同时可以保证较理想的训练速度.

Abstract:

Aiming at the realization of multivariate time series classification, a feature extraction method based on dimension statistics-wavelet coefficients (μσ-DWC) and a new classification model based on multi-class support vector machine (M-SVM) with tree structure were proposed. The classification algorithm was realized as follows. Firstly, map original multivariate time series to feature space by the proposed feature extraction method. The compressed representation of original time series, namely eigenvector, was obtained. Secondly, tree-structured M-SVM model was trained and constructed after getting the eigenvector representation of training set. Finally, the eigenvector of an unknown time series was extracted and put into the trained tree-structured M-SVM model. Thus, the classification process was completed and the class label of the unknown time series was obtained. The experimental results showed that the proposed algorithm provided higher classification accuracy and faster prediction speed than traditional classification methods, and gave relatively ideal training speed at the same time.

出版日期: 2015-06-01
:  TP 181  
基金资助:

国家自然科学基金资助项目(61174114);教育部博士点基金优先领域资助项目(20120101130016);浙江省公益性技术应用研究计划资助项目(2014C31019);浙江省网络媒体云处理与分析工程技术中心资助项目(2012E10023-7)

通讯作者: 梁军,男,教授     E-mail: jliang@iipc.zju.edu.cn
作者简介: 谭海龙(1988—),男,硕士生,从事机器学习与数据挖掘研究.Email: hhtan@iipc.zju.edu.cn
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章  

引用本文:

谭海龙, 刘康玲, 金鑫, 石向荣, 梁军. 基于μσ-DWC特征和树结构M-SVM的多维时间序列分类[J]. 浙江大学学报(工学版), 10.3785/j.issn.1008973X.2015.06.009.

TAN Hailong, LIU Kangling, JIN Xin, SHI Xiang rong, LIANG Jun. Multivariate time series classification based on μσ-DWC feature and tree-structured M-SVM. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 10.3785/j.issn.1008973X.2015.06.009.

链接本文:

http://www.zjujournals.com/eng/CN/10.3785/j.issn.1008973X.2015.06.009        http://www.zjujournals.com/eng/CN/Y2015/V49/I6/1061

[1] AGRAWAL R, FALOUTSOS C, SWAMI A. Efficient similarity search in sequence databases[M]. Springer Berlin Heidelberg: Springer, 1993:  69-84.
[2] CHAN K P, FU A W C. Efficient time series matching by wavelets [C]∥ Proceedings of the International Conference on Data Engineering, Washington: IEEE, 1999:  126-133.
[3] KORN F, JAGADISH H V, FALOUTSOS C. Efficiently supporting ad hoc queries in large datasets of time sequences[J]. ACM SIGMOD Record, 1997, 26(2): 289-300.
[4] KEOGH E J, SMYTH P. A probabilistic approach to fast pattern matching in time series databases [C]∥Proceedings of 3rd International Conference on Knowledge Discovery and Data Mining, Newport Beach: AAAI, 1997: 24-30.
[5] MISHRA S, BHENDE C N, PANIGRAHI K B. Detection and classification of power quality disturbances using Stransform and probabilistic neural network[J]. IEEE Transactions on Power Delivery, 2008, 23(1): 280-287.
[6] MADZAROV G, GJORGJEVIKJ D, CHORBEV I. A multiclass SVM classifier utilizing binary decision tree[J]. Informatica, 2009:  233-241.
[7] 刁智华,赵春江,郭新宇,等.一种新的基于平衡决策树的SVM多分类算法[J].控制与决策,2011,26(1): 149-152.
DIAO Zhihua, ZHAO Chunjiang, GUO Xinyu, et al. A new SVM multiclass classification algorithm based on
balance decision tree [J].Control and Decision, 2011,26 (1): 149-152.
[8] ERI??塁TI H, UAR A, DEMIR Y. Waveletbased feature extraction and selection for classification of power system disturbances using support vector machines [J].Electric Power Systems Research, 2010, 80(7): 743-752.
[9] CHAN K P, FU A W C. Efficient time series matching by wavelets [C]∥IEEE International Conference on Data Engineering, \[S.l.\]: IEEE Computer Society, 1999:  126-133.
[10] 李航.统计学习方法[M].北京:清华大学出版社. 2012:  96-124.
[11] CHANG C, LIN C. LIBSVM: A library for support vector machines[J]. ACM Transactions on Intelligent Systems and Technology, 2001, 2(3): 389396.
[12] 葛哲学.神经网络理论与Matlab R2007实现[M].北京:电子工业出版社.2007:  126-127.
[13] MOHAMMED W K. Highquality recordings of Australian sign language signs[EB/OL]. [20020226]. http:∥archive.ics.uci.edu/ml/machinelearningdatabases/auslan2mld/auslan.html.
[14] MOULDI B. Spoken Arabic digit[EB/OL].[20100913]. http:∥archive.ics.uci.edu/ml/machinelearningdatabases/00195/.
[1] 朱东阳, 沈静逸, 黄炜平, 梁军. 基于主动学习和加权支持向量机的工业故障识别[J]. 浙江大学学报(工学版), 2017, 51(4): 697-705.
[2] 丰小月, 梁艳春, 林希珣, 管仁初. 永恒语言学习研究与发展[J]. 浙江大学学报(工学版), 2017, 51(1): 82-88.
[3] 裘日辉, 刘康玲, 谭海龙, 梁军. 基于极限学习机的分类算法及在故障识别中的应用[J]. 浙江大学学报(工学版), 2016, 50(10): 1965-1972.
[4] 居斌, 钱沄涛, 叶敏超. 基于结构投影非负矩阵分解的协同过滤算法[J]. 浙江大学学报(工学版), 2015, 49(7): 1319-1325.
[5] 林亦宁, 韦巍, 戴渊明. 半监督Hough Forest跟踪算法[J]. J4, 2013, 47(6): 977-983.
[6] 李侃,黄文雄,黄忠华. 基于支持向量机的多传感器探测目标分类方法[J]. J4, 2013, 47(1): 15-22.
[7] 姚伏天, 钱沄涛, 李吉明. 空间约束半监督高斯过程下的高光谱图像分类[J]. J4, 2012, 46(7): 1295-1300.
[8] 王洪波, 赵光宙, 齐冬莲, 卢达. 一类支持向量机的快速增量学习方法[J]. J4, 2012, 46(7): 1327-1332.
[9] 戴兴虎,钱沄涛,唐凤仙,居斌. 基于图表标题信息的在线生物文献MRI图像检测[J]. J4, 2012, 46(7): 1307-1313.
[10] 艾解清, 高济, 彭艳斌, 郑志军. 基于直推式支持向量机的协商决策模型[J]. J4, 2012, 46(6): 967-973.
[11] 潘俊, 孔繁胜, 王瑞琴. 局部敏感判别直推学习机[J]. J4, 2012, 46(6): 987-994.
[12] 金卓军, 钱徽, 朱淼良. 基于倾向性分析的轨迹评测技术[J]. J4, 2011, 45(10): 1732-1737.
[13] 潘俊,孔繁胜,王瑞琴. 加权成对约束投影半监督聚类[J]. J4, 2011, 45(5): 934-940.
[14] 顾弘, 赵光宙. 广义局部图像距离函数下的图像分类与识别[J]. J4, 2011, 45(4): 596-601.
[15] 罗建宏,陈德钊. 兼顾正确率和差异性的自适应集成算法及应用[J]. J4, 2011, 45(3): 557-562.