Please wait a minute...
浙江大学学报(理学版)  2021, Vol. 48 Issue (1): 9-17    DOI: 10.3785/j.issn.1008-9497.2021.01.002
图像理解与数据分析     
排序支持的交互数据分类算法及其应用
邓惠俊
万博科技职业学院 智能信息学院,安徽 合肥 230031
Ranking-supported interactive data classification method and its application
DENG Huijun
Institute of Intelligent Information, Wanbo Institute of Science and Technology, Hefei 230031, China
 全文: PDF(3772 KB)   HTML  
摘要: 交互分类是解决数据复杂分类问题的主要手段之一。在现有的大多交互分类系统中,用户能准确识别数据类别,但在有些分类场景中,类别之间的顺序关系更容易被识别,为此,提出一种排序支持的交互数据分类算法。为提升交互分类精度,引入数据的顺序信息,为降低标记难度,提出候选样本推荐策略。另外,提出一种评估分类算法性能的可视化方法,用包含基本车况、交通违法记录、交通事故记录等信息的车辆数据集进行实验验证,将相关车辆分为高危车辆、中危车辆、低危车辆3类,算法的分类结果模型一致度达近98%,验证了方法的有效性。
关键词: 交互分类排序支持向量机(SVM)高维数据    
Abstract: Interactive classification is one of the main means schemes to solve complex classification problems of data.In most existing interactive classification system,users can accurately identify the categories of data.However,in some classification scenarios,there is an order relationship between categories,and it is easier for users to identify the order relationship. This paper presented an interactive data classification method supported by sorting.It introduces the user's cognition of the data order information to improve the accuracy of interactive classification with a candidate sample recommendation strategy to reduce users' difficulty of identifying.To demonstrate the potential of the new approach, we propose a visualization method for classification algorithm performance measurement based on the vehicle data set of accident information including basic vehicle information,illegal driving information and vehicle traffic records.The relevant vehicles are divided into high-risk vehicles,medium-risk vehicles,and low-risk vehicles.The classification consistency reaches nearly 98%,which verifies the effectiveness of the proposed method.
Key words: multi-dimensional data    supported vector machine (SVM)    ranking    interactive classification
收稿日期: 2020-09-23 出版日期: 2021-01-20
CLC:  TP 391.41  
基金资助: 安徽省高等学校自然科学研究重点项目(KJ2020A1171,KJ2019A1039);安徽省高等学校省级质量工程项目(2017JXTD129).
作者简介: 邓惠俊(1978—),ORCID:http://orcid.org/0000-0002-5083-8499,女,硕士,副教授,主要从事数据可视化、可视分析、计算机辅助设计研;
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章  
邓惠俊

引用本文:

邓惠俊. 排序支持的交互数据分类算法及其应用[J]. 浙江大学学报(理学版), 2021, 48(1): 9-17.

DENG Huijun. Ranking-supported interactive data classification method and its application. Journal of Zhejiang University (Science Edition), 2021, 48(1): 9-17.

链接本文:

https://www.zjujournals.com/sci/CN/10.3785/j.issn.1008-9497.2021.01.002        https://www.zjujournals.com/sci/CN/Y2021/V48/I1/9

1 PAIVA J G S,SCHWARTZ W R,PEDRINI H,et al.An approach to supporting incremental visual data classification[J].IEEE Transactions on Visualization and Computer Graphics,2014,21(1):4-17. DOI:10.1109/TVCG.2014.2331979
2 马昱欣,曹震东,陈为.可视化驱动的交互式数据挖掘方法综述[J].计算机辅助设计与图形学学报,2016,28(1):1-8.DOI:10.3969/j.issn.1003-9775. 2016.01.001 MA Y X,CAO Z D,CHEN W.A survey of visualization-driven interactive data mining approaches[J].Journal of Computer-Aided Design,2016,28(1):1-8.DOI:10.3969/j.issn.1003-9775.2016. 01.001
3 SACHA D,SEDLMAIR M,ZHANG L,et al.What you see is what you can change:Human-centered machine learning by interactive visualization[J].Neurocomputing,2017,268:164-175.DOI:10.1016/j.neucom.2017.01.105
4 MING Y,QU H,BERTINI E.RuleMatrix:Visualizing and understanding classifiers with rules[J].IEEE Transactions on Visualization and Computer Graphics,2018,25(1):342-352.DOI:10.1109/TVCG.2018.2864812
5 AFFONSO C,ROSSI A L D,VIEIRA F H A,et al.Deep learning for biological image classification[J].Expert Systems with Applications,2017,85:114-122. DOI:10.1016/j.eswa.2017.05.039
6 KUMAR P,GUPTA A.Active learning query strategies for classification,regression,and clustering:A survey[J].Journal of Computer Science and Technology,2020,35(4):913-945.DOI:10.1007/s11390-020-9487-4
7 JOSHI A J,PORIKLI F,PAPANIKOLOPOULOS N P.Scalable active learning for multiclass image classification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2012,34(11):2259-2273. DOI:10.1109/tpami.2012.21
8 BELUCH W H,GENEWEIN T,NÜRNBERGER A,et al.The power of ensembles for active learning in image classification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City:IEEE,2018:9368-9377.
9 DO T N.Towards Simple,Easy to Understand,An Interactive Decision Tree Algorithm[R].Can Tho:Can Tho University,2007.
10 VAN DEN ELZEN S,VAN WIJK J J.BaobabView:Interactive construction and analysis of decision trees[C]//2011 IEEE Conference on Visual Analytics Science and Technology (VAST). Providence:IEEE,2011:151-160.
11 CHOO J,LEE H,KIHM J,et al.iVisClassifier:An interactive visual analytics system for classification based on supervised dimension reduction[C]//2010 IEEE Symposium on Visual Analytics Science and Technology.Salt Lake City:IEEE,2010:27-34.DOI:10.1109/vast.2010.5652443
12 LEGG P A,CHUNG D H S,PARRY M L,et al.Transformation of an uncertain video search pipeline to a sketch-based visual analytics loop[J].IEEE Transactions on Visualization and Computer Graphics,2013,19(12):2109-2118.DOI:10.1109/tvcg.2013.207
13 HÖFERLIN B,NETZEL R,HÖFERLIN M,et al.Interactive learning of Ad-hoc classifiers for video visual analytics[C]//2012 IEEE Conference on Visual Analytics Science and Technology (VAST).Seattle:IEEE,2012:23-32.
14 MAATEN L,HINTON G.Visualizing data using t-SNE[J].Journal of Machine Learning Research,2008,9:2579-2605. DOI:10.1007/s10846-008-9235-4
15 RAO R,CARD S K.The Table Lens:Merging graphical and symbolic representations in an interactive focus+ context visualization for tabular information[C]//Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems.Boston:ACM,1994:318-322. DOI:10.1145/191666.191776
16 CARENINI G,LOYD J.ValueCharts:Analyzing linear models expressing preferences and evaluations[C]//Proceedings of the Working Conference on Advanced Visual Interfaces.Gallipoli:ACM,2004:150-157.
17 GRATZL S,LEX A,GEHLENBORG N,et al.LineUp:Visual analysis of multi-attribute rankings[J].IEEE Transactions on Visualization and Computer Graphics,2013,19(12):2277-2286.DOI:10.1109/TVCG.2013.173
18 VUILLEMOT R,PERIN C.Investigating the direct manipulation of ranking tables for time navigation[C]//Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems.Seoul:ACM,2015:2703-2706.DOI:10.1145/2702123.2702237
19 DI SCIASCIO C,SABOL V,VEAS E E.Rank as you go:User-driven exploration of search results[C]//Proceedings of the 21st International Conference on Intelligent User Interfaces.Sonoma:ACM,2016:118-129.
20 COSSOCK D,ZHANG T.Subset ranking using regression[C]//International Conference on Computational Learning Theory.Heidelberg:Springer,2006:605-619. DOI:10.1007/1177 6420_44
21 WALL E,DAS S,CHAWLA R,et al.Podium:Ranking data using mixed-initiative visual analytics[J].IEEE Transactions on Visualization and Computer Graphics,2017,24(1):288-297.DOI:10.1109/TVCG.2017.2745078
22 WANG Y,FENG K,CHU X,et al.A perception-driven approach to supervised dimensionality reduction for visualization[J].IEEE Transactions on Visualization and Computer Graphics,2017,24(5):1828-1840. DOI:10.1007/978-3-319-12610-4_3
23 LAWAL I A.Incremental SVM learning[C]//Learning from Data Streams in Evolving Environments. Cham:Springer,2019:279-296.
24 JOACHIMS T.Training linear SVMs in linear time[C]//Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Philadelphia:ACM,2006:217-226.DOI:10.1145/1150402.1150429
25 JOACHIMS T.Supported vector machine for ranking[EB/OL].[2009-03-21].http://www.cs.cornell.edu/people/tj/svm_light/svm_rank.html ?tdsourcetag=s_pcqq_aiomsg.
[1] 陶玉杰,索春凤. 基于曲边梯形面积刻画毕达哥拉斯模糊数的排序方法[J]. 浙江大学学报(理学版), 2022, 49(4): 391-397.
[2] 林杨, 王应明, 陈磊. 基于决策单元交叉排序的中立DEA评价方法[J]. 浙江大学学报(理学版), 2018, 45(2): 162-168.
[3] 万绍春, 张安, 陈永, 陈光亭. 关于带时间约束的单机排序的一个注记[J]. 浙江大学学报(理学版), 2018, 45(1): 14-17.
[4] 荣建华. 具有服务等级的可拒绝平行机排序问题[J]. 浙江大学学报(理学版), 2016, 43(6): 685-688.
[5] 荣建华, 侯丽英. 带有到达时间和拒绝费用工件的同类机排序问题[J]. 浙江大学学报(理学版), 2016, 43(5): 545-549.
[6] 何勇. 带有工件调整时间的排序复杂性[J]. 浙江大学学报(理学版), 1999, 26(3): 39-43.