Please wait a minute...
浙江大学学报(工学版)  2019, Vol. 53 Issue (3): 522-532    DOI: 10.3785/j.issn.1008-973X.2019.03.013
计算机技术     
采用评论挖掘修正用户评分的改进协同过滤算法
王红霞(),陈健,程艳芬*()
武汉理工大学 计算机科学与技术学院,湖北 武汉 430063
Improved collaborative filtering algorithm to revise users' rating by review mining
Hong-xia WANG(),Jian CHEN,Yan-fen CHENG*()
School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430063, China
 全文: PDF(992 KB)   HTML
摘要:

针对当前电子商务网站用户评分过于集中而区分度不明显,以及整数评分可信度不高导致协同过滤推荐效果较差的问题,提出一种改进协同过滤算法. 利用改进的词性路径模板算法挖掘评论中包含的产品特征和情感词,分析并建立评论特征偏好向量;依据评论特征偏好向量计算评论中包含的情感态度,利用用户评论中包含的情感态度对评分进行修正,使得修正后的评分更接近于用户的真实评分意愿;利用修正后的评分计算评分相似度,与偏好相似度结合产生推荐. 实验结果表明,该算法有效地增加了评分区分度与可信度,提高了最近邻居的质量,从而提高了推荐结果的准确度.

关键词: 评论挖掘情感态度评论特征偏好向量评分修正协同过滤    
Abstract:

An improved collaborative filtering algorithm was proposed aiming at the problems that user ratings for current e-commerce websites are too concentrated, the distinguishing degree is not obvious, and the credibility of the integer rating is not very well. Firstly, the improved part-of-speech path template algorithm was used to mine the product features and sentiment words contained in user’s reviews, and then the preference vector of the review was analyzed and established. Secondly, the emotional attitudes included in the review were calculated according to the review’s preference vector, and the emotional attitudes were used to revise the user’s rating, so that the revised rating was closer to the user’s true score willingness. Finally, the revised rating similarity and the preference similarity were combined to produce recommendations. The experimental results show that the proposed algorithm can effectively increase the classification and credibility of user’s ratings, and thus improves the quality of the nearest neighbors, as well as the accuracy of the recommended results.

Key words: review mining    emotional attitude    comment feature preference vector    rating revising    collaborative filtering
收稿日期: 2018-02-06 出版日期: 2019-03-04
CLC:  TP 391  
通讯作者: 程艳芬     E-mail: 99575522@qq.com;995132428@qq.com
作者简介: 王红霞(1977—),女,副教授,博士,从事模式识别、图像分类研究. orcid.org/0000-0003-1213-7760. E-mail: 99575522@qq.com
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
作者相关文章  
王红霞
陈健
程艳芬

引用本文:

王红霞,陈健,程艳芬. 采用评论挖掘修正用户评分的改进协同过滤算法[J]. 浙江大学学报(工学版), 2019, 53(3): 522-532.

Hong-xia WANG,Jian CHEN,Yan-fen CHENG. Improved collaborative filtering algorithm to revise users' rating by review mining. Journal of ZheJiang University (Engineering Science), 2019, 53(3): 522-532.

链接本文:

http://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2019.03.013        http://www.zjujournals.com/eng/CN/Y2019/V53/I3/522

图 1  采用评论挖掘修正用户评分算法的整体框架
评论内容 词性标注 词性句子 文献[19]算法 改进后的算法
手机外观不错 手机/n外观/n不错/a /n/n/a (外观,不错)
手机颜色也很漂亮 手机/n颜色/n也/d很/d漂亮/a /n/n/d/d/a (颜色,漂亮)
后盖是金属的 后盖/n是/v金属/n的/u /n/v/n/u
有重量 有/v重量/n /v/n
铃声大 铃声/n大/a /n/a (铃声,大) (铃声,大)
屏幕大 屏幕/n大/a /n/a (屏幕,大) (屏幕,大)
操作真的简单 操作/v真/a的/u简单/a /v/a/u/a (操作,简单)
适合中老年使用 适合/v中老年/n使用/v /v/n/v
表 1  词性路径模板算法提取效果对比
图 2  整数评分修正后的范围
Ncom Nuser Ncom Nuser
1 26 173 9 7
2 783 10 4
3 135 11 4
4 58 12 1
5 9 13 2
6 7 19 1
7 18 21 1
8 6
表 2  评论次数与对应用户数量
图 3  修正后的用户评分分布
评论内容 原评分 提取特征情感词对 情感态度 修正后的评分
整体比较满意,就感觉下面功能键不好,反应慢 5 {(整体,满意),(反应,慢)} 1.98 4.70
手机不错,外观漂亮,运行流畅,相机可以,给京东
物流一个赞
5 {(手机,不错),(外观,漂亮),(运行,流畅),
(相机,可以),(物流,赞)}
4.88 5.47
表 3  原评分与修正后评分的可信度对比
图 4  仅考虑评分时所提算法与传统协同过滤算法的MAE值对比
图 5  采用评论挖掘的不同协同过滤算法的MAE值对比
图 7  采用评论挖掘的不同协同过滤算法的多样性对比
图 6  采用评论挖掘的不同协同过滤算法的召回率对比
图 8  不同权重因子对MAE值的影响
图 9  不同权重因子对召回率的影响
1 YU X, LIU Y, HUANG X, et al Mining online reviews for predicting sales performance: a case study in the movie domain[J]. IEEE Transactions on Knowledge and Data engineering, 2012, 24 (4): 720- 734
2 TAN Y, ZHANG Y, ZHANG M, et al. A unified framework for emotional elements extraction based on finite state matching machine [M] // Natural Language Processing and Chinese Computing. Berlin Heidelberg: Springer, 2013: 60–71.
3 ZHANG Y, LAI G, ZHANG M, et al. Explicit factor models for explainable recommendation based on phrase-level sentiment analysis [C] // Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval. Gold Coast: ACM, 2014: 83–92.
4 KIRAN M V K, VINODHINI R E, ARCHANAA R, et al. User specific product recommendation and rating system by performing sentiment analysis on product reviews [C] // 4th International Conference on Advanced Computing and Communication Systems (ICACCS). Coimbatore: IEEE, 2017: 1–5.
5 CHEN G, CHEN L Augmenting service recommender systems by incorporating contextual opinions from user reviews[J]. User Modeling and User-Adapted Interaction, 2015, 25 (3): 295- 329
6 XU X, DUTTA K, DATTA A Functionality-based mobile app recommendation by identifying aspects from user reviews[J]. Journal of the Association for Information Science and Technology, 2018, 69 (2): 242- 255
7 RESCHKE K, VOGEL A, JURAFSKY D. Generating recommendation dialogs by extracting information from user reviews [C] // Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Sofia: ACL, 2013: 499–504.
8 GANU G, MARIAN A, ELHADAD N. Ursa-User review structure analysis: understanding online reviewing trends [R/OL]. Rutgers DCS Technical Report, No.668, April 2010. http://spidr-ursa.rutgers.edu/resources/TR_LRE.pdf.
9 GANU G, KAKODKAR Y, MARIAN A L Improving the quality of predictions using textual information in online user reviews[J]. Information Systems, 2013, 38 (1): 1- 15
10 MCAULEY J, LESKOVEC J. Hidden factors and hidden topics: understanding rating dimensions with review text [C] // Proceedings of the 7th ACM conference on Recommender systems. New York: ACM, 2013: 165–172.
11 DEHKORDI Y H, THOMO A, GANTI S. Incorporating user reviews as implicit feedback for improving recommender systems [C] // The 4th IEEE International Conference on Big Data and Cloud Computing. Sydney: IEEE, 2014: 455–462.
12 张付志, 刘赛, 李忠华, 等 融合用户评论和环境信息的协同过滤推荐算法[J]. 小型微型计算机系统, 2014, 35 (2): 228- 232
ZHANG Fu-zhi, LIU Sai, LI Zhong-hua, et al Collaborative filtering recommendation algorithm incorporating user's reviews and contextual information[J]. Journal of Chinese Computer Systems, 2014, 35 (2): 228- 232
13 谭云志, 张敏, 刘奕群, 等 基于用户评分和评论信息的协同推荐框架[J]. 模式识别与人工智能, 2016, 29 (4): 359- 366
TAN Yun-zhi, ZHANG Min, LIU Yi-qun, et al Collaborative recommendation framework based on ratings and textual reviews[J]. Pattern Recognition and Artificial Intelligence, 2016, 29 (4): 359- 366
14 李伟霖, 王成良, 文俊浩 基于评论与评分的协同过滤算法[J]. 计算机应用研究, 2017, 34 (2): 361- 364
LI Wei-lin, WANG Cheng-liang, WEN Jun-hao Collaborative filtering recommendation algorithm based on reviews and ratings[J]. Application Research of Computers, 2017, 34 (2): 361- 364
15 王伟军, 宋艳秋, 张婷婷, 等 基于话语标记理论的在线商品评分修正方法[J]. 情报学报, 2016, 35 (4): 358- 368
WANG Wei-jun, SONG Yan-qiu, ZHANG Ting-ting A correction method for score deviation of online product reviews based on discourse markers theory[J]. Journal of The China Society for Scientific and Technical Information, 2016, 35 (4): 358- 368
16 郑修猛, 陈福才, 黄瑞阳, 等 面向协同推荐的评论文本情感打分机制研究[J]. 信息工程大学学报, 2017, 18 (4): 464- 469
ZHENG Xiu-meng, CHEN Fu-cai, HUANG Rui-yang, et al Sentiment rating mechanism research on review text for CF recommendation systems[J]. Journal of Information Engineering University, 2017, 18 (4): 464- 469
17 YANG H, SUH Y. Sentiment analysis of online customer reviews for product recommendation: comparison with traditional CF-based recommendation [C] // The Korea Society of Management Information Systems. Seoul: KMIS, 2015: 801–805.
18 WU Y, ZHANG Q, HUANG X, et al. Phrase dependency parsing for opinion mining [C] // Proceedings of the 2009 Conference on Empirical Methods in Natural Language. Singapore: EMNLP, 2009: 1533–1541.
19 扈中凯, 郑小林, 吴亚峰, 等 基于用户评论挖掘的产品推荐算法[J]. 浙江大学学报: 工学版, 2013, 47 (8): 1475- 1485
HU Zhong-kai, ZHENG Xiao-lin, WU Ya-feng, et al Product recommendation algorithm based on users' reviews mining[J]. Journal of Zhejiang University: Engineering Science, 2013, 47 (8): 1475- 1485
20 PANG B, LEE L Opinion mining and sentiment analysis[J]. Foundations and Trends in Information Retrieval, 2008, 2 (1-2): 1- 135
21 朱郁筱, 吕琳媛 推荐系统评价指标综述[J]. 电子科技大学学报, 2012, 41 (2): 163- 175
ZHU Yu-xiao, LU Lin-yuan Evaluation metrics for recommender systems[J]. Journal of University of Electronic Science and Technology of China, 2012, 41 (2): 163- 175
[1] 李诺,郭斌,刘琰,景瑶,於志文. 神经协同过滤智能商业选址方法[J]. 浙江大学学报(工学版), 2019, 53(9): 1788-1794.
[2] 董立岩,金佳欢,方塬程,王越群,李永丽,孙铭会. 基于非负矩阵分解的Slope One算法[J]. 浙江大学学报(工学版), 2019, 53(7): 1349-1353.
[3] 厉小军,柳虹,施寒潇,朱柳青,张亚辉. 基于深度学习的课程推荐模型[J]. 浙江大学学报(工学版), 2019, 53(11): 2139-2145.
[4] 任迪, 万健, 殷昱煜, 周丽, 高敏. 基于贝叶斯分类的Web服务质量预测方法研究[J]. 浙江大学学报(工学版), 2017, 51(6): 1242-1251.
[5] 毛宜钰, 刘建勋, 胡蓉, 唐明董. 基于Logistic函数和用户聚类的协同过滤算法[J]. 浙江大学学报(工学版), 2017, 51(6): 1252-1258.
[6] 居斌, 钱沄涛, 叶敏超. 基于结构投影非负矩阵分解的协同过滤算法[J]. 浙江大学学报(工学版), 2015, 49(7): 1319-1325.
[7] 扈中凯, 郑小林, 吴亚峰, 陈德人. 基于用户评论挖掘的产品推荐算法[J]. J4, 2013, 47(8): 1475-1485.