Please wait a minute...
Journal of ZheJiang University (Engineering Science)  2019, Vol. 53 Issue (3): 522-532    DOI: 10.3785/j.issn.1008-973X.2019.03.013
Computer Technology     
Improved collaborative filtering algorithm to revise users' rating by review mining
Hong-xia WANG(),Jian CHEN,Yan-fen CHENG*()
School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430063, China
Download: HTML     PDF(992KB) HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

An improved collaborative filtering algorithm was proposed aiming at the problems that user ratings for current e-commerce websites are too concentrated, the distinguishing degree is not obvious, and the credibility of the integer rating is not very well. Firstly, the improved part-of-speech path template algorithm was used to mine the product features and sentiment words contained in user’s reviews, and then the preference vector of the review was analyzed and established. Secondly, the emotional attitudes included in the review were calculated according to the review’s preference vector, and the emotional attitudes were used to revise the user’s rating, so that the revised rating was closer to the user’s true score willingness. Finally, the revised rating similarity and the preference similarity were combined to produce recommendations. The experimental results show that the proposed algorithm can effectively increase the classification and credibility of user’s ratings, and thus improves the quality of the nearest neighbors, as well as the accuracy of the recommended results.



Key wordsreview mining      emotional attitude      comment feature preference vector      rating revising      collaborative filtering     
Received: 06 February 2018      Published: 04 March 2019
CLC:  TP 391  
Corresponding Authors: Yan-fen CHENG     E-mail: 99575522@qq.com;995132428@qq.com
Cite this article:

Hong-xia WANG,Jian CHEN,Yan-fen CHENG. Improved collaborative filtering algorithm to revise users' rating by review mining. Journal of ZheJiang University (Engineering Science), 2019, 53(3): 522-532.

URL:

http://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2019.03.013     OR     http://www.zjujournals.com/eng/Y2019/V53/I3/522


采用评论挖掘修正用户评分的改进协同过滤算法

针对当前电子商务网站用户评分过于集中而区分度不明显,以及整数评分可信度不高导致协同过滤推荐效果较差的问题,提出一种改进协同过滤算法. 利用改进的词性路径模板算法挖掘评论中包含的产品特征和情感词,分析并建立评论特征偏好向量;依据评论特征偏好向量计算评论中包含的情感态度,利用用户评论中包含的情感态度对评分进行修正,使得修正后的评分更接近于用户的真实评分意愿;利用修正后的评分计算评分相似度,与偏好相似度结合产生推荐. 实验结果表明,该算法有效地增加了评分区分度与可信度,提高了最近邻居的质量,从而提高了推荐结果的准确度.


关键词: 评论挖掘,  情感态度,  评论特征偏好向量,  评分修正,  协同过滤 
Fig.1 Overall framework of improved collaborative filtering algorithm based on revising user’s ratings by mining reviews
评论内容 词性标注 词性句子 文献[19]算法 改进后的算法
手机外观不错 手机/n外观/n不错/a /n/n/a (外观,不错)
手机颜色也很漂亮 手机/n颜色/n也/d很/d漂亮/a /n/n/d/d/a (颜色,漂亮)
后盖是金属的 后盖/n是/v金属/n的/u /n/v/n/u
有重量 有/v重量/n /v/n
铃声大 铃声/n大/a /n/a (铃声,大) (铃声,大)
屏幕大 屏幕/n大/a /n/a (屏幕,大) (屏幕,大)
操作真的简单 操作/v真/a的/u简单/a /v/a/u/a (操作,简单)
适合中老年使用 适合/v中老年/n使用/v /v/n/v
Tab.1 Comparison of extraction effects of speech path template algorithm
Fig.2 Range of revised integer rating
Ncom Nuser Ncom Nuser
1 26 173 9 7
2 783 10 4
3 135 11 4
4 58 12 1
5 9 13 2
6 7 19 1
7 18 21 1
8 6
Tab.2 Number of reviews and corresponding users
Fig.3 Distribution of revised user rating distribution
评论内容 原评分 提取特征情感词对 情感态度 修正后的评分
整体比较满意,就感觉下面功能键不好,反应慢 5 {(整体,满意),(反应,慢)} 1.98 4.70
手机不错,外观漂亮,运行流畅,相机可以,给京东
物流一个赞
5 {(手机,不错),(外观,漂亮),(运行,流畅),
(相机,可以),(物流,赞)}
4.88 5.47
Tab.3 Credibility comparison of original rating and revised rating
Fig.4 Comparison of MAE values between proposed algorithm and traditional collaborative filtering algorithm considering only score
Fig.5 Comparison of MAE values among different collaborative algorithms based on review mining
Fig.7 Comparison of diversity values between different collaborative algorithms based on review mining
Fig.6 Comparison of recall rate among different collaborative algorithms based on review mining
Fig.8 Effect of different weighting factors on MAE value
Fig.9 Effect of different weight factors on recall rate
[1]   YU X, LIU Y, HUANG X, et al Mining online reviews for predicting sales performance: a case study in the movie domain[J]. IEEE Transactions on Knowledge and Data engineering, 2012, 24 (4): 720- 734
[2]   TAN Y, ZHANG Y, ZHANG M, et al. A unified framework for emotional elements extraction based on finite state matching machine [M] // Natural Language Processing and Chinese Computing. Berlin Heidelberg: Springer, 2013: 60–71.
[3]   ZHANG Y, LAI G, ZHANG M, et al. Explicit factor models for explainable recommendation based on phrase-level sentiment analysis [C] // Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval. Gold Coast: ACM, 2014: 83–92.
[4]   KIRAN M V K, VINODHINI R E, ARCHANAA R, et al. User specific product recommendation and rating system by performing sentiment analysis on product reviews [C] // 4th International Conference on Advanced Computing and Communication Systems (ICACCS). Coimbatore: IEEE, 2017: 1–5.
[5]   CHEN G, CHEN L Augmenting service recommender systems by incorporating contextual opinions from user reviews[J]. User Modeling and User-Adapted Interaction, 2015, 25 (3): 295- 329
[6]   XU X, DUTTA K, DATTA A Functionality-based mobile app recommendation by identifying aspects from user reviews[J]. Journal of the Association for Information Science and Technology, 2018, 69 (2): 242- 255
[7]   RESCHKE K, VOGEL A, JURAFSKY D. Generating recommendation dialogs by extracting information from user reviews [C] // Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Sofia: ACL, 2013: 499–504.
[8]   GANU G, MARIAN A, ELHADAD N. Ursa-User review structure analysis: understanding online reviewing trends [R/OL]. Rutgers DCS Technical Report, No.668, April 2010. http://spidr-ursa.rutgers.edu/resources/TR_LRE.pdf.
[9]   GANU G, KAKODKAR Y, MARIAN A L Improving the quality of predictions using textual information in online user reviews[J]. Information Systems, 2013, 38 (1): 1- 15
[10]   MCAULEY J, LESKOVEC J. Hidden factors and hidden topics: understanding rating dimensions with review text [C] // Proceedings of the 7th ACM conference on Recommender systems. New York: ACM, 2013: 165–172.
[11]   DEHKORDI Y H, THOMO A, GANTI S. Incorporating user reviews as implicit feedback for improving recommender systems [C] // The 4th IEEE International Conference on Big Data and Cloud Computing. Sydney: IEEE, 2014: 455–462.
[12]   张付志, 刘赛, 李忠华, 等 融合用户评论和环境信息的协同过滤推荐算法[J]. 小型微型计算机系统, 2014, 35 (2): 228- 232
ZHANG Fu-zhi, LIU Sai, LI Zhong-hua, et al Collaborative filtering recommendation algorithm incorporating user's reviews and contextual information[J]. Journal of Chinese Computer Systems, 2014, 35 (2): 228- 232
[13]   谭云志, 张敏, 刘奕群, 等 基于用户评分和评论信息的协同推荐框架[J]. 模式识别与人工智能, 2016, 29 (4): 359- 366
TAN Yun-zhi, ZHANG Min, LIU Yi-qun, et al Collaborative recommendation framework based on ratings and textual reviews[J]. Pattern Recognition and Artificial Intelligence, 2016, 29 (4): 359- 366
[14]   李伟霖, 王成良, 文俊浩 基于评论与评分的协同过滤算法[J]. 计算机应用研究, 2017, 34 (2): 361- 364
LI Wei-lin, WANG Cheng-liang, WEN Jun-hao Collaborative filtering recommendation algorithm based on reviews and ratings[J]. Application Research of Computers, 2017, 34 (2): 361- 364
[15]   王伟军, 宋艳秋, 张婷婷, 等 基于话语标记理论的在线商品评分修正方法[J]. 情报学报, 2016, 35 (4): 358- 368
WANG Wei-jun, SONG Yan-qiu, ZHANG Ting-ting A correction method for score deviation of online product reviews based on discourse markers theory[J]. Journal of The China Society for Scientific and Technical Information, 2016, 35 (4): 358- 368
[16]   郑修猛, 陈福才, 黄瑞阳, 等 面向协同推荐的评论文本情感打分机制研究[J]. 信息工程大学学报, 2017, 18 (4): 464- 469
ZHENG Xiu-meng, CHEN Fu-cai, HUANG Rui-yang, et al Sentiment rating mechanism research on review text for CF recommendation systems[J]. Journal of Information Engineering University, 2017, 18 (4): 464- 469
[17]   YANG H, SUH Y. Sentiment analysis of online customer reviews for product recommendation: comparison with traditional CF-based recommendation [C] // The Korea Society of Management Information Systems. Seoul: KMIS, 2015: 801–805.
[18]   WU Y, ZHANG Q, HUANG X, et al. Phrase dependency parsing for opinion mining [C] // Proceedings of the 2009 Conference on Empirical Methods in Natural Language. Singapore: EMNLP, 2009: 1533–1541.
[19]   扈中凯, 郑小林, 吴亚峰, 等 基于用户评论挖掘的产品推荐算法[J]. 浙江大学学报: 工学版, 2013, 47 (8): 1475- 1485
HU Zhong-kai, ZHENG Xiao-lin, WU Ya-feng, et al Product recommendation algorithm based on users' reviews mining[J]. Journal of Zhejiang University: Engineering Science, 2013, 47 (8): 1475- 1485
[20]   PANG B, LEE L Opinion mining and sentiment analysis[J]. Foundations and Trends in Information Retrieval, 2008, 2 (1-2): 1- 135
[21]   朱郁筱, 吕琳媛 推荐系统评价指标综述[J]. 电子科技大学学报, 2012, 41 (2): 163- 175
ZHU Yu-xiao, LU Lin-yuan Evaluation metrics for recommender systems[J]. Journal of University of Electronic Science and Technology of China, 2012, 41 (2): 163- 175
[1] Nuo LI,Bin GUO,Yan LIU,Yao JING,Zhi-wen YU. Intelligent commercial site recommendation with neural collaborative filtering[J]. Journal of ZheJiang University (Engineering Science), 2019, 53(9): 1788-1794.
[2] Li-yan DONG,Jia-huan JIN,Yuan-cheng FANG,Yue-qun WANG,Yong-li LI,Ming-hui SUN. Slope One algorithm based on nonnegative matrix factorization[J]. Journal of ZheJiang University (Engineering Science), 2019, 53(7): 1349-1353.
[3] Xiao-jun LI,Hong LIU,Han-xiao SHI,Liu-qing ZHU,Ya-hui ZHANG. Deep learning based course recommendation model[J]. Journal of ZheJiang University (Engineering Science), 2019, 53(11): 2139-2145.
[4] REN Di, WAN Jian, YIN Yu-yu, ZHOU Li, GAO Min. Web services QoS prediction method based on Bayes classification[J]. Journal of ZheJiang University (Engineering Science), 2017, 51(6): 1242-1251.
[5] MAO Yi-yu, LIU Jian-xun, HU Rong, TANG Ming-dong. Collaborative filtering algorithm based on Logistic function and user clustering[J]. Journal of ZheJiang University (Engineering Science), 2017, 51(6): 1252-1258.
[6] JU Bin, QIAN Yun-tao, YE Min-chao. Collaborative filtering algorithm based on structured projective nonnegative matrix factorization[J]. Journal of ZheJiang University (Engineering Science), 2015, 49(7): 1319-1325.
[7] HU Zhong-kai, ZHENG Xiao-lin, WU Ya-feng, CHEN De-ren. Product recommendation algorithm based on users’ reviews mining[J]. Journal of ZheJiang University (Engineering Science), 2013, 47(8): 1475-1485.