Please wait a minute...
浙江大学学报(工学版)
通信工程、自动化技术     
基于结构投影非负矩阵分解的协同过滤算法
居斌1,2, 钱沄涛1, 叶敏超1
1.浙江大学 计算机学院,浙江 杭州 310027;2.浙江省卫生信息中心,浙江 杭州 310006
Collaborative filtering algorithm based on structured projective nonnegative matrix factorization
JU Bin1,2, QIAN Yun-tao1, YE Min-chao1
1.College of Computer Science, Zhejiang University, Hangzhou 310027, China; 2.Health Information Center of Zhejiang Province, Hangzhou 310006, China
 全文: PDF(759 KB)   HTML
摘要:

针对在协同过滤算法中,传统矩阵分解技术在降维过程中会破坏数据相邻结构的问题,提出基于结构投影非负矩阵分解的协同过滤算法(CF-SPNMF).该算法包含离线学习和在线搜索2个阶段.在离线学习阶段,通过对用户评分矩阵的投影非负矩阵分解,同时保留用户特征的聚类结构,得到低维的用户潜在兴趣因子.在线搜索阶段,将用户潜在兴趣因子进行余弦相似性匹配,发现目标用户与训练样本用户之间兴趣最相似的邻域集合.在实际数据集上的实验结果表明,提出的CF-SPNMF算法与单纯使用矩阵分解和单纯在原评分矩阵上进行用户聚类的推荐算法相比,能够更有效地预测用户实际评分.

Abstract:

In collaborative filtering algorithm, the classical matrix factorization may destroy the adjacent structures among data points from high dimension to low dimension. A  novel collaborative filtering algorithm  based on structured projective nonnegative matrix factorization (CF-SPNMF)  was proposed in order to overcome the problem. The algorithm contains both offline learning and online searching. In the offline learning stage, projective nonnegative matrix factorization was applied to obtain the low dimensional latent factors of user preference without changing the intrinsic structure of users cluster. In the online searching stage, cosine similarity was used to measure the similarity between the target user and training users based on the latent factors inferred in the offline stage. Then the most similar neighbor set was further found. The extensive experiments on real-world data set demonstrate that the proposed CF-SPNMF achieves better rating prediction performance than traditional methods using either matrix factorization or users clustering in original rating matrix.

出版日期: 2015-09-10
:  TP 181  
基金资助:

浙江省自然科学基金资助项目(Y1101359);国家科技支撑计划资助项目(2011BAD24B03)

通讯作者: 钱沄涛,男,教授,博导     E-mail: ytqian@zju.edu.cn
作者简介: 居斌(1975-),男,博士生,从事机器学习、数据挖掘的研究.E-mail:jubin_hz@163.com
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章  

引用本文:

居斌, 钱沄涛, 叶敏超. 基于结构投影非负矩阵分解的协同过滤算法[J]. 浙江大学学报(工学版), 10.3785/j.issn.1008-973X.2015.07.017.

JU Bin, QIAN Yun-tao, YE Min-chao. Collaborative filtering algorithm based on structured projective nonnegative matrix factorization. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 10.3785/j.issn.1008-973X.2015.07.017.

链接本文:

http://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2015.07.017        http://www.zjujournals.com/eng/CN/Y2015/V49/I7/1319

[1]LU L, MEDO M, YEUNG C H, et al. Recommender systems [J]. Physics Reports, 2012, 519(1): 149.
[2]SU X, KHOSHGOFTAAR T M. A survey of collaborative filtering techniques [J]. Advances in Artificial Intelligence, 2009, 2009(10): 424.
[3]LINDEN G, SMITH B, YORK J. Amazon. com recommendations: item-to-item collaborative filtering [J]. Internet Computing, IEEE, 2003, 7(1): 76-80.
[4]ADOMAVICIUS G, TUZHILIN A. Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions [J]. IEEE Transactions on Knowledge and Data Engineering, 2005, 17(6): 734-749.
[5]KOREN Y, BELL R, VOLINSKY C. Matrix factorization techniques for recommender systems [J]. Computer, 2009, 42(8): 30-37.
[6]PATEREK A. Improving regularized singular value decomposition for collaborative filtering[C]∥Proceedings of KDD Cup and Workshop. California: ACM, 2007: 58.
[7]ZHANG S, WANG W, FORD J, et al. Using singular value decomposition approximation for collaborative filtering [C]∥ 7th IEEE International Conference on E-Commerce Technology. Ankara: IEEE, 2005: 257-264.
[8]SREBRO N, JAAKKOLA T. Weighted low-rank approximations [C]∥ICML. Washington: DBML, 2003: 720-727.
[9]MNIH A, SALAKHUTDINOV R. Probabilistic matrix factorization [C]∥Advances in Neural Information Processing Systems. Vancouver: MIT, 2007: 1257-1264.
[10]LEE D D, SEUNG H S. Learning the parts of objects by non-negative matrix factorization [J]. Nature, 1999, 401(6755): 788-791.
[11]HOYER P O. Non-negative sparse coding [C]∥ Proceedings of the 2002 12th IEEE Workshop on Neural Networks for Signal Processing. London: IEEE, 2002: 557-565.
[12]胡俐蕊,吴建国,汪磊. 线性投影非负矩阵分解方法及应用[J]. 计算机科学, 2013, 40(10): 269-273.
HU Li-rui, WU Jian-guo,WANG Lei.Application and method for linear projective non-negative matrix factorization \[J\].Computer Science,2013,40(10):269-273.
[13]YUAN Z, YANG Z, OJA E. Projective nonnegative matrix factorization: sparseness, orthogonality, and clustering [J]. Neural Processing Letters, 2009, 2009(1): 33-47.
[14]YANG J, YANG S, FU Y, et al. Non-negative graph embedding [C]∥ IEEE Conference on Computer Vision and Pattern Recognition. Hongkong: IEEE, 2008: 18.
[15]LIU X, YAN S, JIN H. Projective nonnegative graph embedding [J]. IEEE Transactions on Image Processing, 2010, 19(5): 1126-1137.
[16]BELKIN M, NIYOGI P. Laplacian eigenmaps and spectral techniques for embedding and clustering [C]∥NIPS. Vancouver: MIT, 2001: 585-591.
[17]NIYOGI X. Locality preserving projections [C]∥Neural Information Processing Systems. Los Angeles: MIT, 2004: 153-161.
[18]CAI D, HE X, HAN J, et al. Graph regularized nonnegative matrix factorization for data representation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33(8): 1548-1560.
[19]SEUNG D, LEE L. Algorithms for non-negative matrix factorization [J]. Advances in Neural Information Processing Systems, 2001, 2001(3): 556-562.
[20]ZHANG Z, ZHAO K, ZHA H. Inducible regularization for low-rank matrix factorizations for collaborative filtering [J]. Neurocomputing, 2012, 97(1): 52-62.
[21]HU Li-rui, WU Jian-guo, WANG Lei. Application and method for linear projective non-negative matrix factorization [J]. Computer Science, 2013, 40(10): 269-273.

[1] 朱东阳, 沈静逸, 黄炜平, 梁军. 基于主动学习和加权支持向量机的工业故障识别[J]. 浙江大学学报(工学版), 2017, 51(4): 697-705.
[2] 丰小月, 梁艳春, 林希珣, 管仁初. 永恒语言学习研究与发展[J]. 浙江大学学报(工学版), 2017, 51(1): 82-88.
[3] 裘日辉, 刘康玲, 谭海龙, 梁军. 基于极限学习机的分类算法及在故障识别中的应用[J]. 浙江大学学报(工学版), 2016, 50(10): 1965-1972.
[4] 谭海龙, 刘康玲, 金鑫, 石向荣, 梁军. 基于μσ-DWC特征和树结构M-SVM的多维时间序列分类[J]. 浙江大学学报(工学版), 2015, 49(6): 1061-1069.
[5] 林亦宁, 韦巍, 戴渊明. 半监督Hough Forest跟踪算法[J]. J4, 2013, 47(6): 977-983.
[6] 李侃,黄文雄,黄忠华. 基于支持向量机的多传感器探测目标分类方法[J]. J4, 2013, 47(1): 15-22.
[7] 戴兴虎,钱沄涛,唐凤仙,居斌. 基于图表标题信息的在线生物文献MRI图像检测[J]. J4, 2012, 46(7): 1307-1313.
[8] 姚伏天, 钱沄涛, 李吉明. 空间约束半监督高斯过程下的高光谱图像分类[J]. J4, 2012, 46(7): 1295-1300.
[9] 王洪波, 赵光宙, 齐冬莲, 卢达. 一类支持向量机的快速增量学习方法[J]. J4, 2012, 46(7): 1327-1332.
[10] 艾解清, 高济, 彭艳斌, 郑志军. 基于直推式支持向量机的协商决策模型[J]. J4, 2012, 46(6): 967-973.
[11] 潘俊, 孔繁胜, 王瑞琴. 局部敏感判别直推学习机[J]. J4, 2012, 46(6): 987-994.
[12] 金卓军, 钱徽, 朱淼良. 基于倾向性分析的轨迹评测技术[J]. J4, 2011, 45(10): 1732-1737.
[13] 潘俊,孔繁胜,王瑞琴. 加权成对约束投影半监督聚类[J]. J4, 2011, 45(5): 934-940.
[14] 顾弘, 赵光宙. 广义局部图像距离函数下的图像分类与识别[J]. J4, 2011, 45(4): 596-601.
[15] 罗建宏,陈德钊. 兼顾正确率和差异性的自适应集成算法及应用[J]. J4, 2011, 45(3): 557-562.