A Deep Recommendation Model Based on the Rating Matrix and Review Text (cited by: 42)

Joint Deep Modeling of Rating Matrix and Reviews for Recommendation
Abstract: With the growing popularity of the Internet and smart mobile devices, the time people spend online keeps rising. To improve office efficiency and the consumption experience, companies provide a wide variety of products and services to meet users' different needs, but this also makes it harder for users to quickly reach satisfactory decisions from a large amount of information. Because it helps different users find the items they are interested in based on their historical behavior, the recommender system has become an extremely important part of online activities such as online shopping, reading articles, and watching movies. To provide a personalized recommendation service, accurately predicting a user's rating of an item is a key issue the recommender system needs to solve. Among rating-matrix-based approaches, one of the most successful is matrix factorization, which has been widely studied and applied to model user preferences and item characteristics from rating data. However, the performance of these methods is severely restricted by the data sparsity problem, i.e., a shortage of trainable data. To overcome this limitation, recommendation models based on review text capture user preferences and item features from the text data, effectively alleviating the sparsity of the rating data, but they ignore the latent factors of users and items in the rating matrix. Considering both kinds of models, and to further improve recommendation quality, models combining the rating matrix and review text have been proposed one after another. However, they remain limited to the shallow linear feature level, and the high-level abstract features of users and items are not fully explored. Therefore, this paper proposes the deep learning model DeepCLFM (Deep Collaborative Latent Factor Model). First, the pre-trained BERT model is used as the encoder of the review text; BERT is a general-purpose "language understanding" model trained on a large text corpus such as Wikipedia. Second, to capture the latent relationships between different reviews in a review set, DeepCLFM extracts deep nonlinear feature vectors of users and items from the review embeddings through a bidirectional GRU. Additionally, DeepCLFM introduces an attention mechanism to measure the contribution of each review, and adopts a matrix factorization module to learn latent factors from the IDs of users and items. Finally, to fully integrate the deep nonlinear features and the latent factors, DeepCLFM combines them through first- and second-order interaction terms to predict the user's rating of the item. Experiments are conducted on five public Amazon Product Review datasets, in which each sample contains a user ID, an item ID, the user's rating of the item (1 to 5 points), and the user's review text for the item. The mean square error (MSE) of the recommendation results is used as the evaluation metric. The results show that the prediction error of DeepCLFM is lower than that of many strong baseline algorithms, with the average prediction error reduced by up to 6.402%. Moreover, DeepCLFM outperforms traditional matrix factorization in the "cold start" scenario.
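The fusion step described in the abstract (attention-pooled deep review features plus ID-based latent factors, combined via first- and second-order terms) can be sketched as follows. This is a minimal numpy illustration under stated assumptions, not the paper's implementation: the random vectors stand in for the BERT + BiGRU review encodings and the ID-embedding latent factors, and the second-order term is reduced to the simplest FM-style inner product.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8  # dimension of the (stand-in) deep review features

def attention_pool(review_vecs, w):
    # Score each encoded review, softmax-normalize the scores, and
    # return the weighted sum -- a stand-in for the review attention
    # the paper applies on top of the BiGRU outputs.
    scores = review_vecs @ w
    alpha = np.exp(scores - scores.max())
    alpha /= alpha.sum()
    return alpha @ review_vecs

def fm_predict(user_feat, item_feat, w0, w1):
    # First-order term: bias plus linear weights over the concatenated
    # user/item features.
    x = np.concatenate([user_feat, item_feat])
    first = w0 + w1 @ x
    # Second-order term: user-item feature interaction, here reduced to
    # an inner product (the simplest FM-style pairwise interaction).
    second = user_feat @ item_feat
    return first + second

# Invented stand-ins: in DeepCLFM these come from BERT + BiGRU (deep
# review features) and from user/item ID embeddings (latent factors).
user_reviews = rng.normal(size=(5, d))
item_reviews = rng.normal(size=(7, d))
w_att = rng.normal(size=d)
u = np.concatenate([attention_pool(user_reviews, w_att), rng.normal(size=4)])
v = np.concatenate([attention_pool(item_reviews, w_att), rng.normal(size=4)])
rating = fm_predict(u, v, w0=3.5, w1=rng.normal(size=2 * (d + 4)) * 0.01)
print(float(rating))
```

In the paper itself the deep features and latent factors are learned jointly; here all weights are random, so only the shapes and the order of operations are meaningful.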
Authors: FENG Xing-Jie (冯兴杰) and ZENG Yun-Ze (曾云泽), School of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, China
Source: Chinese Journal of Computers (《计算机学报》; indexed in EI, CSCD, Peking University Core), 2020, No. 5, pp. 884-900 (17 pages)
Funding: Joint Funds of the National Natural Science Foundation of China and the Civil Aviation Administration of China (U1233113, U1633110); National Natural Science Foundation of China Young Scientists Fund (61301245, 61201414).
Keywords: recommender systems; review text; rating matrix; neural network; cold start
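For reference, the MSE metric used in the paper's experiments is simply the mean of squared differences between true and predicted ratings; a self-contained sketch (plain numpy, nothing model-specific):

```python
import numpy as np

def mse(y_true, y_pred):
    # Mean square error over a batch of predicted ratings.
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean((y_true - y_pred) ** 2))

# Three ratings on the paper's 1-5 scale (illustrative values only).
print(mse([5, 3, 4], [4.5, 3.5, 4.0]))  # (0.25 + 0.25 + 0) / 3
```

Lower MSE means better rating prediction, which is why the abstract reports error reductions rather than accuracy gains.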


