Please wait a minute...
Data Analysis and Knowledge Discovery  2023, Vol. 7 Issue (7): 146-155    DOI: 10.11925/infotech.2096-3467.2022.0715
Current Issue | Archive | Adv Search |
Literature Recommendation Algorithm Integrating High-Order Similarity of Motif Structure
Chen Liu,Guo Yuhong()
School of Cyber Science and Engineering, University of International Relations, Beijing 100091, China
Download: PDF (994 KB)   HTML ( 16
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper applies the collaborative filtering method to the field of literature recommendation. It incorporates high-order similarity features reflected by the Motif structure in the user cosine similarity network to improve the recommendation quality. [Methods] Firstly, we constructed the user preference data for literature using their behavior information of collecting literature and the citation relationship between literature. Secondly, in the user cosine similarity network based on user literature collection behavior information, we captured the high-order similarity with subgraph—Motif structure within the network. Finally, we integrated user cosine and high-order similarity based on Motif structure into the matrix factorization recommendation algorithm to predict user preferences for literature. [Results] Compared with the traditional matrix factorization recommendation algorithms, this algorithm's RMSE and MAE metrics were reduced by 0.0482 and 0.0379, respectively. [Limitations] The proposed algorithm does not consider the temporal decay of the literature. [Conclusions] The new algorithm reduces the prediction error of user preferences and improves the literature recommendation quality.

Key wordsLiterature Recommendation      Motif Structure      User High-Order Similarity      Matrix Factorization     
Received: 11 July 2022      Published: 21 March 2023
ZTFLH:  TP391  
  G250  
Fund:Academic Training Project for UIR Students(3262021SYJ007)
Corresponding Authors: Guo Yuhong,ORCID:0000-0003-3336-0611,E-mail: yhguo@uir.cn。   

Cite this article:

Chen Liu, Guo Yuhong. Literature Recommendation Algorithm Integrating High-Order Similarity of Motif Structure. Data Analysis and Knowledge Discovery, 2023, 7(7): 146-155.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2022.0715     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2023/V7/I7/146

Four Node Network Example
Seven Three-Node Motif Structures
Calculation of Adjacency Matrix Based on M4 Structure
符号 描述
P m × n 用户-文献收藏矩阵
Q n × n 文献-文献引用矩阵
F 用户集合
L 文献集合
C m × n 用户-文献初始偏好矩阵
R m × n 用户-文献偏好矩阵
i , k 用户集中的某位用户
j 文献集中的某篇文献
T m × m 用户余弦相似度矩阵
H m × m 基于矩阵 T且对角线上值为0的矩阵
G 基于邻接矩阵H的加权有向图
A 基于图G的无权邻接矩阵
D m × m 基于Motif结构的用户高阶相似度矩阵
S m × m 用户增强相似度矩阵
R ^ m × n 用户-文献预测偏好矩阵
U m × d 用户特征矩阵
V n × d 文献特征矩阵
Symbol Definition
Algorithm Flow
Example of Motif Structure
User-Literature Collection Relationship and Literature-Literature Citation Relationship
Calculation Example of High-Order Similarity Matrix Based on Motif Structure
参数名称 默认值 参数名称 默认值
梯度下降步长 0.01 正则项参数β 0.001
迭代次数 300 融合参数σ 11
特征维度 20 平衡参数α 10
Default Value of Experimental Parameters
算法 RMSE MAE
BiasSVD 0.350 6 0.233 1
MF 0.346 3 0.231 9
SVD++ 0.347 7 0.231 7
JMF-UCP 0.345 7 0.224 4
本文算法 0.298 1 0.194 0
Performance Comparison of Different Algorithms
Changes of RMSE Under Different Fusion Parameters σ
Changes of MAE Under Different Fusion Parameters σ
平衡参数α RMSE MAE
0 0.304 8 0.194 3
10 0.298 1 0.194 0
20 0.300 1 0.196 4
Results of Different Equilibrium Parameters α
[1] 陈碧毅, 黄玲, 王昌栋, 等. 融合显式反馈与隐式反馈的协同过滤推荐算法[J]. 软件学报, 2020, 31(3): 794-805.
[1] (Chen Biyi, Huang Ling, Wang Changdong, et al. Explicit and Implicit Feedback Based Collaborative Filtering Algorithm[J]. Journal of Software, 2020, 31(3): 794-805.)
[2] 丁浩, 艾文华, 胡广伟, 等. 融合用户兴趣波动时序的个性化推荐模型[J]. 数据分析与知识发现, 2021, 5(11): 45-58.
[2] (Ding Hao, Ai Wenhua, Hu Guangwei, et al. A Personalized Recommendation Model with Time Series Fluctuation of User Interest[J]. Data Analysis and Knowledge Discovery, 2021, 5(11): 45-58.)
[3] 焦富森, 李树青. 基于物品质量和用户评分修正的协同过滤推荐算法[J]. 数据分析与知识发现, 2019, 3(8): 62-67.
[3] (Jiao Fusen, Li Shuqing. Collaborative Filtering Recommendation Based on Item Quality and User Ratings[J]. Data Analysis and Knowledge Discovery, 2019, 3(8): 62-67.)
[4] 王井. 一种基于订阅记录的图书协同过滤推荐方法研究[J]. 情报科学, 2020, 38(3): 54-59.
[4] (Wang Jing. A Study of Collaborative Filtering Recommendation Method Based on Subscription Records[J]. Information Science, 2020, 38(3): 54-59.)
[5] 张艳菊, 马璐. 数据缺失下的IFRSIFCM协同过滤推荐算法[J]. 控制工程, 2022, 29(3): 542-550.
[5] (Zhang Yanju, Ma Lu. IFRSIFCM Collaborative Filtering Recommendation Algorithm with Missing Data[J]. Control Engineering of China, 2022, 29(3): 542-550.)
[6] 贾俊杰, 姚叶旺, 陈旺虎. 基于非负矩阵分解的群组推荐算法[J]. 计算机工程与科学, 2022, 44(5): 933-943.
[6] (Jia Junjie, Yao Yewang, Chen Wanghu. A Group Recommendation Algorithm Based on Non-Negative Matrix Factorization[J]. Computer Engineering and Science, 2022, 44(5): 933-943.)
[7] 武聪, 马文明, 王冰, 等. 融合用户标签相似度的矩阵分解算法[J]. 南京大学学报(自然科学), 2022, 58(1): 143-152.
[7] (Wu Cong, Ma Wenming, Wang Bing, et al. Matrix Factorization Algorithm Combined with User Tag Similarity[J]. Journal of Nanjing University (Natural Science), 2022, 58(1): 143-152.)
[8] 韩立锋, 陈莉, 史晓龙. 融合项目属性偏好的矩阵分解推荐模型[J]. 西安电子科技大学学报, 2022, 49(3): 147-159.
[8] (Han Lifeng, Chen Li, Shi Xiaolong. Matrix Decomposition Recommendation Model Incorporating Item Attribute Preference[J]. Journal of Xidian University, 2022, 49(3): 147-159.)
[9] Shi Y, Serdyukov P, Hanjalic A, et al. Personalized Landmark Recommendation Based on Geotags from Photo Sharing Sites[C]// Proceedings of the 5th International AAAI Conference on Weblogs and Social Media. 2011, 5(1): 622-625.
[10] 何海洋, 王勇, 蔡国永. 基于用户类别偏好相似度和联合矩阵分解的推荐算法[J]. 数据采集与处理, 2018, 33(1): 179-185.
[10] (He Haiyang, Wang Yong, Cai Guoyong. Recommendation Algorithm Based on User Category Preference Similarity and Joint Ma-Trix Factorization[J]. Journal of Data Acquisition & Processing, 2018, 33(1): 179-185.)
[11] 王根生, 潘方正. 融合加权异构信息网络的矩阵分解推荐算法[J]. 数据分析与知识发现, 2020, 4(12): 76-84.
[11] (Wang Gensheng, Pan Fangzheng. Matrix Factorization Algorithm with Weighted Heterogeneous Information Network[J]. Data Analysis and Knowledge Discovery, 2020, 4(12): 76-84.)
[12] 郭磊, 余文森, 吴清寿. 融合信任关系的联合矩阵分解推荐算法仿真[J]. 计算机仿真, 2021, 38(2): 378-382.
[12] (Guo Lei, Yu Wensen, Wu Qingshou. Simulation of Joint Matrix Factorization Recommendation Algorithm Integrating Trust Relationship[J]. Computer Simulation, 2021, 38(2): 378-382.)
[13] 陈珏伊, 朱颖琪, 周刚, 等. 基于迁移的联合矩阵分解的协同过滤算法[J]. 四川大学学报(自然科学版), 2020, 57(6): 1096-1102.
[13] (Chen Jueyi, Zhu Yingqi, Zhou Gang, et al. Collaborative Filtering Recommendation Based on Transfer Learning and Joint Matrix Decomposition[J]. Journal of Sichuan University (Natural Science Edition), 2020, 57(6): 1096-1102.)
[14] 文凯, 朱传亮. 融合社交网络和兴趣的正则化矩阵分解推荐模型[J]. 计算机应用, 2018, 38(9): 2523-2528.
doi: 10.11772/j.issn.1001-9081.2018030683
[14] (Wen Kai, Zhu Chuanliang. Regularized Matrix Decomposition Recommendation Model Integrating Social Networks and Interest Correlation[J]. Journal of Computer Applications, 2018, 38(9): 2523-2528.)
doi: 10.11772/j.issn.1001-9081.2018030683
[15] 屈冰洋, 王亚民. 基于深度学习的科技信息文献推荐模型研究[J]. 情报理论与实践, 2021, 44(11): 160-165.
doi: 10.16353/j.cnki.1000-7490.2021.11.021
[15] (Qu Bingyang, Wang Yamin. Research on Science and Technology Information Papers Recommendation Model Based on Deep Learning[J]. Information Studies:Theory & Application, 2021, 44(11): 160-165.)
doi: 10.16353/j.cnki.1000-7490.2021.11.021
[16] Ribeiro P, Silva F, Lopes L. Parallel Discovery of Network Motifs[J]. Journal of Parallel and Distributed Computing, 2012, 72(2): 144-154.
doi: 10.1016/j.jpdc.2011.08.007
[17] Milo R, Shen-Orr S, Itzkovitz S, et al. Network Motifs: Simple Building Blocks of Complex Networks[J]. Science, 2002, 298(5594): 824-827.
doi: 10.1126/science.298.5594.824 pmid: 12399590
[18] Boyen P, Neven F, van Dyck D, et al. Mining Minimal Motif Pair Sets Maximally Covering Interactions in a Protein-Protein Interaction Network[J]. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2013, 10(1): 73-86.
doi: 10.1109/TCBB.2012.165 pmid: 23702545
[19] Czeizler E, Hirvola T, Karhu K. A Graph-Theoretical Approach for Motif Discovery in Protein Sequences[J]. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2017, 14(1): 121-130.
doi: 10.1109/TCBB.2015.2511750 pmid: 28055896
[20] Jiang J W, Hu Y S, Li X S, et al. Analyzing Online Transaction Networks with Network Motifs[C]// Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. New York: ACM, 2022: 3098-3106.
[21] Zhao H, Xu X G, Song Y Q, et al. Ranking Users in Social Networks with Higher-Order Structures[C]// Proceedings of the 32nd AAAI Conference on Artificial Intelligence and 30th Innovative Applications of Artificial Intelligence Conference and 8th AAAI Symposium on Educational Advances in Artificial Intelligence. 2018: 232-239.
[22] Li P Z, Huang L, Wang C D, et al. Community Detection by Motif-Aware Label Propagation[J]. ACM Transactions on Knowledge Discovery from Data, 2020, 14(2): 1-19.
[23] Zhao H, Zhou Y Q, Song Y Q, et al. Motif Enhanced Recommendation over Heterogeneous Information Network[C]// Proceedings of the 28th ACM International Conference on Information and Knowledge Management. New York: ACM, 2019: 2189-2192.
[24] 王婕, 耿秀丽. 基于直觉模糊余弦相似度的云制造服务质量评价[J]. 计算机集成制造系统, 2021, 27(4): 1128-1134.
doi: 10.13196/j.cims.2021.04.017
[24] (Wang Jie, Geng Xiuli. Evaluation of Cloud Manufacturing Service Quality Evaluation Based on Intuitionistic Fuzzy Cosine Similarity[J]. Computer Integrated Manufacturing Systems, 2021, 27(4): 1128-1134.)
doi: 10.13196/j.cims.2021.04.017
[25] 刘文斌, 何彦青, 吴振峰, 等. 基于BERT和多相似度融合的句子对齐方法研究[J]. 数据分析与知识发现, 2021, 5(7): 48-58.
[25] (Liu Wenbing, He Yanqing, Wu Zhenfeng, et al. Research on Sentence Alignment Method Based on BERT and Multi-similarity Fusion[J]. Data Analysis and Knowledge Discovery, 2021, 5(7): 48-58.)
[26] 孟美任, 彭希珺. 基于VSM和余弦相似度的稿件精准送审方法[J]. 中国科技期刊研究, 2018, 29(10): 982-986.
doi: 10.11946/cjstp.201806190534
[26] (Meng Meiren, Peng Xijun. Method for Accurate Assignment of Manuscript Review Based on VSM and Cosine Similarity[J]. Chinese Journal of Scientific and Technical Periodicals, 2018, 29(10): 982-986.)
doi: 10.11946/cjstp.201806190534
[27] Koren Y, Bell R, Volinsky C. Matrix Factorization Techniques for Recommender Systems[J]. Computer, 2009, 42(8): 30-37.
[28] Benson A R, Gleich D F, Leskovec J. Higher-Order Organization of Complex Networks[J]. Science, 2016, 353(6295): 163-166.
doi: 10.1126/science.aad9029 pmid: 27387949
[29] 刘大有, 虞强源, 杨博, 等. 数据结构[M]. 第2版. 北京: 高等教育出版社, 2010: 132-133.
[29] (Liu Dayou, Yu Qiangyuan, Yang Bo. Data Structure[M]. The 2nd Edition. Beijing: Higher Education Press, 2010: 132-133.)
[30] Wang H, Chen B, Li W J. Collaborative Topic Regression with Social Regularization for Tag Recommendation[C]// Proceedings of the 23rd International Joint Conference on Artificial Intelligence. 2013: 2719-2725.
[31] Wang C, Blei D M. Collaborative Topic Modeling for Recommending Scientific Articles[C]// Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2011: 448-456.
[32] Koren Y. Factorization Meets the Neighborhood: A Multifaceted Collaborative Filtering Model[C]// Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2008: 426-434.
[1] Ding Hao, Hu Guangwei, Qi Jianglei, Zhuang Guangguang. Recommending Medical Literature with Random Forest Model and Query Expansion[J]. 数据分析与知识发现, 2022, 6(7): 32-43.
[2] Li Zhi, Sun Rui, Yao Yuxuan, Li Xiaohuan. Recommending Point-of-Interests with Real-Time Event Detection[J]. 数据分析与知识发现, 2022, 6(10): 114-127.
[3] Wang Qinjie, Qin Chunxiu, Ma Xubu, Liu Huailiang, Xu Cunzhen. Recommending Scientific Literature Based on Author Preference and Heterogeneous Information Network[J]. 数据分析与知识发现, 2021, 5(8): 54-64.
[4] Wang Gensheng,Pan Fangzheng. Matrix Factorization Algorithm with Weighted Heterogeneous Information Network[J]. 数据分析与知识发现, 2020, 4(12): 76-84.
[5] Yan Wen,Lijian Ma,Qingtian Zeng,Wenyan Guo. POI Recommendation Based on Geographic and Social Relationship Preferences[J]. 数据分析与知识发现, 2019, 3(8): 30-39.
[6] Shi Xiaohua,Lu Hongtao. Detecting Community in Scientific Collaboration Network with Bayesian Symmetric NMF[J]. 数据分析与知识发现, 2017, 1(9): 49-56.
[7] Chen Dongyi,Zhou Zicheng,Jiang Shengyi,Wang Lianxi,Wu Jialin. A Framework for Customer Segmentation on Enterprises’ Microblog[J]. 现代图书情报技术, 2016, 32(2): 43-51.
[8] Wei Meng. Literature Recommendation Using Evolution Patterns[J]. 现代图书情报技术, 2014, 30(4): 20-26.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn