[Objective] This study proposes an improved paper recommendation framework based on Ensemble Learning and Factorization Machine. It addresses the issues of the existing methods, such as difficulties in processing sparse data and representing features. [Methods] First, we used Convolutional Neural Network, Network Embedding, and other algorithms to obtain feature representations, which were processed by Factorization Machine learners. Homogeneous weak Factorization Machine learners are then trained based on Ensemble Learning. We integrated these weak learners into a stronger learner through the voting mechanism and generated the final recommendations. [Results] We examined the new model with the CiteULike dataset, and the Precision, Accuracy, and F-Measure reached 72.6%, 69.7%, and 76.2%, respectively, 20%, 15%, and 9% higher than the benchmark algorithms. [Limitations] The input, sampling strategy, and processing mode need to be further explored. [Conclusions] The proposed Ensemble Factorization Machine enables effective representation and utilization of sparse data features, enhancing the recommendation performance.
杨辰, 郑若桢, 王楚涵, 耿爽, 王楠. 集成因子分解机及其在论文推荐中的应用研究*[J]. 数据分析与知识发现, 2023, 7(8): 128-137.
Yang Chen, Zheng Ruozhen, Wang Chuhan, Geng Shuang, Wang Nan. Ensemble Factorization Machine and Its Application in Paper Recommendation. Data Analysis and Knowledge Discovery, 2023, 7(8): 128-137.
Kong X J, Shi Y J, Yu S, et al. Academic Social Networks: Modeling, Analysis, Mining and Applications[J]. Journal of Network and Computer Applications, 2019, 132: 86-103.
doi: 10.1016/j.jnca.2019.01.029
[2]
Nascimento C, Laender A H F, da Silva A S, et al. A Source Independent Framework for Research Paper Recommendation[C]// Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries. New York: ACM, 2011: 297-306.
(Yang Chen, Liu Tingting, Liu Lei, et al. A Novel Recommendation Approach of Electronic Literature Resources Combining Semantic and Social Features[J]. Journal of the China Society for Scientific and Technical Information, 2019, 38(6): 632-640.)
[4]
Champiri Z D, Asemi A, Binti S S S. Meta-Analysis of Evaluation Methods and Metrics Used in Context-Aware Scholarly Recommender Systems[J]. Knowledge and Information Systems, 2019, 61(2): 1147-1178.
doi: 10.1007/s10115-018-1324-5
[5]
Bhagavatula C, Feldman S, Power R, et al. Content-Based Citation Recommendation[OL]. arXiv Preprint, arXiv: 1802.08301.
[6]
Basu C, Hirsh H, Cohen W W, et al. Technical Paper Recommendation: A Study in Combining Multiple Information Sources[J]. Journal of Artificial Intelligence Research, 2001, 14: 231-252.
doi: 10.1613/jair.739
[7]
Caragea C, Bulgarov F A, Godea A, et al. Citation-Enhanced Keyphrase Extraction from Research Papers: A Supervised Approach[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA, USA: Association for Computational Linguistics, 2014: 1435-1446.
[8]
McNee S M, Albert I, Cosley D, et al. On the Recommending of Citations for Research Papers[C]// Proceedings of the 2002 ACM Conference on Computer Supported Cooperative Work. New York: ACM, 2002: 116-125.
(Bi Qiang, Liu Jian. Study on the Method of Aggregation and Service Recommendation of Digital Resource Based on Domain Ontology[J]. Journal of the China Society for Scientific and Technical Information, 2017, 36(5): 452-460.)
(Li Yamei, Qin Chunxiu, Ma Xubu. Research on Collaborative Recommendation of Scientific and Technological Literature Based on Researchers’ Contextual Topic Preference[J]. Information Studies: Theory & Application, 2021, 44(12): 180-189.)
[11]
Beel J, Gipp B, Langer S, et al. Research-Paper Recommender Systems: A Literature Survey[J]. International Journal on Digital Libraries, 2016, 17(4): 305-338.
doi: 10.1007/s00799-015-0156-0
(Tang Zhikang, Li Chunying, Tang Yong, et al. Paper Recommendation Method Based on Scholar Social Platform[J]. Computer & Digital Engineering, 2017, 45(2): 221-225.)
(Liu Jian, Bi Qiang, Liu Qingxu, et al. New Content Recommendation Service of Digital Literature[J]. New Technology of Library and Information Service, 2016(9): 70-77.)
(Chen Haihua, Meng Rui, Lu Wei. Research Review on Citation Recommendation of Academic Literatures[J]. Library and Information Service, 2015, 59(15): 133-143.)
doi: 10.13266/j.issn.0252-3116.2015.15.018
(Liu Yang. Research on the Hybrid Recommendation Model of Academic Reference Based on Quality[J]. Information Studies: Theory & Application, 2015, 38(2): 17-22.)
[16]
Haruna K, Ismail M A, Damiasih D, et al. A Collaborative Approach for Research Paper Recommender System[J]. PLoS One, 2017, 12(10): e0184516.
doi: 10.1371/journal.pone.0184516
[17]
Guo Q Y, Zhuang F Z, Qin C, et al. A Survey on Knowledge Graph-Based Recommender Systems[J]. IEEE Transactions on Knowledge and Data Engineering, 2022, 34(8): 3549-3568.
doi: 10.1109/TKDE.2020.3028705
[18]
Kanakia A, Shen Z H, Eide D, et al. A Scalable Hybrid Research Paper Recommender System for Microsoft Academic[C]// Proceedings of the 2019 World Wide Web Conference. New York: ACM, 2019: 2893-2899.
(Wang Qinjie, Qin Chunxiu, Ma Xubu, et al. Research on Recommendation Method of Scientific and Technological Literature Based on Author Preference and Heterogeneous Information Network[J]. Data Analysis and Knowledge Discovery, 2021, 5(8): 54-64.)
[20]
Ricci F, Rokach L, Shapira B. Introduction to Recommender Systems Handbook[A]// Ricci F, Rokach L, Shapira B, et al. Recommender Systems Handbook[M]. Boston, MA: Springer, 2011: 1-35.
(Cai Yi, Zhu Xiufang, Sun Zhangli, et al. Semi-Supervised and Ensemble Learning: A Review[J]. Computer Science, 2017, 44(S1): 7-13.)
[22]
Sagi O, Rokach L. Ensemble Learning: A Survey[J]. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 2018, 8(4): e1249.
doi: 10.1002/widm.2018.8.issue-4
[23]
Breiman L. Bagging Predictors[J]. Machine Learning, 1996, 24(2): 123-140.
[24]
Freund Y, Schapire R E. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting[J]. Journal of Computer and System Sciences, 1997, 55(1): 119-139.
doi: 10.1006/jcss.1997.1504
[25]
Ho T K. The Random Subspace Method for Constructing Decision Forests[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998, 20(8): 832-844.
doi: 10.1109/34.709601
Kuncheva L I, Whitaker C J. Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy[J]. Machine Learning, 2003, 51(2): 181-207.
doi: 10.1023/A:1022859003006
[28]
Rendle S. Factorization Machines[C]// Proceedings of the 2010 IEEE International Conference on Data Mining. IEEE, 2011: 995-1000.
[29]
Gu J X, Wang Z H, Kuen J, et al. Recent Advances in Convolutional Neural Networks[J]. Pattern Recognition, 2018, 77: 354-377.
[30]
Cui P, Wang X, Pei J, et al. A Survey on Network Embedding[J]. IEEE Transactions on Knowledge and Data Engineering, 2019, 31(5): 833-852.
doi: 10.1109/TKDE.69
[31]
Choong A C H, Lee N K. Evaluation of Convolutionary Neural Networks Modeling of DNA Sequences Using Ordinal Versus One-Hot Encoding Method[C]// Proceedings of the 2017 International Conference on Computer and Drone Applications. IEEE, 2018: 60-65.
(Zhou Feiyan, Jin Linpeng, Dong Jun. Review of Convolutional Neural Network[J]. Chinese Journal of Computers, 2017, 40(6): 1229-1251.)
[33]
Perozzi B, Al-Rfou R, Skiena S. DeepWalk: Online Learning of Social Representations[C]// Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2014: 701-710.
[34]
Mikolov T, Chen K, Corrado G, et al. Efficient Estimation of Word Representations in Vector Space[OL]. arXiv Preprint, arXiv: 1301.3781.
[35]
Kingma D P, Ba J. A Method for Stochastic Optimization[C]// Proceedings of the 3rd International Conference on Learning Representations. 2015.
[36]
Wang H, Chen B Y, Li W J. Collaborative Topic Regression with Social Regularization for Tag Recommendation[C]// Proceedings of the 23rd International Joint Conference on Artificial Intelligence. 2013.
[37]
Koren Y, Bell R, Volinsky C. Matrix Factorization Techniques for Recommender Systems[J]. Computer, 2009, 42(8): 30-37.
[38]
Goldberg D, Nichols D, Oki B M, et al. Using Collaborative Filtering to Weave an Information Tapestry[J]. Communications of the ACM, 1992, 35(12): 61-70.