Please wait a minute...
Data Analysis and Knowledge Discovery  2023, Vol. 7 Issue (4): 68-76    DOI: 10.11925/infotech.2096-3467.2022.0430
Current Issue | Archive | Adv Search |
Scientific Collaboration Recommendation Based on Hypergraph
Chen Wenjie()
Chengdu Library and Information Center, Chinese Academy of Sciences, Chengdu 610041, China
Download: PDF (979 KB)   HTML ( 15
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] To promote collaboration and academic community building among researchers, this paper proposes a hypergraph-based recommendation algorithm, SCRH. [Methods] Firstly, we constructed a scientific collaboration hyper-network based on hypergraph structure. Then, we created the hypergraph’s structural similarity index based on common neighbors and resource allocation. Next, we built the attribute similarity index of the hypergraph using the author topic model and deep autoencoder. Finally, the two measurement indices were linearly fused to achieve scientific collaboration recommendations. [Results] In the collaboration recommendation task, the AUC and MR index values of SCRH reached 0.88 and 2.35, which were 0.11 and 0.79 better than the optimal metrics of the comparison algorithms. [Limitations] SCRH only considers the author’s text attributes in the node attribute similarity measurement. It needs to fully utilize the author’s citation information, institution information, and publication levels. [Conclusions] SCRH considers the hypergraph’s structural and attribute features. It can effectively accomplish the research collaboration recommendation tasks in stem cells field.

Key wordsHypergraph      Structural Similarity      Attribute Similarity      Scientific Collaboration Recommendation     
Received: 05 May 2022      Published: 07 June 2023
ZTFLH:  TP393 G250  
Fund:National Key R&D Program of China(2018YFB1404205)
Corresponding Authors: Chen Wenjie,ORCID:0000-0001-7515-3624,E-mail: chenwj@clas.ac.cn   

Cite this article:

Chen Wenjie. Scientific Collaboration Recommendation Based on Hypergraph. Data Analysis and Knowledge Discovery, 2023, 7(4): 68-76.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2022.0430     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2023/V7/I4/68

Examples of Scientific Collaboration Hypernetwork
The Flow of Scientific Collaboration Recommendation Algorithm
Attribute Feature Vector Generation
超网络 hypernet-1 hypernet-2 hypernet-3
文献数量 6 430 2 462 1 372
作者节点数量 40 478 11 929 5 417
二元合作关系数量 212 55 25
多元合作关系数量 5 538 1 855 946
关系节点数量比 0.14 0.16 0.18
Scientific Collaboration Hypernetwork
作者 1 2 …… 10
作者1 0.314 407 91 0.502 426 36 …… -0.817 366 26
作者2 -0.139 027 33 -0.394 726 16 …… 0.642 107 54
…… …… …… …… ……
作者40478 0.690 276 42 0.142 156 29 …… 0.071 254 63
Author-Attribute Feature Matrix
作者节点 合作节点 相似性值
De Marchi, Fabiola Sorarù, Gianni 10.58
De Marchi, Fabiola Cisari, Carlo 6.96
De Marchi, Fabiola Profico, Daniela 5.69
Lu, Huai-En Yarmishyn, Aliaksandr A 12.69
Lu, Huai-En Syu, Shih-Han 12.57
Lu, Huai-En Chen, Shih-Jen 12.41
Structural Similarity Value
作者节点 合作节点 相似性值
Shetty, Deeti K Zukowska, Monika 1.31
Shetty, Deeti K Nagy, Kristina Vintersten 2.73
Shetty, Deeti K Sunada, Yoshihide 2.83
Okano, Teruo Masterson, Claire 1.35
Okano, Teruo Yarygin, K N V 4.32
Okano, Teruo Bock, István BioTalentum 4.62
Attribute Similarity Value
Attribute Feature Vector Visualization
度量指标 hypernet-1 hypernet-2 hypernet-3
WHRA+AS 0.79 0.85 0.88
WHCN+AS 0.77 0.82 0.87
WHRA 0.73 0.81 0.84
WHCN 0.71 0.79 0.81
RA 0.70 0.78 0.79
CN 0.69 0.76 0.77
AUC on Three Hypernetworks
度量指标 hypernet-1 hypernet-2 hypernet-3
WHRA+AS 2.44 2.37 2.35
WHCN+AS 2.49 2.43 2.42
WHRA 2.58 2.53 2.41
WHCN 2.73 2.59 2.66
RA 3.03 3.17 3.04
CN 3.45 3.18 3.14
Mean Rank
[1] 林原, 刘海峰, 王海龙, 等. 基于表示学习的学者间潜在合作机会挖掘[J]. 情报杂志, 2019, 38(5): 65-70.
[1] (Lin Yuan, Liu Haifeng, Wang Hailong, et al. Potential Cooperation Opportunities Exploration Between Scholars Based on Presentation Learning[J]. Journal of Intelligence, 2019, 38(5): 65-70.)
[2] Newman M. Scientific Collaboration Networks. I. Network Construction and Fundamental Results[J]. Physical Review E, 2001, 64: 016131.
[3] 高晨晖, 姜晓睿, 叶政君, 等. 基于异质学术超网的文献评价[J]. 情报学报, 2016, 35(8): 826-837.
[3] (Gao Chenhui, Jiang Xiaorui, Ye Zhengjun, et al. Ranking Scientific Articles over Heterogeneous Academic Hypernetwork[J]. Journal of the China Society for Scientific and Technical Information, 2016, 35(8): 826-837.)
[4] 胡枫, 赵海兴, 何佳倍, 等. 基于超图结构的科研合作网络演化模型[J]. 物理学报, 2013, 62(19): 198901.
doi: 10.7498/aps
[4] (Hu Feng, Zhao Haixing, He Jiabei, et al. An Evolving Model for Hypergraph-Structure-Based Scientific Collaboration Networks[J]. Acta Physica Sinica, 2013, 62(19): 198901.)
doi: 10.7498/aps
[5] 于亚新, 张文超, 李振国, 等. 基于超图的EBSN个性化推荐及优化算法[J]. 计算机研究与发展, 2020, 57(12): 2556-2570.
[5] (Yu Yaxin, Zhang Wenchao, Li Zhenguo, et al. Hypergraph-Based Personalized Recommendation & Optimization Algorithm in EBSN[J]. Journal of Computer Research and Development, 2020, 57(12): 2556-2570.)
[6] Rosen-Zvi M, Griffiths T, Steyvers M, et al. The Author-Topic Model for Authors and Documents[C]// Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence. Arlington: AUAI Press, 2004: 487-494.
[7] 刘萍, 郑凯伦, 邹德安. 基于LDA模型的科研合作推荐研究[J]. 情报理论与实践, 2015, 38(9): 79-85.
[7] (Liu Ping, Zheng Kailun, Zou Dean. The Recommendation of S&T Collaboration Based on LDA Model[J]. Information Studies: Theory & Application, 2015, 38(9): 79-85.)
[8] 蒲姗姗. 基于知识互补的科研合作专家推荐模型研究[J]. 情报理论与实践, 2018, 41(8): 96-101.
[8] (Pu Shanshan. Expert Recommendation Model in Scientific and Technical Collaboration Based on Complementary Knowledge[J]. Information Studies: Theory & Application, 2018, 41(8): 96-101.)
[9] 李仲, 韩红旗, 吴广印, 等. 基于文本稀疏分布式表征的学术合作推荐[J]. 情报科学, 2019, 37(6): 113-118.
[9] (Li Zhong, Han Hongqi, Wu Guangyin, et al. Academic Collaboration Recommendation Based on Sparse Distributed Representation[J]. Information Science, 2019, 37(6): 113-118.)
[10] Kong X J, Mao M Y, Liu J Y, et al. TNERec: Topic-Aware Network Embedding for Scientific Collaborator Recommendation[C]// Proceedings of the 2018 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI). IEEE, 2018: 1007-1014.
[11] 余传明, 林奥琛, 钟韵辞, 等. 基于网络表示学习的科研合作推荐研究[J]. 情报学报, 2019, 38(5): 500-511.
[11] (Yu Chuanming, Lin Aochen, Zhong Yunci, et al. Scientific Collaboration Recommendation Based on Network Embedding[J]. Journal of the China Society for Scientific and Technical Information, 2019, 38(5): 500-511.)
[12] Zhou J, Adnan Rafi M. Recommendation of Research Collaborator Based on Semantic Link Network[C]// Proceedings of the 15th International Conference on Semantics, Knowledge and Grids (SKG). IEEE, 2020: 16-20.
[13] 汪志兵, 韩文民, 孙竹梅, 等. 基于网络拓扑结构与节点属性特征融合的科研合作预测研究[J]. 情报理论与实践, 2019, 42(8): 116-120.
[13] (Wang Zhibing, Han Wenmin, Sun Zhumei, et al. Research on Scientific Collaboration Prediction Based on the Combination of Network Topology and Node Attributes[J]. Information Studies: Theory & Application, 2019, 42(8): 116-120.)
[14] 张鑫, 文奕, 许海云. 一种融合表示学习与主题表征的作者合作预测模型[J]. 数据分析与知识发现, 2021, 5(3): 88-100.
[14] (Zhang Xin, Wen Yi, Xu Haiyun. A Prediction Model with Network Representation Learning and Topic Model for Author Collaboration[J]. Data Analysis and Knowledge Discovery, 2021, 5(3): 88-100.)
[15] 熊回香, 李晓敏, 杜瑾. 基于学术关键词与共被引的学者推荐研究[J]. 情报学报, 2021, 40(7): 725-733.
[15] (Xiong Huixiang, Li Xiaomin, Du Jin. Scholar Recommendation Research Based on Academic Keywords and Co-Citation[J]. Journal of the China Society for Scientific and Technical Information, 2021, 40(7): 725-733.)
[16] Du O X, Li Y. Academic Collaborator Recommendation Based on Attributed Network Embedding[J]. Journal of Data and Information Science, 2022, 7(1): 37-56.
doi: 10.2478/jdis-2022-0005
[17] Pradhan T, Pal S. A Multi-Level Fusion Based Decision Support System for Academic Collaborator Recommendation[J]. Knowledge-Based Systems, 2020, 197: 105784.
[18] 王雨, 郭进利. 超网络视角下的科研合作网络节点重要性评估[J]. 图书馆杂志, 2018, 37(10): 89-100.
[18] (Wang Yu, Guo Jinli. The Evaluation of Node Importance Within the Scientific Collaboration Network from Hypernetwork Perspective[J]. Library Journal, 2018, 37(10): 89-100.)
[19] Liben-Nowell D, Kleinberg J. The Link-Prediction Problem for Social Networks[J]. Journal of the American Society for Information Science and Technology, 2007, 58(7): 1019-1031.
doi: 10.1002/(ISSN)1532-2890
[20] Zhou T, Lü L Y, Zhang Y C. Predicting Missing Links via Local Information[J]. The European Physical Journal B, 2009, 71(4): 623-630.
doi: 10.1140/epjb/e2009-00335-8
[21] Lü L Y, Zhou T. Link Prediction in Weighted Networks: The Role of Weak Ties[J]. EPL, 2010, 89(1): 18001.
doi: 10.1209/0295-5075/89/18001
[22] Newman M E. Clustering and Preferential Attachment in Growing Networks[J]. Physical Review E, Statistical, Nonlinear, and Soft Matter Physics, 2001, 64(2 Pt 2): 025102.
[23] 赵雪莉, 卢光跃, 吕少卿, 等. 结合属性信息的二分网络表示学习[J]. 计算机科学与探索, 2021, 15(3): 495-505.
doi: 10.3778/j.issn.1673-9418.2004034
[23] (Zhao Xueli, Lu Guangyue, Lv Shaoqing, et al. Attributed Bipartite Network Representation Learning[J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(3): 495-505.)
doi: 10.3778/j.issn.1673-9418.2004034
[24] Gehring J, Miao Y J, Metze F, et al. Extracting Deep Bottleneck Features Using Stacked Auto-Encoders[C]// Proceedings of the 26th IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2013: 3377-3381.
[25] Schölkopf B, Platt J, Hofmann T. Greedy Layer-Wise Training of Deep Networks[C]// Proceedings of the 2006 NIPS Conference. 2007: 153-160.
[26] 邱均平, 陈木佩. 我国计量学领域作者合作关系研究[J]. 情报理论与实践, 2012, 35(11): 56-60.
[26] (Qiu Junping, Chen Mupei. Research on Author Collaboration in the Metrology Field in China[J]. Information Studies: Theory & Application, 2012, 35(11): 56-60.)
[27] 黄震华, 张佳雯, 张波, 等. 语义推荐算法研究综述[J]. 电子学报, 2016, 44(9): 2262-2275.
doi: 10.3969/j.issn.0372-2112.2016.09.035
[27] (Huang Zhenhua, Zhang Jiawen, Zhang Bo, et al. Survey of Semantics-Based Recommendation Algorithms[J]. Acta Electronica Sinica, 2016, 44(9): 2262-2275.)
doi: 10.3969/j.issn.0372-2112.2016.09.035
[28] Laurens V D M, Hinton G. Visualizing Data Using t-SNE[J]. Journal of Machine Learning Research, 2008, 9: 2579-2605.
[1] Bai Simeng,Niu Zhendong,He Hui,Shi Kaize,Yi Kun,Ma Yuanchi. Biomedical Text Classification Method Based on Hypergraph Attention Network[J]. 数据分析与知识发现, 2022, 6(11): 13-24.
[2] Wu Yuying,Sun Ping,He Xijun,Jiang Guorui. Predicting Transactions Among Agents in Patent Transfer Weighted Networks for New Energy[J]. 数据分析与知识发现, 2018, 2(11): 73-79.
[3] Yu Chuanming,Gong Yutian,Zhao Xiaoli,An Lu. Collaboration Recommendation of Finance Research Based on Multi-feature Fusion[J]. 数据分析与知识发现, 2017, 1(8): 39-47.
[4] Nie Hui Huang Guipeng. The Application and Implementation of Tree Edit Distance in Web Information Extraction[J]. 现代图书情报技术, 2010, 26(5): 29-34.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn