Please wait a minute...
Advanced Search
数据分析与知识发现  2023, Vol. 7 Issue (4): 68-76     https://doi.org/10.11925/infotech.2096-3467.2022.0430
  研究论文 本期目录 | 过刊浏览 | 高级检索 |
基于超图的科研合作推荐研究*
陈文杰()
中国科学院成都文献情报中心 成都 610041
Scientific Collaboration Recommendation Based on Hypergraph
Chen Wenjie()
Chengdu Library and Information Center, Chinese Academy of Sciences, Chengdu 610041, China
全文: PDF (979 KB)   HTML ( 14
输出: BibTeX | EndNote (RIS)      
摘要 

【目的】 为促进科研人员间的合作交流与学术团体的构建,提出基于超图的推荐算法SCRH,用于干细胞领域的科研合作推荐研究。【方法】 构建基于超图结构的科研合作超网络,然后基于共同邻居和资源分配构建超图的结构相似性指标,利用作者主题模型和深度自编码器构建超图的属性相似指标,最后将两种度量指标线性融合以实现科研合作推荐。【结果】 SCRH在合作推荐任务上AUC和MR指标值为0.88和2.35,相较于对比算法最优指标度量分别提升0.11和0.79。【局限】 SCRH在节点属性相似性度量中仅考虑作者的文本属性,没有充分利用作者的引用信息、机构信息和发文等级等属性信息。【结论】 SCRH同时考虑了超图的结构特征与属性特征,能够有效完成干细胞领域的科研合作推荐任务。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
陈文杰
关键词 超图结构相似性属性相似性科研合作推荐    
Abstract

[Objective] To promote collaboration and academic community building among researchers, this paper proposes a hypergraph-based recommendation algorithm, SCRH. [Methods] Firstly, we constructed a scientific collaboration hyper-network based on hypergraph structure. Then, we created the hypergraph’s structural similarity index based on common neighbors and resource allocation. Next, we built the attribute similarity index of the hypergraph using the author topic model and deep autoencoder. Finally, the two measurement indices were linearly fused to achieve scientific collaboration recommendations. [Results] In the collaboration recommendation task, the AUC and MR index values of SCRH reached 0.88 and 2.35, which were 0.11 and 0.79 better than the optimal metrics of the comparison algorithms. [Limitations] SCRH only considers the author’s text attributes in the node attribute similarity measurement. It needs to fully utilize the author’s citation information, institution information, and publication levels. [Conclusions] SCRH considers the hypergraph’s structural and attribute features. It can effectively accomplish the research collaboration recommendation tasks in stem cells field.

Key wordsHypergraph    Structural Similarity    Attribute Similarity    Scientific Collaboration Recommendation
收稿日期: 2022-05-05      出版日期: 2023-06-07
ZTFLH:  TP393 G250  
基金资助:*国家重点研发计划项目的研究成果之一(2018YFB1404205)
通讯作者: 陈文杰,ORCID:0000-0001-7515-3624,E-mail: chenwj@clas.ac.cn   
引用本文:   
陈文杰. 基于超图的科研合作推荐研究*[J]. 数据分析与知识发现, 2023, 7(4): 68-76.
Chen Wenjie. Scientific Collaboration Recommendation Based on Hypergraph. Data Analysis and Knowledge Discovery, 2023, 7(4): 68-76.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2022.0430      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2023/V7/I4/68
Fig.1  科研合作超网络实例
Fig.2  科研合作推荐算法流程
Fig.3  属性特征向量生成
超网络 hypernet-1 hypernet-2 hypernet-3
文献数量 6 430 2 462 1 372
作者节点数量 40 478 11 929 5 417
二元合作关系数量 212 55 25
多元合作关系数量 5 538 1 855 946
关系节点数量比 0.14 0.16 0.18
Table 1  科研合作超网络
作者 1 2 …… 10
作者1 0.314 407 91 0.502 426 36 …… -0.817 366 26
作者2 -0.139 027 33 -0.394 726 16 …… 0.642 107 54
…… …… …… …… ……
作者40478 0.690 276 42 0.142 156 29 …… 0.071 254 63
Table 2  作者-属性特征矩阵
作者节点 合作节点 相似性值
De Marchi, Fabiola Sorarù, Gianni 10.58
De Marchi, Fabiola Cisari, Carlo 6.96
De Marchi, Fabiola Profico, Daniela 5.69
Lu, Huai-En Yarmishyn, Aliaksandr A 12.69
Lu, Huai-En Syu, Shih-Han 12.57
Lu, Huai-En Chen, Shih-Jen 12.41
Table 3  结构相似性值
作者节点 合作节点 相似性值
Shetty, Deeti K Zukowska, Monika 1.31
Shetty, Deeti K Nagy, Kristina Vintersten 2.73
Shetty, Deeti K Sunada, Yoshihide 2.83
Okano, Teruo Masterson, Claire 1.35
Okano, Teruo Yarygin, K N V 4.32
Okano, Teruo Bock, István BioTalentum 4.62
Table 4  属性相似性值
Fig.4  属性特征向量可视化
度量指标 hypernet-1 hypernet-2 hypernet-3
WHRA+AS 0.79 0.85 0.88
WHCN+AS 0.77 0.82 0.87
WHRA 0.73 0.81 0.84
WHCN 0.71 0.79 0.81
RA 0.70 0.78 0.79
CN 0.69 0.76 0.77
Table 5  三个超网络上的AUC
度量指标 hypernet-1 hypernet-2 hypernet-3
WHRA+AS 2.44 2.37 2.35
WHCN+AS 2.49 2.43 2.42
WHRA 2.58 2.53 2.41
WHCN 2.73 2.59 2.66
RA 3.03 3.17 3.04
CN 3.45 3.18 3.14
Table 6  平均排序
[1] 林原, 刘海峰, 王海龙, 等. 基于表示学习的学者间潜在合作机会挖掘[J]. 情报杂志, 2019, 38(5): 65-70.
[1] (Lin Yuan, Liu Haifeng, Wang Hailong, et al. Potential Cooperation Opportunities Exploration Between Scholars Based on Presentation Learning[J]. Journal of Intelligence, 2019, 38(5): 65-70.)
[2] Newman M. Scientific Collaboration Networks. I. Network Construction and Fundamental Results[J]. Physical Review E, 2001, 64: 016131.
[3] 高晨晖, 姜晓睿, 叶政君, 等. 基于异质学术超网的文献评价[J]. 情报学报, 2016, 35(8): 826-837.
[3] (Gao Chenhui, Jiang Xiaorui, Ye Zhengjun, et al. Ranking Scientific Articles over Heterogeneous Academic Hypernetwork[J]. Journal of the China Society for Scientific and Technical Information, 2016, 35(8): 826-837.)
[4] 胡枫, 赵海兴, 何佳倍, 等. 基于超图结构的科研合作网络演化模型[J]. 物理学报, 2013, 62(19): 198901.
doi: 10.7498/aps
[4] (Hu Feng, Zhao Haixing, He Jiabei, et al. An Evolving Model for Hypergraph-Structure-Based Scientific Collaboration Networks[J]. Acta Physica Sinica, 2013, 62(19): 198901.)
doi: 10.7498/aps
[5] 于亚新, 张文超, 李振国, 等. 基于超图的EBSN个性化推荐及优化算法[J]. 计算机研究与发展, 2020, 57(12): 2556-2570.
[5] (Yu Yaxin, Zhang Wenchao, Li Zhenguo, et al. Hypergraph-Based Personalized Recommendation & Optimization Algorithm in EBSN[J]. Journal of Computer Research and Development, 2020, 57(12): 2556-2570.)
[6] Rosen-Zvi M, Griffiths T, Steyvers M, et al. The Author-Topic Model for Authors and Documents[C]// Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence. Arlington: AUAI Press, 2004: 487-494.
[7] 刘萍, 郑凯伦, 邹德安. 基于LDA模型的科研合作推荐研究[J]. 情报理论与实践, 2015, 38(9): 79-85.
[7] (Liu Ping, Zheng Kailun, Zou Dean. The Recommendation of S&T Collaboration Based on LDA Model[J]. Information Studies: Theory & Application, 2015, 38(9): 79-85.)
[8] 蒲姗姗. 基于知识互补的科研合作专家推荐模型研究[J]. 情报理论与实践, 2018, 41(8): 96-101.
[8] (Pu Shanshan. Expert Recommendation Model in Scientific and Technical Collaboration Based on Complementary Knowledge[J]. Information Studies: Theory & Application, 2018, 41(8): 96-101.)
[9] 李仲, 韩红旗, 吴广印, 等. 基于文本稀疏分布式表征的学术合作推荐[J]. 情报科学, 2019, 37(6): 113-118.
[9] (Li Zhong, Han Hongqi, Wu Guangyin, et al. Academic Collaboration Recommendation Based on Sparse Distributed Representation[J]. Information Science, 2019, 37(6): 113-118.)
[10] Kong X J, Mao M Y, Liu J Y, et al. TNERec: Topic-Aware Network Embedding for Scientific Collaborator Recommendation[C]// Proceedings of the 2018 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI). IEEE, 2018: 1007-1014.
[11] 余传明, 林奥琛, 钟韵辞, 等. 基于网络表示学习的科研合作推荐研究[J]. 情报学报, 2019, 38(5): 500-511.
[11] (Yu Chuanming, Lin Aochen, Zhong Yunci, et al. Scientific Collaboration Recommendation Based on Network Embedding[J]. Journal of the China Society for Scientific and Technical Information, 2019, 38(5): 500-511.)
[12] Zhou J, Adnan Rafi M. Recommendation of Research Collaborator Based on Semantic Link Network[C]// Proceedings of the 15th International Conference on Semantics, Knowledge and Grids (SKG). IEEE, 2020: 16-20.
[13] 汪志兵, 韩文民, 孙竹梅, 等. 基于网络拓扑结构与节点属性特征融合的科研合作预测研究[J]. 情报理论与实践, 2019, 42(8): 116-120.
[13] (Wang Zhibing, Han Wenmin, Sun Zhumei, et al. Research on Scientific Collaboration Prediction Based on the Combination of Network Topology and Node Attributes[J]. Information Studies: Theory & Application, 2019, 42(8): 116-120.)
[14] 张鑫, 文奕, 许海云. 一种融合表示学习与主题表征的作者合作预测模型[J]. 数据分析与知识发现, 2021, 5(3): 88-100.
[14] (Zhang Xin, Wen Yi, Xu Haiyun. A Prediction Model with Network Representation Learning and Topic Model for Author Collaboration[J]. Data Analysis and Knowledge Discovery, 2021, 5(3): 88-100.)
[15] 熊回香, 李晓敏, 杜瑾. 基于学术关键词与共被引的学者推荐研究[J]. 情报学报, 2021, 40(7): 725-733.
[15] (Xiong Huixiang, Li Xiaomin, Du Jin. Scholar Recommendation Research Based on Academic Keywords and Co-Citation[J]. Journal of the China Society for Scientific and Technical Information, 2021, 40(7): 725-733.)
[16] Du O X, Li Y. Academic Collaborator Recommendation Based on Attributed Network Embedding[J]. Journal of Data and Information Science, 2022, 7(1): 37-56.
doi: 10.2478/jdis-2022-0005
[17] Pradhan T, Pal S. A Multi-Level Fusion Based Decision Support System for Academic Collaborator Recommendation[J]. Knowledge-Based Systems, 2020, 197: 105784.
[18] 王雨, 郭进利. 超网络视角下的科研合作网络节点重要性评估[J]. 图书馆杂志, 2018, 37(10): 89-100.
[18] (Wang Yu, Guo Jinli. The Evaluation of Node Importance Within the Scientific Collaboration Network from Hypernetwork Perspective[J]. Library Journal, 2018, 37(10): 89-100.)
[19] Liben-Nowell D, Kleinberg J. The Link-Prediction Problem for Social Networks[J]. Journal of the American Society for Information Science and Technology, 2007, 58(7): 1019-1031.
doi: 10.1002/(ISSN)1532-2890
[20] Zhou T, Lü L Y, Zhang Y C. Predicting Missing Links via Local Information[J]. The European Physical Journal B, 2009, 71(4): 623-630.
doi: 10.1140/epjb/e2009-00335-8
[21] Lü L Y, Zhou T. Link Prediction in Weighted Networks: The Role of Weak Ties[J]. EPL, 2010, 89(1): 18001.
doi: 10.1209/0295-5075/89/18001
[22] Newman M E. Clustering and Preferential Attachment in Growing Networks[J]. Physical Review E, Statistical, Nonlinear, and Soft Matter Physics, 2001, 64(2 Pt 2): 025102.
[23] 赵雪莉, 卢光跃, 吕少卿, 等. 结合属性信息的二分网络表示学习[J]. 计算机科学与探索, 2021, 15(3): 495-505.
doi: 10.3778/j.issn.1673-9418.2004034
[23] (Zhao Xueli, Lu Guangyue, Lv Shaoqing, et al. Attributed Bipartite Network Representation Learning[J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(3): 495-505.)
doi: 10.3778/j.issn.1673-9418.2004034
[24] Gehring J, Miao Y J, Metze F, et al. Extracting Deep Bottleneck Features Using Stacked Auto-Encoders[C]// Proceedings of the 26th IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2013: 3377-3381.
[25] Schölkopf B, Platt J, Hofmann T. Greedy Layer-Wise Training of Deep Networks[C]// Proceedings of the 2006 NIPS Conference. 2007: 153-160.
[26] 邱均平, 陈木佩. 我国计量学领域作者合作关系研究[J]. 情报理论与实践, 2012, 35(11): 56-60.
[26] (Qiu Junping, Chen Mupei. Research on Author Collaboration in the Metrology Field in China[J]. Information Studies: Theory & Application, 2012, 35(11): 56-60.)
[27] 黄震华, 张佳雯, 张波, 等. 语义推荐算法研究综述[J]. 电子学报, 2016, 44(9): 2262-2275.
doi: 10.3969/j.issn.0372-2112.2016.09.035
[27] (Huang Zhenhua, Zhang Jiawen, Zhang Bo, et al. Survey of Semantics-Based Recommendation Algorithms[J]. Acta Electronica Sinica, 2016, 44(9): 2262-2275.)
doi: 10.3969/j.issn.0372-2112.2016.09.035
[28] Laurens V D M, Hinton G. Visualizing Data Using t-SNE[J]. Journal of Machine Learning Research, 2008, 9: 2579-2605.
[1] 白思萌,牛振东,何慧,时恺泽,易坤,马原驰. 基于超图注意力网络的生物医学文本分类方法*[J]. 数据分析与知识发现, 2022, 6(11): 13-24.
[2] 武玉英, 孙平, 何喜军, 蒋国瑞. 新能源领域专利转让加权网络中主体间技术交易机会预测*[J]. 数据分析与知识发现, 2018, 2(11): 73-79.
[3] 余传明, 龚雨田, 赵晓莉, 安璐. 基于多特征融合的金融领域科研合作推荐研究*[J]. 数据分析与知识发现, 2017, 1(8): 39-47.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn