[Objective] This paper proposes a modified translation model (TransTopic) to predict research cooperation, aiming to promote exchanges among researchers and maximize efficiency.[Methods] We used TransTopic to uniformly map the nodes and edges of the scientific research cooperation network to low-dimensional vectors. First, we used the LDA model to extract the topic distribution features of stem cells papers. Then, we turned topic features to edge vectors with the deep autoencoder and obtained node vectors based on the translation mechanism. Finally, we predicted the scientific cooperation through the semantic calculation between the vectors.[Results] TransTopic’s AUC (95.21%) and MeanRank (17.48) indicators for link prediction are better than those of the existing models, and its topic prediction accuracy rate reached 86.52%.[Limitations] The proposed method only considered a one-step translation path, and did not fully utilized information like author’s institution, research interests, and publication levels.[Conclusions] The proposed method based on translation model could effectively predict research cooperation in the field of stem cells.
陈文杰. 基于翻译模型的科研合作预测研究*[J]. 数据分析与知识发现, 2020, 4(10): 28-36.
Chen Wenjie. Predicting Research Collaboration Based on Translation Model. Data Analysis and Knowledge Discovery, 2020, 4(10): 28-36.
Guns R, Rousseau R. Recommending Research Collaborations Using Link Prediction and Random Forest Classifiers[J]. Scientometrics, 2014,101(2):1461-1473.
doi: 10.1007/s11192-013-1228-9
( Zhang Jinzhu, Wang Xiaomei, Han Tao. Predicting Co-authorship with Combination of Paths in Paper-author Bipartite Networks[J]. New Technology of Library and Information Service, 2016 (10):42-49.)
( Zhang Jinzhu, Yu Wenqian, Liu Jingjie, et al. Predicting Research Collaborations Based on Network Embedding[J]. Journal of the China Society for Scientific and Technical Information, 2018,37(2):132-139.)
( Lin Yuan, Liu Haifeng, Wang Hailong, et al. Potential Cooperation Opportunities Exploration Between Scholars Based on Presentation Learning[J]. Journal of Intelligence, 2019,38(5):65-70.)
[5]
Mikolov T, Sutskever I, Chen K, et al. Distributed Representations of Words and Phrases and Their Compositionality[C]// Proceedings of the 26th International Conference on Neural Information Processing Systems - Volume 2. 2013: 3111-3119.
[6]
Bordes A, Usunier N, Garcia-Durán A, et al. Translating Embeddings for Modeling Multi-Relational Data[C]// Proceedings of the 26th International Conference on Neural Information Processing Systems-Volume 2. 2013: 2787-2795.
( Liu Zhiyuan, Sun Maosong, Lin Yankai, et al. Knowledge Representation Learning: A Review[J]. Journal of Computer Research and Development, 2016,53(2):247-261.)
[8]
Wang Z W, Zhang J L, Feng J, et al. Knowledge Graph Embedding by Translating on Hyperplanes[C]//Proceedings of the 28th AAAI Conference on Artificial Intelligence. 2014: 1112-1119.
[9]
Lin Y K, Liu Z Y, Sun M S, et al. Learning Entity and Relation Embedding for Knowledge Graph Completion[C]//Proceedings of AAAI 2015. 2015:2181-2187.
[10]
Xiao H, Huang M L, Hao Y, et al. TransG: A Generative Mixture Model for Knowledge Graph Embedding[OL]. arXiv Preprint, arXiv:1509.05488, 2015.
[11]
He S Z, Liu K, Ji G L, et al. Learning to Represent Knowledge Graphs with Gaussian Embedding[C]//Proceedings of the 24th ACM International Conference on Information and Knowledge Management. 2015: 623-632.
[12]
Xiao H, Huang M L, Hao Y, et al. TransA: An Adaptive Approach for Knowledge Graph Embedding[OL]. arXiv Preprint, arXiv: 1509.05490, 2015.
( Fang Yang, Zhao Xiang, Tan Zhen, et al. A Revised Translation-Based Method for Knowledge Graph Representation[J]. Journal of Computer Research and Development, 2018,55(1):139-150.)
[14]
Lin Y K, Liu Z Y, Luan H B, et al. Modeling Relation Paths for Representation Learning of Knowledge Bases[OL]. arXiv Preprint, arXiv:1506.00379, 2015.
[15]
Xie R B, Liu Z Y, Jia J, et al. Representation Learning of Knowledge Graphs with Entity Descriptions[C]//Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2016: 2659-2665.
[16]
Newman M E J. The Structure and Function of Complex Networks[J]. SIAM Review, 2003,45(2):167-256.
doi: 10.1137/S003614450342480
[17]
Liben-Nowell D, Kleinberg J. The Link Prediction Problem for Social Networks[J]. Journal of the American Society for Information Science and Technology, 2003,58(7):1019-1031.
doi: 10.1002/(ISSN)1532-2890
[18]
孙晓玲. 作者合作网络的结构及其演化与预测研究[D]. 大连: 大连理工大学, 2014.
[18]
( Sun Xiaoling. Research on the Structure, Evolution and Prediction of Author Cooperative Network[D]. Dalian: Dalian University of Technology, 2014.)
( Liu Jing, Sun Wei. Discovery of Potential Scientific and Technical Collaborative Relationship Based on Link Prediction[J]. Information Studies: Theory & Application, 2017,40(7):88-92, 121.)
( Wang Zhibing, Han Wenmin, Sun Zhumei, et al. Research on Scientific Collaboration Prediction Based on the Combination of Network Topology and Node Attributes[J]. Information Studies:Theory & Application, 2019,42(8):116-120, 109.)
( Zhang Jinzhu, Han Tao, Wang Xiaomei. Co-authorship Prediction in the Author-keyword Bipartite Networks[J]. Library and Information Service, 2016,60(21):74-80.)
[22]
Luong N T, Nguyen T T, Jung J J, et al. Discovering Co-author Relationship in Bibliographic Data Using Similarity Measures and Random Walk Model[C]//Proceedings of 2015 Asian Conference on Intelligent Information and Database Systems. 2015: 127-136.
( Ai Ke, Ma Guoshuai, Yang Kaikai, et al. A Classification Method of Scientific Collaborator Potential Prediction Based on Ensemble Learning[J]. Journal of Computer Research and Development, 2019,56(7):1383-1395.)
( Yu Chuanming, Lin Aochen, Zhong Yunci, et al. Scientific Collaboration Recommendation Based on Network Embedding[J]. Journal of the China Society for Scientific and Technical Information, 2019,38(5):500-511.)
[25]
Blei D M, Ng A Y, Jordan M I, et al. Latent Dirichlet Allocation[J]. Journal of Machine Learning Research, 2003,3(4/5):993-1022.
[26]
Tu C C, Zhang Z Y, Liu Z Y, et al. TransNet: Translation-Based Network Representation Learning for Social Relation Extraction[C]// Proceedings of the 26th International Joint Conference on Artificial Intelligence. 2017: 2864-2870.
[27]
Schölkopf B, Platt J, Hofmann T. Greedy Layer-Wise Training of Deep Networks[A]//Advances in Neural Information Processing Systems[M]. MIT Press, 2007: 153-160.
( Sun Lijuan. Order of Authors and Corresponding Author in Scientific Papers[J]. Chinese Journal of Scientific and Technical Periodicals, 2005,16(2):242-244.)
[29]
Srivastava N, Hinton G, Krizhevsky A, et al. Dropout: A Simple Way to Prevent Neural Networks from Overfitting[J]. Journal of Machine Learning Research, 2014,15(1):1929-1958.
( Li Zhiyu, Liang Xun, Zhou Xiaoping, et al. A Link Prediction Method for Large-Scale Networks[J]. Chinese Journal of Computer, 2016,39(10):1947-1964.)