Predicting Co-authorship with Combination of Paths in Paper-author Bipartite Networks
Zhang Jinzhu1, Wang Xiaomei2, Han Tao2
1School of Economics and Management, Nanjing University of Science and Technology, Nanjing 210094, China; 2National Science Library, Chinese Academy of Sciences, Beijing 100190, China
[Objective] This paper aims to predict co-authorship more effectively and to reduce information loss. [Methods] First, we constructed a paper-author bipartite network and its co-authorship counterpart for the field of library and information science. Second, we described the relationships among authors using paths of length two and three in the bipartite network. Third, we used logistic regression to learn the influence of the different factors. Finally, we predicted co-authorship in the paper-author bipartite network with various indicators. [Results] We found significant information loss when the paper-author bipartite network is converted into the co-authorship network. Logistic regression was an appropriate way to learn the contributions of paths. The new indicators were more accurate, and the predicted co-authorships could be interpreted more easily. [Limitations] We did not include multiple-path methods in the present study, and more research is needed to examine the proposed method in other fields. [Conclusions] Co-authorship prediction should be conducted in the paper-author bipartite network to reduce information loss. The path-combination indicator in the paper-author bipartite network might be the most effective method for predicting co-authorship, and it could also be applied to patent-inventor bipartite networks.
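The information loss mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; the toy data and the helper names `project`, `shared_papers`, and `author_degree` are hypothetical. It shows that one three-author paper and three two-author papers give the same unweighted co-authorship network, although they differ in the paper-author bipartite network.

```python
from itertools import combinations


def project(papers):
    """Project a paper -> authors mapping onto an unweighted co-authorship edge set."""
    edges = set()
    for authors in papers.values():
        for a, b in combinations(sorted(authors), 2):
            edges.add((a, b))
    return edges


def shared_papers(papers, a, b):
    """Count author-paper-author paths (bipartite paths of length two) between a and b."""
    return sum(1 for authors in papers.values() if a in authors and b in authors)


def author_degree(papers, a):
    """Number of papers linked to author a in the bipartite network."""
    return sum(1 for authors in papers.values() if a in authors)


# Hypothetical toy data: one three-author paper vs. three two-author papers.
one_paper = {"p1": {"A", "B", "C"}}
three_papers = {"p1": {"A", "B"}, "p2": {"B", "C"}, "p3": {"A", "C"}}

# Both cases collapse to the same triangle A-B, A-C, B-C after projection.
same_projection = project(one_paper) == project(three_papers)

# The bipartite network still tells the two cases apart.
degree_one = author_degree(one_paper, "A")
degree_three = author_degree(three_papers, "A")
```

In this toy case the projection cannot distinguish a single team paper from three separate pairwise collaborations, while the bipartite degree of author A (one paper versus two) can. Path counts of this kind are the sort of feature that a logistic regression model could weight when predicting new co-authorships.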
Zhang Jinzhu, Wang Xiaomei, Han Tao. Predicting Co-authorship with Combination of Paths in Paper-author Bipartite Networks[J]. New Technology of Library and Information Service (现代图书情报技术), 2016, 32(10): 42-49.