Similarity Measurement of Research Interests in Semantic Network
Ba Zhichao1,2(),Li Gang1,Zhu Shiwei2
1School of Information Management, Wuhan University, Wuhan 430072, China 2Information Research Institute of Shandong Academy of Sciences, Ji’nan 250014, China
[Objective] This study aims to identify relationship among authors of papers with similar contents but different keywords, and then tries to add more sematic factors to the co-occurrence analysis. [Methods] We proposed a method to gauge the similarity of research interests based on the keywords semantic network system. First, all keywords were represented as word vectors and translated into low dismension distribution with the help of neural network language—word2vec model. Second, we calculated the semantic association of keywords to build up a semantic network. Finally, we adopted the Jensen-Shannon distance method to measure the similarity of research interests. [Results] The proposed approach can accurately identify the similarities of co-occurrence and non co-occurrence terms and then effectively predict potential cooperation among authors. [Limitations] The amount and accuracy of training materials need to be increased. At present, we could only find potential cooperation between two authors. More research is needed to explore the possibilities of cooperation among multi-authors. [Conclusions] The proposed method could help to improve the performance of traditional co-occurrence analysis.
巴志超,李纲,朱世伟. 基于语义网络的研究兴趣相似性度量方法*[J]. 现代图书情报技术, 2016, 32(4): 81-90.
Ba Zhichao,Li Gang,Zhu Shiwei. Similarity Measurement of Research Interests in Semantic Network. New Technology of Library and Information Service, 2016, 32(4): 81-90.
(Qiu Junping, Liu Guohui, Dong Ke.Research on Knowledge Aggregation and Discipline Structure Based on Collaboration Analysis—Taking the Field of Knowledge Management in Domestic as an Example[J]. Information Studies: Theory&Application, 2014, 37(8): 6-11.)
(Li Gang, Li Lanfeng, Mao Jin, et al.Empirical Research on Similarity of Research Interests in Co-authorship Network[J]. Library and Information Service, 2015, 59(2): 75-81.)
(Wang Fusheng, Shi Xiuchun, Yang Hongyong.Research on Scientific Collaboration Network Based on Author Cliques[J]. Information Studies: Theory & Application, 2009, 32(1): 35-37.)
[4]
Abramo G, D’Angelo C A, Costa F. Identifying Interdisciplinary Through the Disciplinary Classification of Coauthors of Scientific Publications[J]. Journal of the American Society for Information Science and Technology, 2012, 63(11): 2206-2222.
(Qiu Junping, Zhang Xiaopei.Author Co-citation Analysis of Knowledge Management in China Based on the CSSCI[J]. Information Science, 2011, 29(10): 1141-1145.)
(Song Yanhui, Wu Yishan.Resarch on Knowledge Structure of Information Science Based on Author Bibliographic-coupling Analysis[J]. Library and Information Service, 2014, 58(1): 117-123.)
[7]
孙海生. 作者关键词共现网络及实证研究[J]. 情报杂志, 2012, 31(9): 63-67.
[7]
(Sun Haisheng.Author Keyword Co-Occurrence Network Analysis: An Empirical Research[J]. Journal of Intelligence, 2012, 31(9): 63-67.)
(Liu Ping, Guo Yuepei, Guo Yiting.Use of Author-Keyword Network for Detecting Author Similarity[J]. New Technology of Library and Information, 2013(12): 62-69.)
[9]
Jan Van Eck N, Waltman L. Appropritate Similarity Measure for Author Co-citation Analysis[J]. Journal of the American Society for Information Science and Technology, 2008, 59(10): 1653-1661.
(Qiu Junping, Li Xiaotao.Research on Knowledge Diffusion Based on Citation Network Mining and Timing Analysis[J]. Information Studies: Theory & Application, 2014, 37(7): 5-10.)
[11]
Zhao D, Strotman A.Evolution of Research Activities and Intellectual Influences in Information Science 1996-2005: Introducing Author Bibliographic-coupling Analysis[J]. Journal of the American Society for Information Science and Technology, 2008, 59(13): 2070-2086.
(Chen Yuan, Wang Feifei.An Analysis on the Bibliographic Coupling in the Field of Information Studies in China: Based on CSSCI[J]. Information and Documentation Services, 2011, 32(5): 6-12.)
(Wang Zhijin, Zhou Peng, Xie Lina.The Identification and Explanation of Research Fields of Contemporary Information Science in China Using ABCA Method[J]. Journal of the China Society for Scientific and Technical Information, 2013, 32(1): 4-12.)
(Chen Weijing, Zheng Ying.Mining Potential Cooperative Relationships Based on the Author Keyword Coupling Analysis[J]. Journal of Intelligence, 2013, 32(5): 127-131.)
[15]
Morris S A, Yen G G.Crossmaps: Visualization of Overlapping Relationships in Collections of Journal Papers[J]. Proceedings of the National Academy of Sciences, 2004, 101(S1): 5291-5296.)
[16]
Onyancha O B, Ocholla D N.Is HTV/AIDS in Africa Distinct? What Can We Learn from an Analysis of the Literature[J]. Scientometrics, 2009, 79(1): 277-296.
(Qiu Junping, Chen Mupei.Research on Author Collaboration in the Metrology Field in China[J]. Information Studies: Theory&Application, 2012, 35(11): 56-60.)
(Ding Jingda.Characteristics and Regularity in Scientific Communication Within Innovative Knowledge Community: An Empirical Study of a State Key Laboratory[J]. Journal of the China Society for Scientific and Technical Information, 2011, 30(10): 1086-1094.)
[19]
Mikolov T, Sutskever I, Chen K, et al.Distributed Representations of Words and Phrases and Their Compositionality [C]. In: Proceedings of the Neural Infornational Processing Systems Conference. Nevada, United States: Neural Information Processing Systems Foundation, 2013: 3111-3119.)
[20]
Morin F, Bengio Y.Hierarchical Probabilistic Neural Network Language Model [C]. In: Proceedings of the International Workshop on Artificial Intelligence and Statistics. Cambridge: Cambridge University Press, 2005: 246-252.
[21]
Polzehl J, Spokoiny V.Propagation-Separation Approach for Local Likelihood Estimation[J]. Probability Theory and Related Fields, 2006, 135(3): 335-362.
[22]
Callon M, Courtial J P, Laville F.Co-word Analysis as a Tool for Describing the Network of Interactions Between Basic and Technological Research: The Case of Polymer Chemsitry[J]. Scientmetrics, 1991, 22(1): 155-205.
(Zheng Huachuan, Yu Xiaoou, Xin Yan.Antigen CD44 with Clustered Analysis of Co-words: A Status Quo Investigation[J]. Chinese Journal of Medical Library and Information Science, 2002, 11(2): 1-3.)
[24]
Endres D M, Schindelin J E.A New Metric for Probability Distributions[J]. IEEE Transactions on Information Theory, 2003, 49(7): 1858-1860.