Please wait a minute...
Advanced Search
现代图书情报技术  2013, Vol. Issue (12): 62-69     https://doi.org/10.11925/infotech.1003-3513.2013.12.10
  知识组织与知识管理 本期目录 | 过刊浏览 | 高级检索 |
利用作者关键词网络探测作者相似性
刘萍, 郭月培, 郭怡婷
武汉大学信息管理学院 武汉 430072
Use of Author-Keyword Network for Detecting Author Similarity
Liu Ping, Guo Yuepei, Guo Yiting
School of Information Management, Wuhan University, Wuhan 430072, China
全文: PDF (1691 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 准确识别作者研究内容的相似度,是探测学科知识结构和挖掘潜在合作关系的重要基础工作,也是近年来图书情报学的研究热点。现有的相似度计算方法大都依赖于属性的直接关联,忽略属性间的间接关联。提出一种新的基于作者关键词网络的作者相似度计算方法,通过向量空间模型计算出关键词之间的关联度,再利用图结构相似度算法P-Rank挖掘出作者间的间接关联关系。初步实验表明该方法能够有效地识别作者之间的相似度,相比于传统的关键词耦合和向量空间模型算法,该方法可以明显地提高作者相似度计算的准确性。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
刘萍
郭月培
郭怡婷
关键词 作者关键词网络相似度P-Rank    
Abstract:Accurately measuring the authors similarities is the fundamental work of detecting disciplinary knowledge structure and mining potential cooperative relationships, it is one of important research issues in library and information science. Current approaches rely on direct associations between properties,and the indirect associations between properties are ignored. This paper proposes a new measurement of authors similarities based on author-keyword network. Firstly the relatedness of keywords are calculated based on Vector Space Model, and then structure similarity algorithm P-Rank is used to calculate the similarities between authors.The initial experiment demonstrates the effectiveness of the proposed approach. Compared with keyword-coupling method and Vector Space Model method, this approach obtains more meaningful results.
Key wordsAuthor-Keyword network    Similarity    P-Rank
收稿日期: 2013-08-19      出版日期: 2014-01-08
:  G350  
基金资助:本文系国家自然科学基金项目“面向知识创新的科研组织知识社区挖掘——从社会资本角度”(项目编号:71203164)的研究成果之一。
通讯作者: 刘萍     E-mail: pliuleeds@126.com
引用本文:   
刘萍, 郭月培, 郭怡婷. 利用作者关键词网络探测作者相似性[J]. 现代图书情报技术, 2013, (12): 62-69.
Liu Ping, Guo Yuepei, Guo Yiting. Use of Author-Keyword Network for Detecting Author Similarity. New Technology of Library and Information Service, 2013, (12): 62-69.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2013.12.10      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2013/V/I12/62
[1] 马瑞敏, 倪超群. 基于作者同被引分析的我国图书情报学知识结构及其演变研究[J]. 中国图书馆学报, 2011, 37(6): 17-26.(Ma Ruimin, Ni Chaoqun. A Study on Intellectual Structure and the Evolution of Library and Information Science in China Based on Author Co-citation Analysis[J].Journal of Library Science in China, 2011, 37(6): 17-26.)
[2] 韩瑞凯, 孟嗣仪, 刘云, 等. 基于兴趣相似度的社区结构发现算法研究[J]. 铁路计算机应用, 2010, 19 (10): 10-14.(Han Ruikai, Meng Siyi, Liu Yun, et al. Research on Algorithm of Community Structure Detection Based on Interest Similarity[J].Railway Computer Application, 2010, 19(10): 10-14.)
[3] 陈卫静, 郑颖. 基于作者关键词耦合的潜在合作关系挖掘[J]. 情报杂志, 2013, 32(5): 127-131.(Chen Weijing, Zheng Ying. Mining Potential Cooperative Relationships Based on the Author Keyword Coupling Analysis[J]. Journal of Intelligence, 2013, 32(5):127-131.)
[4] Ding Y. Scientific Collaboration and Endorsement: Network Analysis of Coauthorship and Citation Networks[J]. Journal of Informetrics, 2011, 5(1):187-203.
[5] Liu X, Bollen J, Nelson M L, et al. Co-authorship Networks in the Digital Library Research Community[J]. Information Processing & Management, 2005, 41(6):1462-1480.
[6] Lu K, Wolfram D. Measuring Author Research Relatedness: A Comparison of Word-based, Topic-based, and Author Cocitation Approaches[J]. Journal of the American Society for Information Science and Technology, 2012, 63(10): 1973-1986.
[7] Small H. Co-citation in the Scientific Literature: A New Measure of the Relationship Between Two Documents[J]. Journal of the American Society for Information Science, 1973, 24(4): 265-269.
[8] Ahlgren P, Jarneving B, Rousseau R. Requirements for a Cocitation Similarity Measure, with Special Reference to Pearson's Correlation Coefficient[J].Journal of the American Society for Information Science and Technology, 2003, 54(6): 550-560.
[9] Zhao D, Strotmann A. Counting First, Last, or All Authors in Citation Analysis: A Comprehensive Comparison in the Highly Collaborative Stem Cell Research Field[J]. Journal of the American Society for Information Science and Technology, 2011, 62(4): 654-676.
[10] Zhao D, Strotmann A. Evolution of Research Activities and Intellectual Influences in Information Science 1996-2005: Introducing Author Bibliographic-coupling Analysis[J]. Journal of the American Society for Information Science and Technology, 2008, 59(13): 2070-2086.
[11] Kessler M M. Bibliographic Coupling Between Scientific Papers[J].American Documentation, 1963, 14(1):10-25.
[12] 孙海生. 作者关键词共现网络及实证研究[J]. 情报杂志, 2012, 31(9): 63-67.(Sun Haisheng. Author Keyword Co-occurrence Network Analysis and Empirical Research[J]. Journal of Intelligence, 2012, 31(9): 63-67.)
[13] 陈仕吉. 科学研究前沿探测方法综述[J]. 现代图书情报技术, 2009(9):28-33.(Chen Shiji. Survey of Approaches to Research Front Detection[J].New Technology of Library and Information Service, 2009(9): 28-33.)
[14] Salton G, Wong A, Yang C S. A Vector Space Model for Automatic Indexing[J].Communications of ACM, 1975, 18(11):613-620.
[15] 刘志辉, 郑彦宁.基于作者关键词耦合分析的研究专业识别方法研究[J]. 情报学报, 2013, 32(8):788-796.(Liu Zhihui, Zheng Yanning. Research on AKCA Based Research Specialty Identification[J]. Journal of the China Society for Scientific and Technical Information, 2013, 32(8): 788-796.)
[16] Leydesdorff L. Why Words and Co-words Cannot Map the Development of the Sciences[J]. Journal of the American Society for Information Science, 1997, 48(5): 418-427.
[17] Jeh G, Widom J. SimRank: A Measure of Structural-context Similarity[C].In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'02). New York: ACM, 2002: 538-543.
[18] Zhao P, Han J, Sun Y. P-Rank: A Comprehensive Structural Similarity Measure over Information Networks[C].In: Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM'09). New York: ACM, 2009: 553-562.
[19] 范鑫鑫. 基于页面相似度的链接分析算法研究[D]. 大连:大连理工大学, 2008.(Fan Xinxin. Research of Link Analysis Algorithm Based on Distributed Factor[D]. Dalian: Dalian University of Technology, 2008.)
[20] Jin R, Lee V E, Hong H. Axiomatic Ranking of Network Role Similarity[C].In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2011: 922-930.
[21] Lee P, Lakshmanan L V S, Yu J X. On Top-k Structural Similarity Search[C].In: Proceedings of the 28th IEEE International Conference on Data Engineering (ICDE'12).Washington DC: IEEE Computer Society, 2012:774-785.
[22] IBM. SPSS Software[EB/OL].[2013-09-25]. http://www-01.ibm.com/software/analytics/spss/.
[23] Cilibrasi R L, Vitanyi P M B. The Google Similarity Distance[J]. IEEE Transactions on Knowledge and Data Engineering, 2007, 19(3): 370-383.
[24] 盛志超, 陶晓鹏. 基于维基百科的语义相似度计算方法[J]. 计算机工程, 2011, 37(7):193-195. (Sheng Zhichao, Tao Xiaopeng. Semantic Similarity Computing Method Based on Wikipedia[J]. Computer Engineering, 2011, 37(7):193-195.)
[1] 韩辉, 刘秀文. 海事适任评估中主观题自动评分技术研究*[J]. 数据分析与知识发现, 2021, 5(8): 113-121.
[2] 刘文斌, 何彦青, 吴振峰, 董诚. 基于BERT和多相似度融合的句子对齐方法研究*[J]. 数据分析与知识发现, 2021, 5(7): 48-58.
[3] 闫强,张笑妍,周思敏. 基于义原相似度的关键词抽取方法 *[J]. 数据分析与知识发现, 2021, 5(4): 80-89.
[4] 向卓元,刘志聪,吴玉. 基于用户行为自适应推荐模型研究 *[J]. 数据分析与知识发现, 2021, 5(4): 103-114.
[5] 吕学强,罗艺雄,李家全,游新冬. 中文专利侵权检测研究综述*[J]. 数据分析与知识发现, 2021, 5(3): 60-68.
[6] 吴彦文, 蔡秋亭, 刘智, 邓云泽. 融合多源数据和场景相似度计算的数字资源推荐研究*[J]. 数据分析与知识发现, 2021, 5(11): 114-123.
[7] 盛嘉祺, 许鑫. 融合主题相似度与合著网络的学者标签扩展方法研究*[J]. 数据分析与知识发现, 2020, 4(8): 75-85.
[8] 徐以聪,田学东,李新福,杨芳,史青宣. 基于犹豫模糊权重的数学表达式检索 *[J]. 数据分析与知识发现, 2020, 4(7): 118-126.
[9] 苏庆,陈思兆,吴伟民,李小妹,黄佃宽. 基于学习情况协同过滤算法的个性化学习推荐模型研究*[J]. 数据分析与知识发现, 2020, 4(5): 105-117.
[10] 刘萍,彭小芳. 基于形式概念分析的词汇相似度计算*[J]. 数据分析与知识发现, 2020, 4(5): 66-74.
[11] 高原,施元磊,张蕾,曹天奕,冯筠. 基于游记文本的游客游览行程重构*[J]. 数据分析与知识发现, 2020, 4(2/3): 165-172.
[12] 李家全,李宝安,游新冬,吕学强. 基于专利知识图谱的专利术语相似度计算研究*[J]. 数据分析与知识发现, 2020, 4(10): 104-112.
[13] 俞琰,陈磊,姜金德,赵乃瑄. 结合词向量和统计特征的专利相似度测量方法 *[J]. 数据分析与知识发现, 2019, 3(9): 53-59.
[14] 关鹏,王曰芬,傅柱. 基于LDA的主题语义演化分析方法研究 * ——以锂离子电池领域为例[J]. 数据分析与知识发现, 2019, 3(7): 61-72.
[15] 张佩瑶,刘东苏. 基于词向量和BTM的短文本话题演化分析*[J]. 数据分析与知识发现, 2019, 3(3): 95-101.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn