|
|
Disambiguating Author Names Automatically for Institutional Repository |
Wangqiang Zhang1(),Zhongming Zhu1,Yamei Li2,Linong Lu1,Wei Liu1 |
1(Lanzhou Information Center, Chinese Academy of Sciences, Lanzhou 730000, China) 2(ShanghaiTech University Library, Shanghai 201210, China) |
|
|
Abstract [Objective] This paper tries to automatically finish the disambiguation of author names in institutional repositories, and then provide human intervention mechanism at the right time. [Methods] First, we analyzed the unqiue features of the author name disambiguation. Then, we constructed a general disambiguation framework for the institutional repository. [Results] Our framework achieved good results in practice with more than 99% of precision. [Limitations] We did not examine the author names without affiliation addresses, and there may be exceptions in the alias of authors and institutions. [Conclusions] This framework could effectively disambiguate author names in institutional repositories, which helps us provide more value-added services.
|
Received: 07 March 2018
Published: 15 August 2019
|
[1] | Authority Control of Metadata Values[EB/OL]. [2018-02-20].. | [2] | ORCID Integration[EB/OL]. [2018-02-20].. | [3] | CSpace[EB/OL]. [2018-02-20]. . | [4] | 刘巍, 祝忠明, 张旺强, 等. 机构知识库中作者标识与作品认领机制的研究与实现[J]. 现代图书情报技术, 2014(3): 8-13. | [4] | (Liu Wei, Zhu Zhongming, Zhang Wangqiang, et al.Development and Research of Author Identifier and Item Claim Service for Institutional Repository[J]. New Technology of Library and Information Service, 2014(3): 8-13.) | [5] | 陈嘉勇, 周婕, 李玲, 等. 基于文献实体关系模型的高校机构知识库作者认领模式研究[J]. 情报理论与实践, 2015, 38(2): 59-63. | [5] | (Chen Jiayong, Zhou Jie, Li Ling, et al.Research on Author Claim Pattern for University Institutional Repository Based on Paper-Entity Relationship Model[J]. Information Studies: Theory & Application, 2015, 38(2): 59-63.) | [6] | Han H, Giles L, Zha H, et al.Two Supervised Learning Approaches for Name Disambiguation in Author Citations[C]// Proceedings of the 4th ACM/IEEE Joint Conference on Digital Libraries. New York: ACM, 2004: 296-305. | [7] | Treeratpituk P, Giles C L.Disambiguating Authors in Academic Publications Using Random Forests[C]// Proceedings of the 9th ACM/IEEE- CS Joint Conference on Digital Libraries. New York: ACM, 2009: 39-48. | [8] | Fan X M, Wang J Y, Pu X, et al.On Graph-based Name Disambiguation[J]. Journal of Data and Information Quality, 2011, 2(2): 23-56. | [9] | Song Y, Huang J, Councill I G, et al.Efficient Topic-based Unsupervised Name Disambiguation[C]//Proceedings of the 7th ACM/IEEE – CS Joint Conference on Digital Libraries. New York: ACM, 2007: 342-351. | [10] | 张雄, 陈福才, 黄瑞阳. 基于融合特征相似度的实体消歧方法研究[J]. 计算机应用研究, 2017, 34(2): 347-350, 396. | [10] | (Zhang Xiong, Chen Fucai, Huang Ruiyang.Research on Entity Disambiguation Method Based on Fusion Feature Similarity[J]. Application Research of Computers, 2017, 34(2): 347-350, 396.) | [11] | 肖晶, 梁冰, 张晓丹, 等. 一种面向篇级数据的作者名消歧规则和算法[J]. 现代图书情报技术, 2012(5): 55-59. | [11] | (Xiao Jing, Liang Bing, Zhang Xiaodan, et al.Author Disambiguation Rules and Algorithm for Article Level Data[J]. New Technology of Library and Information Service, 2012(5): 55-59.) | [12] | 上海科技大学知识管理系统[EB/OL]. [2018-02-20]. .(ShanghaiTech University Knowledge Management System)[EB/OL]. [2018-02-20]. |
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|