[Objective] This paper tries to automatically finish the disambiguation of author names in institutional repositories, and then provide human intervention mechanism at the right time. [Methods] First, we analyzed the unqiue features of the author name disambiguation. Then, we constructed a general disambiguation framework for the institutional repository. [Results] Our framework achieved good results in practice with more than 99% of precision. [Limitations] We did not examine the author names without affiliation addresses, and there may be exceptions in the alias of authors and institutions. [Conclusions] This framework could effectively disambiguate author names in institutional repositories, which helps us provide more value-added services.
(Liu Wei, Zhu Zhongming, Zhang Wangqiang, et al.Development and Research of Author Identifier and Item Claim Service for Institutional Repository[J]. New Technology of Library and Information Service, 2014(3): 8-13.)
(Chen Jiayong, Zhou Jie, Li Ling, et al.Research on Author Claim Pattern for University Institutional Repository Based on Paper-Entity Relationship Model[J]. Information Studies: Theory & Application, 2015, 38(2): 59-63.)
Han H, Giles L, Zha H, et al.Two Supervised Learning Approaches for Name Disambiguation in Author Citations[C]// Proceedings of the 4th ACM/IEEE Joint Conference on Digital Libraries. New York: ACM, 2004: 296-305.
Treeratpituk P, Giles C L.Disambiguating Authors in Academic Publications Using Random Forests[C]// Proceedings of the 9th ACM/IEEE- CS Joint Conference on Digital Libraries. New York: ACM, 2009: 39-48.
Fan X M, Wang J Y, Pu X, et al.On Graph-based Name Disambiguation[J]. Journal of Data and Information Quality, 2011, 2(2): 23-56.
Song Y, Huang J, Councill I G, et al.Efficient Topic-based Unsupervised Name Disambiguation[C]//Proceedings of the 7th ACM/IEEE – CS Joint Conference on Digital Libraries. New York: ACM, 2007: 342-351.