Please wait a minute...
Advanced Search
现代图书情报技术  2012, Vol. 28 Issue (5): 55-59     https://doi.org/10.11925/infotech.1003-3513.2012.05.08
  知识组织与知识管理 本期目录 | 过刊浏览 | 高级检索 |
一种面向篇级数据的作者名消歧规则和算法
肖晶, 梁冰, 张晓丹, 吕世炅
中国科学技术信息研究所 北京 100038
Author Disambiguation Rules and Algorithm for Article Level Data
Xiao Jing, Liang Bing, Zhang Xiaodan, Lv Shijiong
Institute of Scientific & Technical Information of China, Beijing 100038, China
全文: PDF (503 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 在深入分析NSTL篇级元数据特点的基础上,结合模糊匹配算法,提出一种适合NSTL现有数据的人名消歧规则集,并给出基于该规则集的人名消歧算法。通过对实际数据集的实验,该算法在准确率、召回率等指标方面都有良好的表现,具备较好的消歧效果。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
肖晶
梁冰
张晓丹
吕世炅
关键词 作者名消歧模糊匹配篇级数据消歧算法    
Abstract:This paper analyzes the article level data in NSTL, then presents a rule set for name disambiguation combining with fuzzy matching algorithm, and provides the relevant name disambiguation algorithm. Through the experiment based on the actual data set, it is found that the algorithm gives a good precision and recall value, which is a good effect for name disambiguation.
Key wordsAuthor disambiguation    Fuzzy matching    Article level data    Disambiguation algorithm
收稿日期: 2012-04-05      出版日期: 2012-07-24
: 

TP391

 
基金资助:

本文系“十二五”国家科技支撑计划项目“信息资源自动处理、智能检索与STKOS应用服务集成”(项目编号:2011BHA10B05)的研究成果之一。

引用本文:   
肖晶, 梁冰, 张晓丹, 吕世炅. 一种面向篇级数据的作者名消歧规则和算法[J]. 现代图书情报技术, 2012, 28(5): 55-59.
Xiao Jing, Liang Bing, Zhang Xiaodan, Lv Shijiong. Author Disambiguation Rules and Algorithm for Article Level Data. New Technology of Library and Information Service, 2012, 28(5): 55-59.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2012.05.08      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2012/V28/I5/55
[1] Bagga A,Baldwin B.Entity-based Cross-document Coreferencing Using the Vector Space Model[C].In:Proceedings of the 17th International Conference on Computational Linguistics.1998:75-85.

[2] Mann G S,Yarowsky D. Unsupervised Personal Name Disambiguation[C].In:Proceedings of the 7th Conference on Natural Language Learning at HLT-NAACL 2003 (CoNLL-2003).2003:33-40.

[3] Fleischman M B, Hovy E. Multi-Document PerSon Name Resolution[C]. In:Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics,Reference Resolution Workshop.2004.

[4] Malin B.Unsupervised Name Disambiguation via Social Network Similarity[C].In:Proceedings of the SIAM International Conference on Data Mining,Workshop on Link Analysis,Counterterrorism, and Security in Conjunction.2005: 93-102.

[5] 郎君,秦兵,宋巍,等.基于社会网络的人名检索结果重名消解[J]. 计算机学报 , 2009, 32(7):1365-1373.(Lang Jun,Qin Bing,Song Wei,et al. Person Name Disambiguation of Searching Results Using Social Network[J]. Chinese Journal of Computers, 2009, 32(7):1365-1373.)

[6] Tang J, Zhang J, Zhang D, et al.A Unified Framework for Name Disambiguation[C]. In:Proceedings of the 17th International Conference on World Wide Web.2008:1205-1206.

[7] Chen C, Hu J F, Wang H F. Clustering Technique in Multi-document Personal Name Disambiguation[C]. In: Proceedings of the ACL-IJNCLP 2009 Student Research Workshop,Suntex, Singaore. Stroudsburg, PA, USA:Association for Computational Linguistics,2009:88-95.

[8] ORCID. Welcome to ORCID [EB/OL].[2012-03-02].http://about.orcid.org/.

[9] Bagga A. Evaluation of Coreferences and Coreference Resolution Systems[C].In:Proceedings of the 1st International Conference on Language Resources and Evaluation.Granada:European Language Resources Association,1998.

[10] Zhang D, Tang J, Li J Z, et al. A Constraint-based Probabilistic Framework for Name Disambiguation[C]. In: Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM'2007). 2007:1019-1022.

[11] Kang I S,Na S H,Lee S, et al.On Co-authorship for Author Disambiguation[J].Information Processing & Management, 2009,45(1): 84-97.

[12] McRae-Spencer D M, Shadbolt N R. Also by the Same Author: AKTiveAuthor, a Citation Graph Approach to Name Disambiguation[C].In: Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries. New York, NY, USA:ACM,2006:53-54.

[13] 章顺瑞,游宏梁. 基于层次聚类算法的中文人名消歧[J]. 现代图书情报技术 , 2010(11):64-68.(Zhang Shunrui,You Hongliang.Chinese People Name Disambiguation by Hierarchical Clustering[J].New Technology of Library and Information Service, 2010(11):64-68.)

[14] 董国志,朱玉全,程显毅.中文人称代词指代消解的研究[J]. 计算机应用研究 ,2011,28(5):1774-1779.(Dong Guozhi,Zhu Yuquan,Cheng Xianyi.Search on Personal Pronoun Anaphora Resolution in Chinese[J].Application Research of Computers, 2011,28(5):1774-1779.)
[1] 沈喆, 王毅, 姚毅凡, 成颖. 面向学术文献的作者名消歧方法研究综述*[J]. 数据分析与知识发现, 2020, 4(8): 15-27.
[2] 张旺强,祝忠明,李雅梅,卢利农,刘巍. 机构知识库作者名自动消歧框架设计与实践*[J]. 数据分析与知识发现, 2019, 3(6): 92-98.
[3] 杨波, 杨军威, 阎素兰. 基于规则的机构名规范化研究[J]. 现代图书情报技术, 2015, 31(6): 57-63.
[4] 郭舒. 文献数据库中作者名消歧算法研究[J]. 现代图书情报技术, 2013, 29(7/8): 69-74.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn