Please wait a minute...
New Technology of Library and Information Service  2010, Vol. 26 Issue (6): 42-47    DOI: 10.11925/infotech.1003-3513.2010.06.07
article Current Issue | Archive | Adv Search |
Overview on the Advance of the Research on Named Entity Recognition
Sun Zhen1,3  Wang Huilin2
1(Department of Information Management, Peking University,Beijing 100871,China)
2(Institute of Scientific & Technical Information of China,Beijing 100038,China)
3(National Administration for Code Allocation to Organizations, Beijing 100029,China)
Download: PDF(393 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  

 The paper introduces the background and motivation of Named Entity Recognition,and summarizes the history development of Named Entity Recognition at home and abroad, as well as the related technology and evaluation method. Finally,it discusses the new development trends on Named Entity Recognition.

Key words Named entity recognition      Information extraction      Information retrieval      Machine translation      Evaluation methods     
Received: 24 March 2010      Published: 26 July 2010
: 

 

 
  TP391

 
Corresponding Authors: Sun Zhen     E-mail: sunzhenyh@yahoo.com.cn

Cite this article:

Sun Zhen Wang Huilin. Overview on the Advance of the Research on Named Entity Recognition. New Technology of Library and Information Service, 2010, 26(6): 42-47.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2010.06.07     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2010/V26/I6/42

[1] Chinchor N. MUC-7 Named Entity Task Definition[C]. In:Proceedings of the 7th Message Understanding Conference,Virginia. 1998.
[2] Sproat R,Emerson T. The First International Chinese Word Segmentation Bakeoff[C].In:Proceedings of the 2nd SIGHAN Workshop on Chinese Language Processing,Sapporo,Japan. 2003:133-143.
[3] Rau L F. Extracting Company Names from Text[C].In:Proceedings of the 7th IEEE Conference on Artificial Intelligence Applications.1991:29-32.
[4] Grishman R, Sundheim B. Message Understanding Conference-6:A Brief History[C]. In:Proceedings of the 16th International Conference on Computational Linguistics. 1996.
[5] Chinchor N A.Overview of MUC-7/MET-2[C]. In:Proceedings of the 7th Message Understanding Conference.1998.
[6] 孙茂松,黄昌宁,高海燕,等.中文姓名的自动辨识[J].中文信息学报,1995,9(2):16-27.
[7] 张小衡,王玲玲.中文机构名称的识别与分析[J]. 中文信息学报,1997, 11(4):21-32.
[8] Zhang Y, Zhou J F. A Trainable Method for Extracting Chinese Entity Names and Their Relations[C]. In: Proceedings of the 2nd Chinese Language Processing Workshop,HongKong.2000:66-76.
[9] Bikel D M,Schwarta R, Weischedel R M.An Algorithm that Learns What’s in a Name[J].Machine Learning Journal Special Issue on Natural Language Learning,1999,34(1-3):211-231.
[10] Liao W,Veeramachaneni S. A Simple Semi-supervised Algorithm for Named Entity Recognition[C].In:Proceedings of the NAACL HLT 2009 Workshop on Semi-supervised Learning for Natural Language Processing.2009:58-65.
[11] Ratinov L, Roth D.Design Challenges and Misconceptions in Named Entity Recognition[C]. In:Proceedings of the 13th Conference on Computational Natural Language Learning.2009:147–155.
[12] Tsai T, Wu S, Lee C,et al. Mencius:A Chinese Named Entity Recognizer Using the Maximum Entropy-based Hybrid Model[J].International Journal of Computational Linguistics & Chinese Language Processing,2004,9(1):65-81.
[13] 冯元勇,孙乐,李文波,等.基于单字提示特征的中文命名实体识别快速算法[J].中文信息学报,2008,22(1):105-110.
[14] 郑逢强, 林磊,刘秉权,等.《知网》在命名实体识别中的应用研究[J].中文信息学报,2008,22(5):97-101.
[15] 刘非凡,赵军,吕碧波,等.面向商务信息抽取的产品命名实体识别研究[J].中文信息学报,2006,20(1):7-13.
[16] 薛征山,郭剑毅,余正涛,等.基于HMM的中文旅游景点的识别[J].昆明理工大学学报:理工版,2009,34(6):44-48.
[17] 邱莎.基于统计的生物命名实体识别研究[D].成都:四川大学,2006.
[18] 徐薇,付滨,刘柳,等.中文命名实体识别系统的领域扩展[C].见:第9届全国计算语言学学术会议论文集. 2007.
[19] Poibeau T.The Multilingual Named Entity Recognition Framework[C].In:Proceedings of the 10th Conference on European Chapter of the Association for Computational Linguistics.2003:155-158.
[20] Sang T K. Introduction to the CoNLL-2002 Shared Task:Language-Independent Named Entity Recognition[C].In:Proceedings of the 6th Conference on Natural Language Learning, Taipei,Taiwan. Morristown, NJ, USA:Association for Computational Linguistics,2002:1-4.
[21] Sang T K,Meulder F D.Introduction to the CoNLL-2003 Shared Task:Language-Independent Named Entity Recognition[C]. In:Proceedings of the 7th Conference on Natural Language Learning at HLT-NAACL, Edmonton,Canada. Morristown, NJ, USA:Association for Computational Linguistics,2003:142-147.
[22] 王宁,葛瑞芳,苑春法,等.中文金融新闻中公司名的识别[J].中文信息学报,2002,16(2):1-6.
[23] 王丹,樊兴华. 面向短文本的命名实体识别[J].计算机应用, 2009,29(1):143-145.
[24] 张祝玉,任飞亮,朱靖波.基于条件随机场的中文命名实体识别特征比较研究[C].见:第4届全国信息检索与内容安全学术会议论文集.2008.
[25] 第一届中国中文信息学会汉语处理评测(CIPS-CLPE)暨第四届国际中文自然语言处理Bakeoff[EB/OL].[2010-01-11].http://www.china-language.gov.cn/bakeoff08/.
[26] Kazama J, Torisawa K.Exploiting Wikipedia as External Knowledge for Named Entity Recognition[C].In:Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning.2007:698-707.
[27] Cucerzan S. Large-Scale Named Entity Disambiguation Based on Wikipedia Data[C]. In:Proceedings of Empirical Methods in Natural Language Processing, Prague,Czech Republic.2007:708–716.
[28] 俞鸿魁,张华平,刘群,等.基于层叠隐马尔可夫模型的中文命名实体识别[J].通信学报,2006,27(2):87-93.
[29] Lin Y, Tsai T, Chou W, et al. A Maximum Entropy Approach to Biomedical Named Entity Recognition[C].In:Proceedings of the 4th ACM SIGKDD Workshop on Data Mining in Bioinformatics.2004.
[30] Automatic Content Extraction 2008 Evaluation Plan(ACE08)[EB/OL].(2008-05-30).[2010-01-11]. http://nist.gov/speech/tests/ace/2008/doc/ace08-evalplan.v1.2d.pdf.

[1] Zhiqiang Liu,Yuncheng Du,Shuicai Shi. Extraction of Key Information in Web News Based on Improved Hidden Markov Model[J]. 数据分析与知识发现, 2019, 3(3): 120-128.
[2] Qingmin Liu,Changqing Yao,Chongde Shi,Xiaojie Wen,Yueying Sun. Vocabulary Optimization of Neural Machine Translation for Scientific and Technical Document[J]. 数据分析与知识发现, 2019, 3(3): 76-82.
[3] Haixia Sun,Lei Wang,Yingjie Wu,Weina Hua,Junlian Li. Matching Strategies for Institution Names in Literature Database[J]. 数据分析与知识发现, 2018, 2(8): 88-97.
[4] Dongmei Mu,Shan Jin,Yuanhong Ju. Finding Association Between Diseases and Genes from Literature Abstracts[J]. 数据分析与知识发现, 2018, 2(8): 98-106.
[5] Chaofan Yang,Zhonghua Deng,Xin Peng,Bin Liu. Review of Information Retrieval Research: Case Study of Conference Papers[J]. 数据分析与知识发现, 2017, 1(7): 35-43.
[6] Xiaojuan Zhang, Yi Han. Reviews on Temporal Information Retrieval[J]. 数据分析与知识发现, 2017, 1(1): 3-15.
[7] Mingxuan Huang. Cross Language Information Retrieval Model Based on Matrix-weighted Association Patterns Mining[J]. 数据分析与知识发现, 2017, 1(1): 26-36.
[8] Ding Heng,Lu Wei. Building Standard Literature Knowledge Service System[J]. 现代图书情报技术, 2016, 32(7-8): 120-128.
[9] Heng Ding, Wei Lu. A Study on Correlation-based Cross-Modal Information Retrieval[J]. 现代图书情报技术, 2016, 32(1): 17-23.
[10] Yufeng Duan,Sisi Huang. Information Extraction from Chinese Plant Species Diversity Description Text[J]. 现代图书情报技术, 2016, 32(1): 87-96.
[11] Liu Wei, Wang Xing, Song Peiyan. A Noise Cleaning Method for Synonym Extraction Results[J]. 现代图书情报技术, 2015, 31(6): 64-70.
[12] Jiang Chuntao. Automatic Annotation of Bibliographical References in Chinese Patent Documents[J]. 现代图书情报技术, 2015, 31(10): 81-87.
[13] Mao Jin, Li Gang, Cao Yujie. Re-rank Retrieval Results Through Subject Indexing[J]. 现代图书情报技术, 2014, 30(7): 48-55.
[14] Li Xiangdong, Huo Yayong, Huang Li. Study of Book Pages Automatic Identification and Bibliographic Information Extraction[J]. 现代图书情报技术, 2014, 30(4): 71-77.
[15] Liu Yajing, Wang Yanxi, Hao Dan, Zhou Jinhui. Study on the Methods of Institutional Repository Supporting Research Services[J]. 现代图书情报技术, 2014, 30(3): 1-7.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn