Please wait a minute...
New Technology of Library and Information Service  2009, Vol. 3 Issue (2): 51-55    DOI: 10.11925/infotech.1003-3513.2009.02.09
Current Issue | Archive | Adv Search |
GPE-entity Recognition Based on Conditional Random Fields
Zong Ping1,2   Shi Shuicai1,2   Wang Tao1,2   Lv Xueqiang1,2
1(Chinese Information Processing Research Center, Beijing Information Science &Technology University, Beijing 100101,China)
2(Beijing TRS Information Technology Co.Ltd., Beijing 100101,China )
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

This paper detects Geographical Political Entities (GPE) and it subtypes from the English corpus of Automatic Content Extraction (ACE) evaluation, based on Conditional Random Fields (CRFs). A feature set is extracted from the ACE corpus, and contributions of different feature sets to the detection of GPE entities are evaluated in the experiments. The results show that the feature set extracted in this paper can get higher rate of recall and accuracy.

Key wordsACE      GPE      Entity detection      CRF      Feature selection     
Received: 18 November 2008      Published: 25 February 2009
: 

TP391

 
Corresponding Authors: Zong Ping     E-mail: zong.ping@trs.com.cn
About author:: Zong Ping,Shi Shuicai,Wang Tao,Lv Xueqiang

Cite this article:

Zong Ping,Shi Shuicai,Wang Tao,Lv Xueqiang. GPE-entity Recognition Based on Conditional Random Fields. New Technology of Library and Information Service, 2009, 3(2): 51-55.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2009.02.09     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2009/V3/I2/51

[1] Linguistic Data Consortium.ACE(Automatic Content Extraction) English Annotation Guidelines for Entities Version 6.1[EB/OL].[2008-03-29].http://projects.ldc.upenn.edu/ace.
[2] ZHOU GD, SU J. Named Entity Recognition Using an HMMbased Chunk Tagger[C]. In:  Proceedings of the 40th Annual Meeting of the Association for Computation Linguistics, Philadelphia. USA:Association for Computational Linguistics,2002:473-480.
[3] Bender O, Ney H. Maximum Entropy Models for Named Entity Recognition [C]. In: Proceedings of the Conference on Computational Natural Language Learning,Edmonton,Canada. USA:Association for Computational Linguistics,2003:148-151.
[4] Lafferty J,McCallum A,Pereira F.Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Seqquence Data[J].The Journal of Manchine Learning Research,2001, ICML01:282-289.
[5] Hacioglu K,Douglas B,Chen Y. Detection of Entity Mentions Occurring in English and Chinese Text[C].In:Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing,Cannada.USA:Association for Computational Linguistics,2005(10): 379-386.
[6] The ACE 2008 Evaluation Plan.Assessment of Detection and Recognition of Entities and Relations Within and Across Documents[EB/OL].[2008-05-07].http://www.nist.gov/speech/tests/ace/ace08/doc/.
[7] Sutton C,McCallum A,Rohanimanesh K.Dynamic Conditional Random Fields:Factorized Probabilistic Models for Labeling and Segmenting Sequence Data[J].The Journal of Machine Learning Research,2007,8(3):693-723.
[8] 廖先桃.CRF理论、工具包的使用及在NE上的应用[R/OL].[2008-04-02].http://ir.hit.edu.cn/phpwebsite/index.php?module=documents&JAS_DocumentManager_op=downloadFile &JAS_File_id=215.
[9] 张海雷,曹菲菲,陈文亮,等.基于多层次特征集成的中文实体指代识别[J],中文信息学报,2007,21(5):126-130.
[10] 向晓雯.基于条件随机场的中文命名实体识别[D],厦门:厦门大学,2006.
[11] Florian R,Hassan H,Jing H,et al.Factorizing Complex Models: A Case Study in Mention Detection[J].Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics. 2006(9):473-480.
[12] 郭家清.基于条件随机场的命名实体识别研究[D],沈阳:沈阳航空工业学院,2007.

[1] Wang Hao, Lin Kerou, Meng Zhen, Li Xinlei. Identifying Multi-Type Entities in Legal Judgments with Text Representation and Feature Generation[J]. 数据分析与知识发现, 2021, 5(7): 10-25.
[2] Yu Xuehan, He Lin, Xu Jian. Extracting Events from Ancient Books Based on RoBERTa-CRF[J]. 数据分析与知识发现, 2021, 5(7): 26-35.
[3] Hu Haotian,Ji Jinfeng,Wang Dongbo,Deng Sanhong. An Integrated Platform for Food Safety Incident Entities Based on Deep Learning[J]. 数据分析与知识发现, 2021, 5(3): 12-24.
[4] Liang Jiaming, Zhao Jie, Zheng Peng, Huang Liushen, Ye Minqi, Dong Zhenning. Framework for Computing Trust in Online Short-Rent Platform Using Feature Selection of Images and Texts[J]. 数据分析与知识发现, 2021, 5(2): 129-140.
[5] Cheng Tiejun, Wang Man, Huang Baofeng, Feng Lanping. Predicting Online Public Opinion in Emergencies Based on CEEMDAN-BP[J]. 数据分析与知识发现, 2021, 5(11): 59-67.
[6] Zhu Lu,Tian Xiaomeng,Cao Sainan,Liu Yuanyuan. Subspace Cross-modal Retrieval Based on High-Order Semantic Correlation[J]. 数据分析与知识发现, 2020, 4(5): 84-91.
[7] Wei Wei,Guo Chonghui,Xing Xiaoyu. Annotating Knowledge Points & Recommending Questions Based on Semantic Association Rules[J]. 数据分析与知识发现, 2020, 4(2/3): 182-191.
[8] Xue Fuliang,Liu Lifang. Fine-Grained Sentiment Analysis with CRF and ATAE-LSTM[J]. 数据分析与知识发现, 2020, 4(2/3): 207-213.
[9] Ma Jianxia,Yuan Hui,Jiang Xiang. Extracting Name Entities from Ecological Restoration Literature with Bi-LSTM+CRF[J]. 数据分析与知识发现, 2020, 4(2/3): 78-88.
[10] Na Ma,Zhixiong Zhang,Pengmin Wu. Automatic Identification of Term Citation Object with Feature Fusion[J]. 数据分析与知识发现, 2020, 4(1): 89-98.
[11] Wangqiang Zhang,Zhongming Zhu,Yamei Li,Linong Lu,Wei Liu. Disambiguating Author Names Automatically for Institutional Repository[J]. 数据分析与知识发现, 2019, 3(6): 92-98.
[12] Cheng Zhou,Hongqin Wei. Evaluating and Classifying Patent Values Based on Self-Organizing Maps and Support Vector Machine[J]. 数据分析与知识发现, 2019, 3(5): 117-124.
[13] Jiaming Liang,Jie Zhao,Zhou Jianlong,Zhenning Dong. Detecting Collusive Fraudulent Online Transaction with Implicit User Behaviors[J]. 数据分析与知识发现, 2019, 3(5): 125-138.
[14] Tingxin Wen,Yangzi Li,Jingshuang Sun. News Hotspots Discovery Method Based on Multi Factor Feature Selection and AFOA/K-means[J]. 数据分析与知识发现, 2019, 3(4): 97-106.
[15] Zhiqiang Wu,Zhongming Zhu,Wei Liu,Sili Wang. Research and Practice on the Extension of Knowledge Analysis and Visualization Function in CSpace[J]. 数据分析与知识发现, 2019, 3(3): 112-119.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn