|
|
Named Entity Extraction Model Based on Hierarchical Pattern Matching |
Wang Hao |
(Department of Information Management,Nanjing University,Nanjing 210093,China) |
|
|
Abstract This paper emphasizes the process of extraction and classification of Expression Named Entity(ENE) in non-structured Chinese text, attempts to construct pattern collection for matching and builds the ENE Extraction Model Based Hierarchical Pattern Matching(HPM_ENE_EM), which is the base of the application research on intelligence, such as Competitive Intelligence System(CIS),user interest degree gaining and so on. At last, the paper discusses the detailed application of this model used for extracting the abbreviative terms in academic papers.
|
Received: 26 March 2007
Published: 25 May 2007
|
|
Corresponding Authors:
Wang Hao
E-mail: ywhaowang810710@sina.com
|
About author:: Wang Hao |
1王睿, 张洁, 张由仪等. 基于混合模型的中文命名实体抽取系统. 清华大学学报(自然科学版), 2005(S1):1908-1914
2Chen H H, Ding Y W, Tsa S C, et a1. Description of the NITU System Used for MET2. In: Proc. of 7th Message Understanding Conference, 1998
3B1ack W J, Rinaldi F, Mowatt D. Facile: Description of the NE System Used For MUC-7. In: Proc. of 7th Message Understanding Conf, 1998
4Fukumoto J, Shimohata M, Masui F,et al. Electric Industry: Description of the Oki System as Used for MET-2. In: Proc. of 7th Message Understanding Conf, 1998
5Berners-Lee T, Fischetti M,Dertouzos T M. Weaving the Web: The Original Design and Ultimate Destiny of the World Wide Web by its Inventor. Harper, San Francisco. 1999
6Zhou G D, Su J. Named Entity Recognition using an HMM-based Chunk Tagger. In: Proc. of the 40th Annual Meeting of the ACL,Philadelphia, PA 2002, 473-480
7Bender O,Och F J,Ney H. Maximum Entropy Models for Named Entity Recognition, Proceedings of the Conference on Computational Natural Language Learning. Edmonton, Canada, 2003, 148-151
8庄明, 老松杨, 吴玲达. 一种统计和词性相结合的命名实体发现方法. 计算机应用, 2004(01):22-24
9王胜, 朱明. 基于最大熵马尔可夫模型的地址信息抽取. 计算机工程与应用, 2005(21):192-194 |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|