|
|
Application on Information Extraction from Factual Information Based on Conditional Random Fields Method |
Wu Shuai |
China Defense Science & Technology Information Center, Beijing 100142, China |
|
|
Abstract A method based on the Conditional Random Fields (CRFs) is proposed to extract the information of unstructured factual information text, and the method of parameter estimation and feature selection is also anlyzed. During information extraction, the author blocks the text firstly with the help of format information such as separator and special identifier, and then extracts the designated block with Conditional Random Fields. The proposed method is applied in Global Weapon Knowledge Base System (GWKBS), and experiment results show that it has a better precision and recall performance.
|
Received: 11 March 2010
Published: 04 January 2011
|
|
[1] 李保利,陈玉忠,俞士汶.信息抽取研究综述 [J]. 计算机工程与应用 ,2003,39(10):1-5.
[2] Seymore K, McCallum A, Rosenfeld R. Learning Hidden Markov Model Structure for Information Extraction . In: Proceedings of the AAAI’99 Workshop on Machine Learning for Information Extraction. 1999:37-42.
[3] 林亚平,刘云中,周顺先,等. 基于最大熵的隐马尔可夫模型文本信息抽取 [J]. 电子学报 , 2005,33 (2):236-240.
[4] 刘云中,林亚平,陈治平. 基于隐马尔可夫模型的文本信息抽取 [J]. 系统仿真学报 , 2004,16(3):507-510.
[5] 张玲,黄铁军,高文. 基于隐马尔可夫模型的引文信息提取 [J]. 计算机工程 , 2003,29(20):33-34,54.
[6] Han H, Giles C,Manavoglu E, et al. Automatic Document Metadata Extraction Using Support Vector Machines . In: Proceedings of Joint Conference on Digital Libraries. 2003:37-48.
[7] Lafferty J, McCallum A, Pereira F. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data . In: Proceedings of the 18th International Conference on Machine Learning. 2001:282-289.
[8] Byrd R H, Nocedal J, Schnabel R B. Representations of Quasi-Newton Matrices and Their Use in Limited Memory Methods [J]. Mathematical Programming, 1994 (2):129-156.
[9] Darroch J N, Ratcliff D. Generalized Iterative Scaling for Log-linear Models [J]. Annals of Mathematical Statistics,1972,43(5):1470-1480.
[10] Della Pietra S, Della Pietra V, Lafferty J. Inducing Features of Random Fields [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997,19(4):380-393.
[11] Peng F, McCallum A. Accurate Information Extraction from Research Papers Using Conditional Random Fields [J]. Information Processing & Management,2006,42(4):963-979.
[12] Sha F, Pereira F. Shallow Parsing with Conditional Random Fields . In: Proceedings of Human Language Technology NAACL. 2003:134-141.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|