Please wait a minute...
New Technology of Library and Information Service  2010, Vol. 26 Issue (5): 13-17    DOI: 10.11925/infotech.1003-3513.2010.05.03
article Current Issue | Archive | Adv Search |
Research on Chinese Chemical Name Recognition Based on Heuristic Rules
Li Nan Zheng Rongting Ji JiumingTeng Qingqing
(Library of East China University of Science and Technology, Shanghai 200237, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

This paper proposes a method of domain name recognition based on heuristic rules, to overcome the shortage of traditional solution in specific domain. It firstly studies chemical name in Chinese to obtain its domain features and statistical language features, and then on the basis of such features,it puts forward several heuristic rules, which is applicable to domain name recognition of chemical literature. Comparison experiment shows this method can improve the efficiency of domain name recognition obviously.

Key wordsChemical name recognition         Heuristic rule          Domain feature          Statistical language feature        IUPAC     
Received: 09 April 2010      Published: 25 May 2010
: 

 

 
  TP391

 
Corresponding Authors: Li Nan     E-mail: ajen@ecust.edu.cn

Cite this article:

Li Nan Zheng Rongting Ji JiumingTeng Qingqing. Research on Chinese Chemical Name Recognition Based on Heuristic Rules. New Technology of Library and Information Service, 2010, 26(5): 13-17.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2010.05.03     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2010/V26/I5/13

1] 赵军.命名实体识别、排歧和跨语言关联[J].中文信息学报,2009,23(2):3-17.
[2] Grishman R, Sundhiem B. Design of the MUC-6 Evaluation[C]. In: Proceedings of the 6th Message Understanding Conference. NJ: Association for Computational Linguistics, 1995:1-11.
[3] Chen H H, Ding Y W, Tsai S C, et al. Description of the NTU System Used for MET-2[C]. In: Proceedings of the 7th Message Understanding Conference. 1998.
[4] Black W J, Rinaldi F, Mowatt D. Facile: Description of the NE System Used For MUC-7[C]. In: Proceedings of the 7th Message Understanding Conference. 1998.
[5] Sun J, Gao J F, Zhang L, et al. Chinese Named Entity Identification Using Class Based Language Model[C]. In: Proceedings of the 19th International Conference on Computational Linguistics. NJ: Association for Computational Linguistics, 2002: 1-7.
[6] Zhou G D, Su J. Named Entity Recognition Using an HMM Based Chunk Tagger[C]. In: Proceedings of the 40th Annual Meeting of the ACL. NJ: Association for Computational Linguistics, 2002: 473-480.
[7] Ramaparkhi A. A Simple Introduction to Maximum Entropy Models for Natural Language Processing[R]. Institute for Research in Cognitive Science, University of Pennsylvania, 1997.
[8] 刘建华,张智雄,徐健,等.自动术语识别——对科技文献进行文本挖掘的重要技术方法[J].现代图书情报技术,2008(8):12-17.
[9] Krauthammer M, Rzhetsky A, Morozov P, et al. Using BLAST for Identifying Gene and Protein Names in Journal Articles [J]. Gene, 2000, 259(1):245-252.
[10] 宋丹,孙济庆.基于规则的化学特征词自动标引研究[J].情报学报, 2009,28(5):689-692.
[11] Klinger R, Kolárik C, Fluck J, et al. Detection of IUPAC and IUPAC-like Chemical Names[J]. Bioinformatics, 2008, 24(13):268-276.
[12] 中国化学会.化学命名原则[M].北京:科学出版社,1984.

[1] Sun Zhen Wang Huilin. Overview on the Advance of the Research on Named Entity Recognition[J]. 现代图书情报技术, 2010, 26(6): 42-47.
[2] Li Jing Tan Ying Shi Qiaomei. Design and Implementation of E-mail Pushing Service System for Paper Indexed by Three Famous Indexes[J]. 现代图书情报技术, 2010, 26(6): 83-87.
[3] Zou RongChen WuJiang AirongZhang ChengyuYuan Hongliang. Design and Implementation of Upgrading Program of Tsinghua University Library Network[J]. 现代图书情报技术, 2010, 26(5): 79-83.
[4] Hua Bolin,Guo Jiang. System Design and Implementation of University Laboratory Web Information Extraction Based on Rules[J]. 现代图书情报技术, 2009, (10): 62-66.
[5] Zou Rong,Fan Aihong,Jiang Airong. Construction of the Academic Papers Management System with DSpace[J]. 现代图书情报技术, 2009, (10): 90-94.
[6] Zhang Yiyan,Du Weiwei, Gao Song. Design and Realization of Standing Order Management System[J]. 现代图书情报技术, 2009, (10): 86-89.
[7] Wu Zhenxin,Yao Fei,Gao Jianxiu,Sun Minjie. A Comprehensive Review of 2009 International Conference on Preservation of Digital Objects——Moving into the Mainstream, Enabling Our Digital Future[J]. 现代图书情报技术, 2009, (10): 1-6.
[8] Wang Jiandong. A Literature Review of Progress in Foreign Usability Research[J]. 现代图书情报技术, 2009, (9): 7-16.
[9] Zhu Zhongming,Ma Jianxia,Lu Linong,Li Fuqiang ,Liu Wei,Wu Denglu. Developing an Institutional Repository Platform via Extending DSpace[J]. 现代图书情报技术, 2009, 25(7-8): 11-17.
[10] Hak Lae Kim, Simon Scerri, John G.Breslin, Stefan Decker, Hong Gee Kim. The State of the Art in Tag Ontologies: A Semantic Model for Tagging and Folksonomies[J]. 现代图书情报技术, 2009, 3(3): 30-37.
[11] Chen Shiji,Shi Liwen,Zuo Wenge. Research of Compound Digital Object Under e-Science Environment[J]. 现代图书情报技术, 2009, 3(2): 33-38.
[12] Jiang Caihong,Qiao Xiaodong ,Zhu Lijun. Ontology-based Patent Abstracts' Knowledge Extraction[J]. 现代图书情报技术, 2009, 3(2): 23-28.
[13] Shao Zengrong,Li Ying,Fan Tijun. The Application of Regular Expressions in Online Oil Price Event[J]. 现代图书情报技术, 2009, 3(2): 83-88.
[14] Wu Zheng. Design and Implementation of Universal Mobile Phone Library System[J]. 现代图书情报技术, 2009, 3(1): 98-104.
[15] Li Feng,Li Chunwang. Study on Mashup Technology[J]. 现代图书情报技术, 2009, 3(1): 44-49.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn