Automatic Term Recognition——An Important Method for Text Mining on Scientific Literature

doi:10.11925/infotech.1003-3513.2008.08.02

New Technology of Library and Information Service

2008, Vol. 24

Issue (8): 12-17 DOI: 10.11925/infotech.1003-3513.2008.08.02

article

Current Issue | Archive | Adv Search

Automatic Term Recognition——An Important Method for Text Mining on Scientific Literature

Liu Jianhua ^1,2Zhang Zhixiong ¹Xu Jian ^1,2,3Xu Yandong¹

¹(National Science Library, Chinese Academy of Sciences, Beijing 100190, China)
²(Graduate University of the Chinese Academy of Sciences, Beijing 100049,China）
³(Department of Information Management，Sun Yat-Sen University, Guangzhou 510275,China）

Download:
Export: BibTeX | EndNote (RIS)

Abstract

Automatic Term Recognition(ATR) is a key process of knowledge technology such as knowledge extraction and text mining. To enrich the text mining theories and methods based on term recognition, support constructing related systems, it refers to some main existing methods for ATR, find key problems of the process. Through researches on related programs and systems, existing term resources, we could choose the best one for ourselves’ ATR system.

Key words： Automatic term recognition Term variation Term ambiguity

Received: 16 June 2008 Published: 25 August 2008

G250.73

Corresponding Authors: Liu Jianhua E-mail: liujh@mail.las.ac.cn

About author:: Liu Jianhua,Zhang Zhixiong,Xu Jian,Xu Yandong

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors

Cite this article:

Liu Jianhua,Zhang Zhixiong,Xu Jian,Xu Yandong. Automatic Term Recognition——An Important Method for Text Mining on Scientific Literature. New Technology of Library and Information Service, 2008, 24(8): 12-17.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2008.08.02 OR https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2008/V24/I8/12

［1］ Feldman R, Fresko M, Kinar Y, et al. Text Mining at the Term Level［J］. Lecture Notes In Computer Science, 1998:65-73.
［2］ Mima H, Ananiadou S, Nenadic G. The ATRACT Workbench:Automatic Term Recognition and Clustering for Terms［J］. Lecture Notes in Computer Science, 2001,2166:126-133.
［3］ Milios E, Zhang Y, et al. Automatic Term Extraction and Document Similarity in Special Text Corpora［C］. In： Proceeding of the 6th conference of the Pacific Association for Computational Linguistics,New York:ACM, 2003:275-284.
［4］ Love S. Benchmarking the Performance of Two Automated Term-Extraction Systems:LOGOS and ATAO［EB/OL］.［2008-04-03］.http://www.olst.umontreal.ca/pdf/memoirelove.pdf.
［5］ Kajikawa Y, Sugiyama Y. Causal Knowledge Extraction by Natural Language Processing in Material Science:A Case Study in Chemical Vapor Deposition［J］. Data Science Journal, 2006,5:108-118.
［6］ Jensen L J, Saric J, Bork P.Literature Mining for the Biologist:from Information Retrieval to Biological Discovery［J］. Nature Reviews (Genetics), 2006,7:119-129.
［7］ Krauthammer M, Nenadic G. Term Identification in the Biomedical Literature［J］. Journal of Biomedical Informatics, 2004,37(6):512-526.
［8］ Asunción Gómez-Pérez, David Manzano-MachoA Survey of Ontology Learning Methods and Techniques ［EB/OL］.［2008-06-05］. http://www.sti-innsbruck.at/fileadmin/documents/deliverables/Ontoweb/D1.5.pdf.
［9］ Term versus Word［EB/OL］.［2008-02-24］. http://www.termiumplus.gc.ca/didacticiel_tutorial/english/lesson1/page1_2_4_e.html.
［10］ Alegria I, Arregi O, Balza I. Linguistic and Statistical Approaches to Basque Term Extraction［EB/OL］.［2008-2-24］. http://ixa.is.ehu.es.
［11］于卫. 自动中文术语识别若干方法研究［D］. 哈尔滨：哈尔滨工业大学，2004.
［12］ Ananiadou S, Nenadic G. Automatic Terminology Management in Biomedicine［M］. Text Mining for Biology and Biomedicine, UK:Artech House Publishers, 2006.
［13］ Buitelaar P, Cimiano P, Grobelnik M. Ontology Learning from Text［C］.In:the ECML/PKDD 2005 Workshop on:Knowledge Discovery and Ontologies, Porto, Protugal, 2005.
［14］ Olena Medelyna. Automatic Keyphrase Indexing with a Domain-Specific Thesaurus［D］. Germany：University of Freiburg, 2005.
［15］ TerMine Plugin for Protege 4 ［EB/OL］.［2008-4-3］. http://www.co-ode.org/downloads/protege-x/plugins/termine-docs.pdf.
［16］张榕.术语定义抽取、聚类与术语识别研究［D］.北京：北京语言大学，2006.
［17］ TerMine［EB/OL］.［2008-04-03］. http://www.nactem.ac.uk/software/termine/
［18］ Cheshire3-Termine Demonstration using Medline Abstracts［EB/OL］.［2008-04-03］.http://www.nactem.ac.uk/software/ctermine/.
［19］穗志方等.信息科学与技术领域术语自动提取研究［C］.见：第五届东亚术语论坛，2002.
［20］ UMLS［EB/OL］.［2008-04-03］. http://www.nlm.nih.gov/research/umls/.
［21］ Liu H,Johnson S B, Friedman C. Automatic Resolution of Ambiguous Terms Based on Machine Learning and Conceptual Relations in the UMLS［J］. Journal of the American Medical Associations, 2002,9(6):621-636.
［22］ Harkema H, Gaizauskas R, Mark H, et al. A Large Scale Terminology Resource for Biomedical Text Processing. Linking Biological Literature［J］, Ontologies and Databases, 2004(6):53-60.
［23］俞士汶，于江生.中文概念词典的结构［J］.中文信息学报，2002,16（4）:12-20.
［24］ Zan H, Duan G, Fan M. Single World Term Extraction Using a Bilingual Semantic Lexicon-based Approach［C］. In:Third International Conference on Natural Computation, ICNC:IEEE Computer Society, 2007:451-456.

[1]	Qiao Jianzhong. An Improved Best-First Search Algorithm Based Focused Crawling Research[J]. 现代图书情报技术, 2013, 29(7/8): 28-35.
[2]	Qiao Jianzhong. Statistical Characteristics Based Web Page Relevance Judgment Strategy for the “Type” Topics Crawled[J]. 现代图书情报技术, 2012, 28(6): 9-16.
[3]	Xu Shuwei. Relevant Judgment Strategy of Synchronous Collaborative Retrieval Results[J]. 现代图书情报技术, 2012, 28(4): 41-47.
[4]	Xing Meifeng. Study on Solution to Redundancy of Scientific Literature Keywords[J]. 现代图书情报技术, 2012, 28(1): 34-39.
[5]	Xia Yuhua, Sun Jiande, Qi Jingtao. Key Frame Extraction on Quick Browsing Approaches to Academic Videos in Library[J]. 现代图书情报技术, 2011, 27(10): 40-44.
[6]	Xu Jian. A Term Similarity Algorithm Based on Context Dependency Relation Pattern[J]. 现代图书情报技术, 2011, 27(9): 28-33.
[7]	Qiao Jianzhong. Anchor and Link Text Expansion Based KBES Algorithm Tunneling Strategy[J]. 现代图书情报技术, 2011, 27(3): 45-50.
[8]	Xu Jian Zhang Zhixiong Xiao Zhuo Deng Zhaojun. Review on Scientific and Technical Term Semantic Similarity Measure Methods[J]. 现代图书情报技术, 2010, 26(7/8): 51-57.
[9]	Liu Jianhua ,Zhang Zhixiong. Relation Extraction Based on Stanford Parser[J]. 现代图书情报技术, 2009, 25(5): 1-5.
[10]	Yin Shumei,Zhang Zhixiong,Wu Zhenxin. A Method for Automatic Keyword Extraction and Filtration from Medical Texts[J]. 现代图书情报技术, 2008, 24(8): 31-36.
[11]	Zhang Zhixiong,Wu Zhenxin,Liu Jianhua,Xu Jian,Hong Na,Zhao Qi. Analysis of State-of-the-Art Knowledge Extraction Technologies[J]. 现代图书情报技术, 2008, 24(8): 2-11.
[12]	Xu Jian,Zhang Zhixiong,Wu Zhenxin. Review on Techniques of Entity Relation Extraction[J]. 现代图书情报技术, 2008, 24(8): 18-23.
[13]	Liu Xiaojuan. Study on Formalizing Indicators in Automated Website Evaluation[J]. 现代图书情报技术, 2008, 24(4): 61-65.
[14]	Wei Maoqian,Xie Jing,Ma Ziwei. Design and Implementation of Union Search and Extended Service System Based on Lightweight Infrastructure[J]. 现代图书情报技术, 2007, 2(11): 19-22.
[15]	Jiao Yuying,Cheng Quan. Research on Integrated Services of Knowledge Grid Based on Ontology[J]. 现代图书情报技术, 2007, 2(8): 6-11.

Viewed

Full text

Abstract

Cited

Shared

Discussed