Please wait a minute...
New Technology of Library and Information Service  2010, Vol. 26 Issue (4): 59-65    DOI: 10.11925/infotech.1003-3513.2010.04.10
article Current Issue | Archive | Adv Search |
Automatic Domain-specific Term Extraction in Administrative-domain Ontology
Zhai Dufeng1,Liu Baisong2
1 (School of Business, Ningbo University, Ningbo 315211, China)
2(Ningbo University Network Center, Ningbo 315211, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

This paper introduces a new method to extract the administrative-domain Ontology term automatically. Firstly, some words that are representative of the candidate terms should be extracted through the technology of word segmentation and the characters merger method. Secondly, the candidate terms are filtered by the way of C-value method and TF-IDF algorithm to achieve the automatic domain-specific term extraction in administrative-domain Ontology. Finally,the experiment shows that this method can improve the accuracy of the extracted terms and do not affect the recall-rate.

Key wordsAdministrative-domain Ontology        Terms        Characters merger method        C-value        TFIDF algorithm     
Received: 22 March 2010      Published: 25 April 2010
: 

TP391

 
Corresponding Authors: Zhai Dufeng     E-mail: zhaidufeng@126.com

Cite this article:

Zhai Dufeng,Liu Baisong. Automatic Domain-specific Term Extraction in Administrative-domain Ontology. New Technology of Library and Information Service, 2010, 26(4): 59-65.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2010.04.10     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2010/V26/I4/59

[1] eEurope 2002 Action Plan[EB/OL].(2000-06-20). [2010-02-05].http://ec.europa.eu/information _society/eeurope/2002/action_plan/pdf/actionplan_en.pdf.
[2] 电子政务主题词表编制与应用系统课题组.《综合电子政务主题词表》(试用本)范畴表[M].北京:科学技术文献出版社,2005.
[3] 温春,王晓斌,石昭祥.中文领域本体学习中术语的自动抽取[J].计算机应用研究,2009,27(7):2652-2655.
[4] Bourigault D.Surface Grammatical Analysis for the Extraction of Terminological Noun Phrases [C].In:Proceedings of the 14th International Conference on Computational Linguistics, Nantes, France.Morristown, NJ, USA:Association for Computational Linguistics,1992:977-981.
[5] Shamsfard M, Barforoush A  A .Learning Ontologies from Natural Language Texts [J].International Journal of Human-Computer Studies,2004,60(1):17-63.
[6] Frantzi K, Ananiadou S,Mima H.Automatic Recognition of Multi-Word Terms:the C-value/NC-value Method[J].International Journal on Digital Libraries, 2000,3(2):115-130.
[7] Justeson J S, Katz S M. Technical Terminology:Some Linguistic Properties and an Algorithm for Identification in Text[J].Natural Language Engineering,1995,1(1):9-27.
[8] Rezgui Y.Text-based Domain Ontology Building Using TF-IDF and Metric Clusters Techniques[J]. The Knowledge Engineering Review,2007,22(4):379-403.
[9] Pantel P,Lin D.A Statistical Corpus-based Term Extractor[C].In:Proceedings of the 14th Biennial Conference of the Canadian Society on Computational Studies of Intelligence:Advances in Artificial Intelligence. London, UK:Springer-Verlag,2001:36-46.
[10] 杜波,田怀凤,王立,等.基于多策略的专业领域术语抽取器的设计[J].计算机工程,2005,31(14):159-160.
[11] Missikoff M, Navigli R, Velardi P.Integrated Approach for Web Ontology Learning and Engineering[J].IEEE Computer,2002,35(11):60-63.
[12] Tsay Y T, Tsai W H.Attributed String Matching by Split-and-Merge for On-Line Chinese Character Recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,1993,15(2):180-185.
[13] 官莹莹.面向中文文本的本体学习方法研究[D]. 长春:吉林大学,2009.

[1] Sun Hongfei, Hou Wei. Application of Improved TFIDF Algorithm in Mining Potential Cooperation Relationship[J]. 现代图书情报技术, 2014, 30(10): 84-92.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn