New Technology of Library and Information Service, 2011, Vol. 27, Issue 4: 29-34. DOI: 10.11925/infotech.1003-3513.2011.04.05
Study on Term Extraction on the Basis of Chinese Domain Texts
Gu Jun1,2, Wang Hao1
1. Department of Information Management, Nanjing University, Nanjing 210093, China;
2. Baoshan Iron and Steel Company Ltd., Shanghai 201900, China
Abstract: Building on ICTCLAS dictionary-based word segmentation, this paper proposes a method that extracts candidate concept terms from Chinese patent texts using maximum matching and frequency statistics, then weights the candidates by TF-IDF to select the final concept terms. The method is evaluated through extraction experiments on sample data.
Key words: Ontology; Concept extraction; Maximum matching and frequency statistics; TF-IDF; Chinese word segmentation
Received: 10 February 2011      Published: 11 June 2011
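
The TF-IDF weighting step named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `tfidf_scores`, the input layout (each document as a list of already-segmented candidate terms), and the plain `tf * log(N / df)` formulation are all assumptions made for illustration. The paper may use a different TF-IDF variant.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Compute per-document TF-IDF weights for candidate terms.

    docs: list of documents, each a list of candidate terms that have
    already been segmented (hypothetical input layout, assumed here).
    Returns one dict per document mapping term -> weight.
    """
    n_docs = len(docs)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for doc in docs:
        df.update(set(doc))

    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        # tf = relative frequency within the document,
        # idf = log(N / df); a term found in every document gets weight 0.
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


# Toy example with two short "documents" of candidate terms.
docs = [["钢板", "钢板", "方法"], ["方法", "装置"]]
scores = tfidf_scores(docs)
```

In this toy input, a term that appears in every document (such as "方法") receives a weight of zero, while a term concentrated in one document is ranked higher. That is the property that makes TF-IDF useful for filtering general vocabulary out of domain term candidates.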



Gu Jun, Wang Hao. Study on Term Extraction on the Basis of Chinese Domain Texts. New Technology of Library and Information Service, 2011, 27(4): 29-34.

