New Technology of Library and Information Service  2010, Vol. 26 Issue (9): 37-41    DOI: 10.11925/infotech.1003-3513.2010.09.07
Research of Term Semantic Hierarchy Induction for Domain-specific Chinese Text Information Processing
Ji Peipei1,2, Yan Xiaoyan1, Cen Yonghua3,4, Wang Lingyan1,2
1. The Chengdu Branch of National Science Library, Chinese Academy of Sciences, Chengdu 610041, China;
2. Graduate University of Chinese Academy of Sciences, Beijing 100049, China;
3. School of Economics and Management, Nanjing University of Science and Technology, Nanjing 210094, China;
4. Department of Information Management, Nanjing University, Nanjing 210093, China
Abstract  

Identifying semantic relationships between terms is a key step in Chinese text information processing. After reviewing existing methods from China and abroad, this paper proposes a process for inducing a term semantic hierarchy: a multiple-clustering method builds the overall hierarchy, and a comprehensive similarity calculation assigns a label to each class. Experiments are conducted to verify the rationality of the approach.
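The abstract names neither the clustering algorithm nor the similarity formula, so the following is only a minimal sketch of the general idea, under stated assumptions: "multiple clustering" is read as repeated clustering, approximated here by a recursive, deterministic 2-means split on cosine similarity, and the "comprehensive similarity" labeling step is replaced by a simple stand-in that picks the term most similar on average to the rest of its class. The toy term vectors and all function names (`two_means`, `label`, `induce`) are hypothetical and do not come from the paper.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def two_means(terms, vecs, iters=10):
    """Split terms into two groups with a deterministic 2-means (assumed algorithm)."""
    seed_a = terms[0]
    # Second seed: the term least similar to the first one.
    seed_b = min(terms, key=lambda t: cosine(vecs[seed_a], vecs[t]))
    centers = [vecs[seed_a], vecs[seed_b]]
    groups = [[], []]
    for _ in range(iters):
        groups = [[], []]
        for t in terms:
            sims = [cosine(vecs[t], c) for c in centers]
            groups[0 if sims[0] >= sims[1] else 1].append(t)
        if not groups[0] or not groups[1]:
            break  # degenerate split; the caller treats the node as a leaf
        centers = [mean([vecs[t] for t in g]) for g in groups]
    return groups

def label(terms, vecs):
    """Stand-in labeling: the member with the highest total similarity to its class."""
    return max(terms, key=lambda t: sum(cosine(vecs[t], vecs[o]) for o in terms))

def induce(terms, vecs, min_size=2):
    """Recursively cluster and label, returning a nested dict hierarchy."""
    node = {"label": label(terms, vecs), "terms": sorted(terms), "children": []}
    if len(terms) > min_size:
        left, right = two_means(terms, vecs)
        if left and right:
            node["children"] = [induce(left, vecs, min_size),
                                induce(right, vecs, min_size)]
    return node

# Hypothetical toy data: each term is a small context-count vector.
vecs = {
    "database":   [5, 1, 0, 0],
    "index":      [4, 2, 0, 0],
    "query":      [4, 1, 1, 0],
    "clustering": [0, 0, 5, 2],
    "k-means":    [0, 1, 4, 3],
    "classifier": [1, 0, 3, 4],
}
tree = induce(list(vecs), vecs)
for child in tree["children"]:
    print(child["label"], child["terms"])
```

On this toy input the first split separates the database-related terms from the machine-learning-related ones, and each class is labeled with one of its own members. A real system would need term vectors extracted from a domain corpus and whatever similarity measure the paper actually defines.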

Key words: Term semantic hierarchy; Domain-specific text information processing; Term relationship
Received: 31 May 2010      Published: 26 October 2010
CLC number: TP391

Cite this article:

Ji Peipei, Yan Xiaoyan, Cen Yonghua, Wang Lingyan. Research of Term Semantic Hierarchy Induction for Domain-specific Chinese Text Information Processing. New Technology of Library and Information Service, 2010, 26(9): 37-41.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2010.09.07     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2010/V26/I9/37


