As a large semantic knowledge resource, Ontology plays an important role in information processing. However, how to construct an effective Ontology is an important problem to its application. The first issue in automatic Ontology creation is domain concepts acquisition. In this article we experiment on a method to obtain domain concepts which are based on lexical cooccurrence (and then to support the automatic Ontology construction). The first step of this method is to obtain the primary starting concepts by manual analysis, and then to extract relative co-occurrence concepts from the corpus. Based on the corpus of People’s Daily, January 1998, the article focuses especially on the fields of sports and diplomacy. We extract the relative concepts, to examine the practical results of co-occurrence-based domain concepts acquisition.
耿骞,耿崇. 利用词语共现进行Ontology的概念获取[J]. 现代图书情报技术, 2006, 1(2): 43-45.
Geng Qian,Geng Chong. Concept Extraction in Automatic OntologyConstruction Using Words Cooccurrence. New Technology of Library and Information Service, 2006, 1(2): 43-45.
1Brian Roark and Eugene Charniak, Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction. In: Proceedings of ACL-98, Montreal, Quebec, Canada, 1998
2张敏, 面向自然语言检索的索引研究，北京师范大学硕士学位论文. 2004
3Gomez-Perez A., ManzanoMacho D. A Survey of Ontology Learning Methods and Techniques. Deliverable 15, OntoWeb Project, 2003
5Shamsfard M., Barforoush A. Learning ontologies from natural language text. International Journal of Human-Computer Studies 60 (1): 17-63, 2004
6Richardson, S.D., Dolan, W.B., Vanderwende, L. MindNet: Acquiring and Structuring Semantic Information from Text. Proceedings of the joint ACL and COLING conference, Montreal. 1998
7詹卫东. 面向自然语言处理的大规模语义知识库研究述要. 见：载徐波，孙茂松，靳光谨主编.中文信息处理若干重要问题. 北京：科学出版社，2003 .107~121