利用词语共现进行Ontology的概念获取

doi:10.11925/infotech.1003-3513.2006.02.10

现代图书情报技术

2006, Vol. 1

Issue (2): 43-45 https://doi.org/10.11925/infotech.1003-3513.2006.02.10

信息检索技术

本期目录 | 过刊浏览 | 高级检索

利用词语共现进行Ontology的概念获取

耿骞耿崇

（北京师范大学管理学院北京 100875）

Concept Extraction in Automatic OntologyConstruction Using Words Cooccurrence

Geng Qian Geng Chong

(School of Management, Beijing Normal University, Beijing 100875, China)

摘要
参考文献
相关文章
Metrics

全文:
输出: BibTeX | EndNote (RIS)

摘要

作为大规模的语义知识资源库，Ontology在信息处理中具有重要的作用。但是，如何有效地构建Ontology却是一个重要的问题。对于自动构建Ontology的过程来说，首要的问题就是如何获取领域概念。本文尝试了一种利用词语共现获取领域概念的方法，用于支持领域Ontology的构建。该方法首先通过人工领域分析，获得起始领域概念，然后利用起始概念从语料库中抽取共现的概念，从而获取相关的概念知识。同时，本文以1998年1月份的人民日报语料库为语料，针对外交和体育两个领域，尝试从中提取相关的概念，从而检验利用词语共现获取领域概念的实际效果。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	耿骞
	耿崇

关键词 ：词语共现, 概念提取, 本体构建

Abstract：

As a large semantic knowledge resource, Ontology plays an important role in information processing. However, how to construct an effective Ontology is an important problem to its application. The first issue in automatic Ontology creation is domain concepts acquisition. In this article we experiment on a method to obtain domain concepts which are based on lexical cooccurrence (and then to support the automatic Ontology construction). The first step of this method is to obtain the primary starting concepts by manual analysis, and then to extract relative co-occurrence concepts from the corpus. Based on the corpus of People’s Daily, January 1998, the article focuses especially on the fields of sports and diplomacy. We extract the relative concepts, to examine the practical results of co-occurrence-based domain concepts acquisition.

Key words： Lexical co-occurrence Concept extraction Ontology construction

收稿日期: 2005-10-27 出版日期: 2006-02-25

G250.7

通讯作者: 耿崇 E-mail: g123ch@163.com

作者简介: 耿骞,耿崇

引用本文:

耿骞,耿崇. 利用词语共现进行Ontology的概念获取[J]. 现代图书情报技术, 2006, 1(2): 43-45.
Geng Qian,Geng Chong. Concept Extraction in Automatic OntologyConstruction Using Words Cooccurrence. New Technology of Library and Information Service, 2006, 1(2): 43-45.

链接本文:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2006.02.10 或 https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2006/V1/I2/43

1Brian Roark and Eugene Charniak, Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction. In: Proceedings of ACL-98, Montreal, Quebec, Canada, 1998
2张敏, 面向自然语言检索的索引研究，北京师范大学硕士学位论文. 2004
3Gomez-Perez A., ManzanoMacho D. A Survey of Ontology Learning Methods and Techniques. Deliverable 15, OntoWeb Project, 2003
4耿骞，毛瑞.汉语自然语言检索中的词法分析处理.情报科学 2004.4
5Shamsfard M., Barforoush A. Learning ontologies from natural language text. International Journal of Human-Computer Studies 60 (1): 17-63, 2004
6Richardson, S.D., Dolan, W.B., Vanderwende, L. MindNet: Acquiring and Structuring Semantic Information from Text. Proceedings of the joint ACL and COLING conference, Montreal. 1998
7詹卫东. 面向自然语言处理的大规模语义知识库研究述要. 见：载徐波，孙茂松，靳光谨主编.中文信息处理若干重要问题. 北京：科学出版社，2003 .107~121

[1]	邓诗琦,洪亮. *面向智能应用的领域本体构建研究^——以反电话诈骗领域为例**[J]. 数据分析与知识发现, 2019, 3(7): 73-84.
[2]	陆佳莹,袁勤俭,黄奇,钱韵洁. 基于概念格理论的产品领域本体构建研究^*[J]. 现代图书情报技术, 2016, 32(5): 38-46.
[3]	颜时彦, 王胜清, 罗云川, 黄浩军. 云环境下基于FCA的领域本体协作构建模式初探[J]. 现代图书情报技术, 2014, 30(3): 49-56.
[4]	何金晶, 窦永香. 社会化标注系统中的本体研究综述[J]. 现代图书情报技术, 2013, (6): 16-22.
[5]	薛建武, 赵娜, 王东娜. 面向本体构建的叙词表词间关系细化和应用研究[J]. 现代图书情报技术, 2013, 29(3): 14-20.
[6]	李树青, 刘晓倩. 基于向心扩散加权XML模型的异构用户个性化模式匹配方法[J]. 现代图书情报技术, 2012, 28(5): 32-40.
[7]	刘萍, 胡月红. 基于FCA和关联规则的情报学本体构建[J]. 现代图书情报技术, 2012, 28(2): 34-40.
[8]	杨锐, 汤怡洁, 刘毅, 李崴. Web环境中的本体构建系统研究综析[J]. 现代图书情报技术, 2012, 28(1): 13-18.
[9]	滕广青, 毕强. 基于概念格的异构资源领域本体构建研究[J]. 现代图书情报技术, 2011, 27(5): 7-12.
[10]	张云中. 一种基于FCA和Folksonomy的本体构建方法[J]. 现代图书情报技术, 2011, 27(12): 15-23.
[11]	吴红李玉平胡泽文. 基于领域本体的专利信息检索系统研究与实现[J]. 现代图书情报技术, 2010, 26(6): 71-77.
[12]	夏立新,韩永青,张进. 基于本体的情报检索学科知识组织体系构建*[J]. 现代图书情报技术, 2008, 24(12): 80-85.
[13]	刘春艳,陈淑萍,伍玉成 . 基于SKOS的叙词表到本体的转换研究[J]. 现代图书情报技术, 2007, 2(5): 32-35.
[14]	聂卉,龙朝晖 . 描述逻辑语义推理机制的应用研究[J]. 现代图书情报技术, 2006, 1(11): 61-64.
[15]	李景,孟连生. 构建知识本体方法体系的比较研究[J]. 现代图书情报技术, 2004, 20(7): 17-22.

Viewed

Full text

Abstract

Cited

Shared

Discussed