基于聚类算法的本体层次关系获取研究

doi:10.11925/infotech.1003-3513.2011.12.07

现代图书情报技术

2011, Vol. 27

Issue (12): 46-51 https://doi.org/10.11925/infotech.1003-3513.2011.12.07

知识组织与知识管理

本期目录 | 过刊浏览 | 高级检索

基于聚类算法的本体层次关系获取研究

谷俊^1,2, 朱紫阳³

1. 南京大学信息管理系南京 210093;
2. 上海宝山钢铁股份有限公司上海 201900;
3. 南京信息工程大学图书馆南京 210044

Study on Ontology Hierarchy Relation Induction on Clustering Algorithm

Gu Jun^1,2, Zhu Ziyang³

1. Department of Information Management, Nanjing University, Nanjing 210093, China;
2. Baoshan Iron and Steel Company Ltd., Shanghai 201900, China;
3. Library of Nanjing University of Information Science and Technology, Nanjing 210044, China

摘要
参考文献
相关文章
Metrics

全文: PDF (533 KB) HTML
输出: BibTeX | EndNote (RIS)

摘要提出利用蚁群聚类方法进行初始聚类,通过K-means聚类算法对初始聚类的结果进一步分层聚类,并结合术语综合相似度计算的方式提取每个类的标签,从而完成术语层次关系的构建。最后抽取部分实验结果,由领域专家对其进行评价,并对结果进行分析。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	谷俊
	朱紫阳

关键词 ：本体, 语义层次, 蚁群算法, 聚类

Abstract：This paper proposes a method,which clusters the initial terms collection by ant colony algorithm and clusters the results hierarchy by K-means algorithm, then gets the labels of classes using the comprehensive similarity calculation, finishes the term hierarchy relation’s structure at last. Parts of experimental results are appraised and analyzed by domain experts.

Key words： Ontology Semantic hierarchy Ant colony algorithm Clustering

收稿日期: 2011-10-20 出版日期: 2012-02-02

TP391

基金资助:

本文系国家社会科学基金项目“面向语义网本体的知识管理研究”(项目编号:09CTQ010) 的研究成果之一。

引用本文:

谷俊, 朱紫阳. 基于聚类算法的本体层次关系获取研究[J]. 现代图书情报技术, 2011, 27(12): 46-51.
Gu Jun, Zhu Ziyang. Study on Ontology Hierarchy Relation Induction on Clustering Algorithm. New Technology of Library and Information Service, 2011, 27(12): 46-51.

链接本文:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2011.12.07 或 https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2011/V27/I12/46

[1] Berners-Lee T, Hendler J,Lassila O. The Semantic Web[J]. Scientific American, 2001 (5): 28-37.

[2] Ying D, Schubea F. Ontology Research and Development:Part I-A Review of Ontology Generation [J]. Journal of Information Science, 2002, 28(2):123-136.

[3] Harris Z S. Mathematical Structures of Language[M]. New York:Wiley, 1968.

[4] Carbalb S A. Automatic Construction of a Hypemym-labeled Noun Hierarchy from Text[C].In:Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, Maryland.1999:120-126.

[5] Fisher D H. Knowledge Acquisition via Incremental Conceptual Clustering[J]. Machine Learning,1987,2(2):139-172.

[6] Cimiano P, Staab S, Tane J. Automatic Acquisition of Taxonomies from Text FCA Meets NLP[C]. In:Proceedings of the International Workshop on Adaptive Text Extraction and Mining, Seattle,USA.2003:301-309.

[7] 马辉民, 李卫华, 吴良元. VSM在中文文本聚类中的应用及实证分析[J]. 武汉理工大学学报:信息与管理工程版,2006, 28(4): 56-59.

[8] 乐兵,王明文. 基于遗传算法的动态文本聚类[J]. 江西师范大学学报:自然科学版,2006, 30(3): 278-281.

[9] 龚静, 李安民. 一种改进的K-means中文文本聚类算法[J]. 湖南工业大学学报,2008,22(2): 52-54.

[10] 王刚,钟国祥. 一种基于本体相似度计算的文本聚类算法研究[J]. 计算机科学,2010, 37(9): 222-224.

[11] 温春,石昭祥,杨国正. 一种利用度属性获取本体概念层次的方法[J]. 小型微型计算机系统, 2010(2): 322-326.

[12] 季培培,鄢小燕,岑咏华,等. 面向领域中文文本信息处理的术语语义层次获取研究[J]. 现代图书情报技术,2010(9): 37-41.

[13] 余永红,柏文阳. 基于特征项权重自动分解的文本聚类[J]. 计算机工程,2011, 37(11): 25-27.

[14] 谷俊,王昊.基于领域中文文本的术语抽取方法研究[J]. 现代图书情报技术,2011(4): 29-34.

[15] TF-IDF[EB/OL].[2011-11-13]. http://zh.wikipedia.org/zh-cn/TF-IDF.

[16] Cosine Similarity[EB/OL].[2011-11-13]. http://en.wikipedia.org/wiki/Cosine_similarity.

[17] Dorigo M, Blum C. Ant Colony Optimization Theory:A Survey[J]. Theoretical Computer Science, 2005,344(2-3):243-278.

[18] Deneubourg J L, Goss S, Franks N, et al. The Dynamics of Collective Sorting Robot-like Ants and Ant-like Robots[C]. In:Proceedings of the International Conference on Simulation of Adaptive Behavior: From Animals to Animates.1991:356-363.

[19] 段海滨. 蚁群算法原理及其应用[M].北京:科学出版社, 2005.

[20] Lumer E, Faiea B. Diversity and Adaption in Populations of Clustering Ants[C]. In:Proceedings of the 3rd International Conference on Simulation of Adaptive Behavior. Cambridge, MA: MIT Press,1994:501-508.

[21] Alan L P, Scott W C. Tech Mining[M]. New Jersey: John Wiley & Sons, Inc., 2005.

[1]	王若琳, 牛振东, 蔺奇卡, 朱一凡, 邱萍, 陆浩, 刘东磊. 基于异质信息嵌入与RNN聚类参数预测的作者姓名消歧方法^*[J]. 数据分析与知识发现, 2021, 5(8): 13-24.
[2]	王晰巍,贾若男,韦雅楠,张柳. 多维度社交网络舆情用户群体聚类分析方法研究^*[J]. 数据分析与知识发现, 2021, 5(6): 25-35.
[3]	卢利农,祝忠明,张旺强,王小春. 基于Lingo3G聚类算法的机构知识库跨库知识整合与知识指纹服务实现[J]. 数据分析与知识发现, 2021, 5(5): 127-132.
[4]	张梦瑶, 朱广丽, 张顺香, 张标. 基于情感分析的微博热点话题用户群体划分模型 ^*[J]. 数据分析与知识发现, 2021, 5(2): 43-49.
[5]	盛姝, 黄奇, 杨洋, 解绮雯, 秦新国. HL7 FHIR框架下中国医疗领域信息交换研究与解决方案[J]. 数据分析与知识发现, 2021, 5(11): 13-28.
[6]	丁浩, 艾文华, 胡广伟, 李树青, 索炜. 融合用户兴趣波动时序的个性化推荐模型^*[J]. 数据分析与知识发现, 2021, 5(11): 45-58.
[7]	杨辰, 陈晓虹, 王楚涵, 刘婷婷. 基于用户细粒度属性偏好聚类的推荐策略^*[J]. 数据分析与知识发现, 2021, 5(10): 94-102.
[8]	于丰畅,程齐凯,陆伟. 基于几何对象聚类的学术文献图表定位研究[J]. 数据分析与知识发现, 2021, 5(1): 140-149.
[9]	温萍梅,叶志炜,丁文健,刘颖,徐健. 命名实体消歧研究进展综述^*[J]. 数据分析与知识发现, 2020, 4(9): 15-25.
[10]	曾桢,李纲,毛进,陈璟浩. 区域公共安全数据治理与业务领域本体研究^*[J]. 数据分析与知识发现, 2020, 4(9): 41-55.
[11]	邬金鸣,侯跃芳,崔雷. 基于医学主题词标引规则的词共现聚类分析结果自动判读和表达的研究[J]. 数据分析与知识发现, 2020, 4(9): 133-144.
[12]	席运江, 杜蝶蝶, 廖晓, 仉学红. 基于超网络的企业微博用户聚类研究及特征分析*[J]. 数据分析与知识发现, 2020, 4(8): 107-118.
[13]	杨旭,钱晓东. 基于改进的Vicsek模型的社会网络同步聚类算法*[J]. 数据分析与知识发现, 2020, 4(4): 119-128.
[14]	熊回香,李晓敏,李跃艳. 基于图书评论属性挖掘的群组推荐研究*[J]. 数据分析与知识发现, 2020, 4(2/3): 214-222.
[15]	魏家泽,董诚,何彦青,刘志辉,彭柯芸. 基于均衡段落和分话题向量的新闻热点话题检测研究^*[J]. 数据分析与知识发现, 2020, 4(10): 70-79.

Viewed

Full text

Abstract

Cited

Shared

Discussed