Please wait a minute...
Advanced Search
现代图书情报技术  2014, Vol. 30 Issue (4): 7-13    DOI: 10.11925/infotech.1003-3513.2014.04.02
  数字图书馆 本期目录 | 过刊浏览 | 高级检索 |
机构知识库语义知识获取方法分析及实验研究
王思丽, 祝忠明, 姚晓娜
中国科学院国家科学图书馆兰州分馆 兰州 730000
Analysis and Experimental Research on Method of Semantic Knowledge Acquisition for Institutional Repository
Wang Sili, Zhu Zhongming, Yao Xiaona
The Lanzhou Branch of National Science Library, Chinese Academy of Sciences, Lanzhou 730000, China
全文: PDF(908 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 

[目的] 通过分析总结和实验研究,提出并形成一种有效的语义知识获取方法,为实现机构知识库的语义化提供理论基础和可行技术路线。[方法] 对国内外的语义知识获取方法进行对比分析,提出机构知识库语义知识获取的体系框架,并总结和深度解析其关键技术;同时,以中国科学院机构知识库平台为例进行实验研究。[结果] 该方法可有效地从机构知识库底层的关系数据库的数据和实体关系结构中自动获取语义知识信息并转化为RDF三元组形式进行浏览和查询。[局限] 定义一个合理有效的语义映射规则,需要经过领域专家评估、较多的人工干预以及反复实验才能确定;不同机构知识库间同一实体对象的语义知识获取关联没有涉及。[结论] 有利于帮助后续研究人员和机构知识库开发人员更好地了解和掌握机构知识库语义知识获取的方法和关键技术,从而为提升机构知识库的服务能力奠定基础。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
王思丽
祝忠明
姚晓娜
关键词 机构知识库语义映射知识获取ER模式    
Abstract

[Objective] The paper proposes and forms an effective method of semantic knowledge acquisition through analysis, summary and experiment, in order to provide theoretical principle and possible technological route for the semantization of Institutional Repository. [Methods] Based on the contrastive analysis of methods of semantic knowledge acquisition both at home and abroad, the paper proposes a system framework of semantic knowledge acquisition for Institutional Repository, and sums up its key technologies for deep analysis and then takes the CAS IR GRID for an experimental study. [Results] This method can automatically and effectively acquire semantic knowledge information from data and entity relationship structure of relational database of underlying Institutional Repository and convert it into RDF triples for browse and search. [Limitations] To define a reasonable and effective mapping rule may need domain expert evaluation, more manual intervention and repeated experiments. The semantic knowledge acquisition and relevance study for the same entity object between different Institutional Repository is not involved in this paper. [Conclusions] This study may better help follow-up researchers and developers quickly understand and master the method and key technologies of semantic knowledge acquisition, then lay the foundations for enhancing knowledge service capabilities of Institutional Repository.

Key wordsInstitutional Repository    Semantic mapping    Knowledge acquisition    ER mode
收稿日期: 2013-11-13     
:  G250  
基金资助:

本文系中国科学院国家科学图书馆兰州分馆业务领域前瞻项目“知识资源语义化组识、技术集成与开放服务的趋势扫描”(项目编号:1500013004)的研究成果之一

通讯作者: 王思丽 E-mail:wangsl@llas.ac.cn     E-mail: wangsl@llas.ac.cn
作者简介: 作者贡献声明:王思丽,祝忠明:提出研究思路,设计研究方案; 王思丽,姚晓娜:进行实验;从IRGRID中采集、抽取和分析数据; 王思丽:论文起草和最终版本修订。
引用本文:   
王思丽, 祝忠明, 姚晓娜. 机构知识库语义知识获取方法分析及实验研究[J]. 现代图书情报技术, 2014, 30(4): 7-13.
Wang Sili, Zhu Zhongming, Yao Xiaona. Analysis and Experimental Research on Method of Semantic Knowledge Acquisition for Institutional Repository. New Technology of Library and Information Service, DOI:10.11925/infotech.1003-3513.2014.04.02.
链接本文:  
http://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2014.04.02

[1] 董金祥.基于语义面向服务的知识管理与处理[M].杭州:浙江大学出版社,2009:111-115.(Dong Jinxiang.Semantics-Based Services-Oriented Knowledge Management and Proce­ssing[M].Hangzhou:Zhejiang University Press,2009:111-115.)
[2] 候筱婷.基于数据仓库、OLAP和数据挖掘技术的数据分析、展现与预测[D].西安:西安电子科技大学,2007.(Hou Xiaoting.Data Analysis,Exhibition and Prediction Based on Data Warehouse,OLAP and Data Mining Technologies[D].Xi'an:Xidian University,2007.)
[3] Hammer J,McHugh J,Garcia-Molina H.Semistructured Data:the TSIMMIS Experience[C].In:Proceedings of the 1st East-European Workshop on Advances in Database and Information Systems(ADBI'97).UK:British Computer Society Swinton,1997:22-30.
[4] Soderland S.Learning Information Extraction Rules for Semi-Structured and Free Text[J].Machine Learning,1999,34(1-3):233-272.
[5] Etzioni O,Cafarella M,Downey D,et al.Unsupervised Named-Entity Extraction from the Web:An Experimental Study[J].Artificial Intelligence,2005,165(1):91-134.
[6] Ashraf F,Alhajj R.CluxTex:Information Extraction from HTML Pages[C].In:Proceedings of the IEEE 21st International Conference on Advanced Information Networking and Applications Workshops.Niagara Falls:IEEE,2007:355-360.
[7] Cheng C K,Pan X S,Kurfess F.Ontology-based Semantic Classification of Unstructured Documents[C].In:Proceedings of the 1st International Workshop on AMR 2003.2004:120-131.
[8] Khasawneh N,Chan C C.Active User-based and Ontology-based Web Log Data Preprocessing for Web Usage Mining[C].In:Proceeding of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence.Hong Kong,China:IEEE,2006:325-328.
[9] Volz R,Handschuh S,Staab S,et al.Unveiling the Hidden Bride:Deep Annotation for Mapping and Migrating Legracy Data to the Semantic Web[J].Journal of Web Semantics,2004,11(1):187-206.
[10] Astroval I.Reverse Engineering of Relational Databases to Ontologies[C].In:Proceedings of the 1st European Semantic Web Symposium.Berlin:Springer,2004:327-341.
[11] Xu Z M,Zhang S C,Dong Y S.Mapping between Relational Database Schema and OWL Ontology for Deep Annotation[C].In:Proceeding of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence.Hong Kong,China:IEEE,2006:548-552.
[12] Bizer C,Cyganiak R.D2R Server-publishing Relational Databases on the Semantic Web[C].In:Proceedings of the 5th International Semantic Web Conference.2006.
[13] R2RML:RDB to RDF Mapping Language[EB/OL].(2012-09-27).[2013-07-10].http://www.w3.org/TR/r2rml/.
[14] 中国科学院机构知识库服务网格[EB/OL].[2013-08-01].http://www.irgrid.ac.cn/.(CAS IR GRID[EB/OL].[2013-08-01].http://www.irgrid.ac.cn/.)
[15] DataMaster[EB/OL].[2013-06-20].http://eulergui.sourceforge.net/documentation.html.

[1] 张旺强,祝忠明,李雅梅,卢利农,刘巍. 机构知识库作者名自动消歧框架设计与实践*[J]. 数据分析与知识发现, 2019, 3(6): 92-98.
[2] 吴志强,祝忠明,刘巍,王思丽. CSpace知识分析与可视化功能扩展研究与实践*[J]. 数据分析与知识发现, 2019, 3(3): 112-119.
[3] 李静,刘潇,王效俐. 邻域粗糙集融合网格搜索组合分类器的理财决策知识获取研究*[J]. 数据分析与知识发现, 2019, 3(1): 85-94.
[4] 羊柳,傅柱,王曰芬. 概念设计中的设计过程知识获取研究*[J]. 数据分析与知识发现, 2018, 2(2): 29-36.
[5] 吴志强,祝忠明,姚晓娜,王思丽. CSpace机构知识库影音资源支持能力扩展研究与实践*[J]. 数据分析与知识发现, 2017, 1(9): 90-96.
[6] 王思丽,刘巍,祝忠明,吴志强,王金平. 基于CSpace的科技信息可配置化自动监测功能设计与实现*[J]. 数据分析与知识发现, 2017, 1(10): 85-93.
[7] 吴志强,祝忠明,刘巍,张旺强,姚晓娜. 机构知识库三维模型检索与展示技术研究与实践*[J]. 数据分析与知识发现, 2017, 1(1): 73-80.
[8] 张旺强,祝忠明,姚晓娜,刘巍. 基于开放获取论文推送转发服务系统iSwitch的机构知识库内容建设*[J]. 现代图书情报技术, 2016, 32(4): 91-96.
[9] 钱力, 师洪波, 张晓林, 梁娜. 开放获取论文推送转发服务系统iSwitch: 论文分发推送[J]. 现代图书情报技术, 2015, 31(6): 7-12.
[10] 严潮斌, 陈嘉勇, 侯瑞芳, 李玲, 周婕. 查收查引服务支撑需求驱动下的高校机构知识库建设[J]. 现代图书情报技术, 2015, 31(5): 94-100.
[11] 白海燕. ORCID在机构知识库中的整合介绍[J]. 现代图书情报技术, 2015, 31(3): 8-17.
[12] 谷威, 李超凡, 王洪俊, 肖诗斌, 施水才. 专利检索日志的同义词获取[J]. 现代图书情报技术, 2015, 31(2): 24-30.
[13] 赵瑞雪, 杜若鹏. 中国农业科学院机构知识库的实践探索[J]. 现代图书情报技术, 2015, 31(2): 72-77.
[14] 张晓丹, 乔晓东, 顾立平, 姚长青, 初景利. 中国学术期刊对机构知识库存缴政策调查分析[J]. 现代图书情报技术, 2014, 30(6): 1-7.
[15] 姚晓霞, 聂华, 顾立平, 张冬荣, 吴越, 韦成府. 我国教育科研机构知识库建设现状调查与分析[J]. 现代图书情报技术, 2014, 30(5): 1-9.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn