Please wait a minute...
Advanced Search
现代图书情报技术  2013, Vol. 29 Issue (9): 15-22     https://doi.org/10.11925/infotech.1003-3513.2013.09.03
  数字图书馆 本期目录 | 过刊浏览 | 高级检索 |
基于MapReduce的书目数据关联匹配研究
虞为1, 陈俊鹏2
1. 南京大学信息管理学院 南京 210093;
2. 南京财经大学信息工程学院 南京 210023
Linking and Mapping of Library Catalogue Data Based on MapReduce
Yu Wei1, Chen Junpeng2
1. School of Information Management, Nanjing University, Nanjing 210093, China;
2. School of Information Engineering, Nanjing University of Finance and Economics, Nanjing 210023, China
全文: PDF (1137 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 提出一个基于MapReduce的书目数据关联匹配架构,通过参引MODS本体将MARC格式的书目数据转换成关联数据格式。再通过对书目数据和书目数据间的关联匹配,以及书目数据和开放关联社区其他的关联数据间的匹配初步实现书目数据和其他关联数据集间的语义关联,使关联的书目数据成为关联开放数据社区中的一部分,为图书馆的知识发现和语义检索服务提供有效的语义数据支持。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
陈俊鹏
虞为
关键词 MapReduce关联匹配书目数据关联数据    
Abstract:In this paper, the MARC data is transformed to linked data, based on MapReduce model and MODS Onto-logy. Through the mapping among different linked open data sets, the library catalogue data can become part of the linked open data community and provide efficient semantic data to knowledge discovery and semantic service.
Key wordsMapReduce    Mapping and linkage    Catalogue data    Linked data
收稿日期: 2013-06-06      出版日期: 2013-09-27
:  G254 TP18  
基金资助:本文系国家自然科学基金项目“面向知识服务的知识组织模式与应用研究”(项目编号:71273126)和国家社会科学基金项目“基于关联数据的图书馆语义云服务研究”(项目编号:12CTQ009)的研究成果之一。
通讯作者: 虞为     E-mail: yuw.nju@gmail.com
引用本文:   
虞为, 陈俊鹏. 基于MapReduce的书目数据关联匹配研究[J]. 现代图书情报技术, 2013, 29(9): 15-22.
Yu Wei, Chen Junpeng. Linking and Mapping of Library Catalogue Data Based on MapReduce. New Technology of Library and Information Service, 2013, 29(9): 15-22.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2013.09.03      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2013/V29/I9/15
[1] Heath T, Bizer C. Linked Data: Evolving the Web into a Global Data Space [M]. The 1st Edition.Morgan & Claypool Publishers, 2011.
[2] Bizer C, Heath T, Idehen K, et al. Linked Data on the Web [C]. In: Proceedings of WWW2008, Beijing, China. 2008: 1265-1266.
[3] The GeoNames Geographical Database[EB/OL].[2013-07-12]. http://www.geonames.org/.
[4] Samwald M, Jentzsch A, Bouton C, et al. Linked Open Drug Data for Pharmaceutical Research and Development [J]. Journal of Cheminformatics, 2011, 3(1): 19.
[5] Bizer C, Lehmann J, Kobilarov G, et al. DBpedia - A Crystallization Point for the Web of Data [J]. Web Semantics: Science, Services and Agents on the World Wide Web, 2009,7 (3): 154-165.
[6] Linking Open Government Data [EB/OL].[2013-07-12]. http://logd.tw.rpi.edu/.
[7] Malmsten M. Making a Library Catalogue Part of Semantic Web [C]. In: Proceedings of the 2008 International Conference on Dublin Core and Metadata Applications.2008: 146-152.
[8] Summers E, Isaac A, Redding C, et al. LCSH, SKOS and Linked Data [C]. In: Proceedings of the 2008 International Conference on Dublin Core and Metadata Applications.2008: 25-33.
[9] OCLC- WorldCat [EB/OL]. [2013-05-20]. http://www.worldcat.org/.
[10] 黄华军,曾新红,林伟明. OTCSS关联数据服务的研究与实现[J]. 现代图书情报技术,2012 (7/8): 40- 47.(Huang Huajun, Zeng Xinhong, Lin Weiming. Research and Implementation about Linked Data Service of OTCSS[J]. New Technology of Library and Information Service, 2012 (7/8):40-47.)
[11] 欧石燕.面向关联数据的语义数字图书馆资源描述与组织框架设计与实现[J]. 中国图书馆学报,2012, 38(6):58-71. (Ou Shiyan. Design and Implementation of a Linked Data-oriented Framework for Resource Description and Organization in Semantic Digital Libraries [J]. Journal of Library Science in China, 2012, 38 (6):58-71.)
[12] 夏翠娟,刘炜,赵亮,等. 关联数据发布技术及其实现——以Drupal为例[J]. 中国图书馆学报,2012, 38(1):49-57. (Xia Cuijuan,Liu Wei,Zhao Liang, et al. The Current Technologies and Tools for Linked Data: A Case of Drupal [J]. Journal of Library Science in China, 2012, 38 (1):49-57.)
[13] 白海燕,朱礼军.关联数据的自动关联构建研究[J]. 现代图书情报技术,2010(2): 44-49. (Bai Haiyan, Zhu Lijun. Research on Automatic Interlinking of Linked Data[J]. New Technology of Library and Information Service, 2010(2): 44-49.)
[14] DBLP [EB/OL]. [2013-05-20]. http://dblp.uni-trier.de/.
[15] Moller K, Heath T, Handschuh S,et al. Recipes for Semantic Web Dog Food: The ESWC and ISWC Metadata Projects [C]. In: Proceedings of the 6th International Semantic Web Conference and the 2nd Asian Semantic Web Conference. Berlin, Heidelberg: Springer-Verlag,2007: 802-815.
[16] The Linking Open Data Cloud Diagram[EB/OL]. [2013-05-20]. http://lod-cloud.net/.
[17] DuraCloud [EB/OL]. [2013-05-20]. http://www.duracloud.org/.
[18] Dean J, Ghemawat S. MapReduce: Simplified Data Processing on Large Clusters [C]. In: Proceedings of the 6th Symposium on Operating Systems Design and Implementation, 2004.
[19] Bizer C. The Emerging Web of Linked Data [J]. IEEE Intelligent Systems, 2009, 24(5):87-92.
[20] Oren E, Delbru R, Catasta M, et al. Sindice.com: A Document-oriented Lookup Index for Open Linked Data [J]. International Journal of Metadata, Semantics and Ontologies, 2008, 3(1): 37-52.
[21] Cheng G, Ge W, Qu Y. Falcons: Searching and Browsing Entities on the Semantic Web [C]. In: Proceedings of the 17th International Conference on World Wide Web (WWW’08).New York: ACM, 2008: 1101-1102.
[22] MarOnt Ontology [EB/OL]. [2013-05-20].http://deri.semanticweb.org/content/marcont-ontology.
[23] The FRBR Blog. Bibliographic Ontology Specification 1.0[EB/OL]. (2008-06-06). [2013-05-20]. http://www.frbr.org/2008/06/06/bibliographic-ontology-specification-10.
[24] 白海燕,乔晓东. 基于本体和关联数据的书目组织语义化研究[J]. 现代图书情报技术,2010(9): 18-27. (Bai Haiyan, Qiao Xiaodong. Study of Semantic Bibliography base on Ontology and Linked Data[J]. New Technology of Library and Information Service, 2010 (9): 18-27.)
[25] 王军,程煜华. 基于传统知识组织资源的本体自动构建[J]. 情报学报, 2009, 28(5): 651-657. (Wang Jun, Cheng Yuhua. An Automatic Approach to Ontology Building by Integrating Traditional Knowledge Organization Resources[J]. Journal of the China Society for Scientific and Technical Information, 2009, 28(5): 651-657.)
[26] MOD Schema [EB/OL]. [2013-05-20]. http://www.loc.gov/standards/mods/.
[27] SIMILE Widgets [EB/OL]. [2013-05-20]. http://simile-widgets.org.
[28] RDF Ontology for MODS V3.1[EB/OL]. [2013-05-20]. http://simile.mit.edu/2006/01/ontologies/mods3.
[29] Project Gutenberg Australia [EB/OL]. [2013-05-20]. http://gutenberg.net.au/.
[30] code4rda [EB/OL]. [2013-05-20]. http://code.google.com/p/code4rda/.
[31] The Library of Congress. Resource Description and Access [EB/OL]. [2013-05-20]. http://www.loc.gov/aba/rda/.
[32] Bizer C, Heath T, Berners-Lee T. Linked Data – The Story So Far [J]. International Journal on Semantic Web and Information Systems, 2009, 5(3):1-22.
[33] Volz J, Bizer C, Gaedke M, et al. Discovering and Maintaining Links on the Web of Data [C]. In: Proceedings of the 8th International Semantic Web Conference.Berlin, Heidelberg: Springer-Verlag,2009: 650-665.
[34] DBpedia. DBpedia 3.8 Downloads[EB/OL]. [2013-05-20]. http://wiki.dbpedia.org/Downloads38.
[1] 杨恒,王思丽,祝忠明,刘巍,王楠. 基于并行协同过滤算法的领域知识推荐模型研究*[J]. 数据分析与知识发现, 2020, 4(6): 15-21.
[2] 沈志宏, 姚畅, 侯艳飞, 吴林寰, 李跃鹏. 关联大数据管理技术: 挑战、对策与实践*[J]. 数据分析与知识发现, 2018, 2(1): 9-20.
[3] 崔家旺, 李春旺. 基于关联数据的类簇语义揭示模型研究[J]. 数据分析与知识发现, 2017, 1(4): 57-66.
[4] 姜赢, 张婧, 朱玲萱. 面向Cytoscape平台的关联数据知识图谱概览抽取与可视化*[J]. 数据分析与知识发现, 2017, 1(3): 29-37.
[5] 高长元, 于建萍, 何晓燕. 基于改进粒子群算法的云计算产业联盟知识搜索算法研究*[J]. 数据分析与知识发现, 2017, 1(3): 81-89.
[6] 齐云飞, 赵宇翔, 朱庆华. 关联数据在数字图书馆移动视觉搜索系统中的应用研究*[J]. 数据分析与知识发现, 2017, 1(1): 81-90.
[7] 赵夷平,毕强. 关联数据在学术资源网相似文献发现中的应用研究*[J]. 现代图书情报技术, 2016, 32(3): 41-49.
[8] 郭振英, 赵文兵, 魏育辉. 轻量级书目本体关联数据建设实践[J]. 现代图书情报技术, 2015, 31(7-8): 139-143.
[9] 高劲松, 程娅, 梁艳琪. 面向关联数据集的本体匹配方法研究[J]. 现代图书情报技术, 2015, 31(6): 33-40.
[10] 梁艺多, 翟军. 本体推理在关联数据链接发现中的应用研究[J]. 现代图书情报技术, 2015, 31(4): 87-95.
[11] 卓可秋, 虞为, 苏新宁. 突发事件检测的MapReduce并行化实现[J]. 现代图书情报技术, 2015, 31(2): 46-54.
[12] 马宾, 殷立峰. 一种基于Hadoop平台的并行朴素贝叶斯网络舆情快速分类算法[J]. 现代图书情报技术, 2015, 31(2): 78-84.
[13] 高劲松, 梁艳琪, 李珂, 肖涟, 周习曼. 面向关联数据的电子商务信用信息服务模型研究[J]. 现代图书情报技术, 2014, 30(6): 8-16.
[14] 王忠义, 夏立新, 石义金, 郑森茂. 数字图书馆中层关联数据的创建与发布[J]. 现代图书情报技术, 2013, (5): 28-33.
[15] 刘炜, 夏翠娟, 张春景. 大数据与关联数据:正在到来的数据技术革命[J]. 现代图书情报技术, 2013, (4): 2-9.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn