|
|
Building Knowledge Graph with Sci-Tech Big Data |
Ying Wang1( ),Li Qian1,2,Jing Xie1,2,Zhijun Chang1,2,Beibei Kong1 |
1National Science Library, Chinese Academy of Sciences, Beijing 100190, China 2Department of Library, Information and Archives Management, University of Chinese Academy of Sciences, Beijing 100190, China |
|
|
Abstract [Objective] This paper tries to extract information from Sci-Tech big data and build an academic knowledge network, aiming to develop smart knowledge services. [Methods] We proposed an Ontology schema and a framework to contruct knowledge graph based on the distributed storage and high-performance computing of big data platform. The proposed model helped us extract and align research entities for relationship discovery. We also adopted the knowledge merging and enrichment, semantic storage and quality management techniques. [Results] We created a huge knowledge graph including more than 300 million entities and 1.1 billion relations. It also supported knowledge discovery platform and smart personal research assistant apps for scientific big data. [Limitations] More research is needed to improve the quality management of knowledge graph, as well as the precision of entity alignment. [Conclusions] The proposed method improve the knowledge management of scientific and technology big data.
|
Received: 03 December 2018
Published: 04 March 2019
|
[1] | Singhal A. Introducing the Knowledge Graph: Things, Not Strings[EB/OL]. [2013-04-10]. . | [2] | Wu W, Li H, Wang H, et al.Probase: A Probabilistic Taxonomy for Text Understanding[C]// Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. New York: ACM, 2012: 481-492. | [3] | Baidu Open Knowledge Graph[EB/OL]. [2018-08-16].. | [4] | 张阔. 从搜索信息到搜索知识——技术架构[EB/OL]. [2013-03-26]. . | [4] | (Zhang Kuo. From Information Search to Knowledge Search — Technology Infrastructure[EB/OL]. [2013-03-26]. | [5] | 王元卓, 贾岩涛, 赵泽亚, 等. OpenKN-网络大数据时代的知识计算引擎[J]. 中国计算机学会通讯, 2014, 10(11): 30-35. | [5] | (Wang Yuanzhuo, Jia Yantao, Zhao Zeya, et al.OpenKG-Knowledge Computing Engine in the Era of Network Big Data[J]. Communications of the Chinese Computer Federation, 2014, 10(11): 30-35.) | [6] | Zhu J G, Wang H F, Shen B J. Software. Zhishi.Schema: A Software Programming Taxonomy Derived from Stackoverflow[C]// Proceedings of the 14th International Semantic Web Conference (ISWC 2015), Bethlehem, Pennsylvania, USA. 2015: 1-4. | [7] | Introduction to CN-Probase [EB/OL]. [2017-11-29]. . | [8] | 国务院. 新一代人工智能发展规划[R]. [2017-07-08]. 新一代人工智能发展规划[R]. [2017-07-08]. . | [8] | (State Council.New Generation Artificial Intelligence Development Plan[R]. [2017-07-08]. New Generation Artificial Intelligence Development Plan[R]. [2017-07-08]. | [9] | 钱力, 谢靖, 常志军, 等. 基于科技大数据的智能知识服务体系研究设计[J]. 数据分析与知识发现. DOI: 10.11925/infotech.2096-3467.2018.1364. | [9] | (Qian Li, Xie Jing, Chang Zhijun, et al.Designing Smart Knowledge Services with Sci-Tech Big Data[J]. Data Analysis and Knowledge Discovery. DOI: 10.11925/infotech.2096-3467. 2018.1364.) | [10] | Springer Nature.SN SciGraph [EB/OL]. [2018-08-18].. | [11] | Allen B P.The Roll of Metadata in the Second Machine Age [EB/OL]. [2017-02-02].. | [12] | Taylor & Francis.Wizdom.ai [EB/OL].[2018-05-05]. . | [13] | Tang J, Zhang J, Yao L M, et al.AMiner: Extraction and Mining of Academic Social Networks[C]// Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD’2008). Las Vegas, Nevada, USA. New York, ACM, 2008:990-998. | [14] | Acemap Knowledge Graph[EB/OL]. [2018-05-05].. | [15] | 国家科技文献中心. NSTL统一文献元数据标准3.0[EB/OL]. [2017-10-18]. . | [15] | (National Science and Technology Library. Unified MetaData Standard for Scientific Literature Version3.0 [EB/OL]. [2017-10-18]. |
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|