Please wait a minute...
Data Analysis and Knowledge Discovery  2017, Vol. 1 Issue (4): 84-93    DOI: 10.11925/infotech.2096-3467.2017.04.10
Orginal Article Current Issue | Archive | Adv Search |
Building Semantic Enrichment Framework for Scientific Literature Retrieval System
Xie Jing, Wang Jingdong, Wu Zhenxin(), Zhang Zhixiong, Wang Ying, Ye Zhifei
National Science Library, Chinese Academy of Sciences, Beijing 100190, China
Download: PDF (6590 KB)   HTML ( 3
Export: BibTeX | EndNote (RIS)      

[Objective] This paper aims to improve the scientific literature retrieval system with the help of semantic recognition and knowledge relationship computing. [Methods] First, we identified and extracted semantic objects from the scientific literature. Then, we calculated and established semantic relations among the objects using data-mining tools. Finally, we built semantic multidimensional index for these objects and relations, and then designed a new data organization model. [Results] The new system effectively identified the semantic information and improved the user experience. [Limitations] We need to expand the dataset used in this study and evaluate the new system in other areas. [Conclusions] The proposed system could retrieve more knowledge and indicate some future directions.

Key wordsSemantic Enrichment      Semantic Knowledge Organization      Semantic Relation Presentation      Multidimensional Index     
Received: 03 March 2017      Published: 24 May 2017
ZTFLH:  TP391  

Cite this article:

Xie Jing,Wang Jingdong,Wu Zhenxin,Zhang Zhixiong,Wang Ying,Ye Zhifei. Building Semantic Enrichment Framework for Scientific Literature Retrieval System. Data Analysis and Knowledge Discovery, 2017, 1(4): 84-93.

URL:     OR

文章PMID 来源
术语类型 MeSH词表
语义关系缩写 文本中
置信度 术语开始位置 术语结束位置
SE 00000000 tx 1 entity C1280519 Effectiveness qlco Effectiveness 1000 1 13
SE 00000000 tx 1 entity C0150143 Behavior mannagement topp behavioural managenment 964 18 39
SE 00000000 tx 1 entity C0149931 Migraine Disorders dsyn migraine 1000 44 51
SE 00000000 tx 1 entity C0001675 Adult aggp adult 888 56 60
SE 00000000 tx 1 entity C0030705 Patients podg patients 888 62 69
SE 00000000 tx 1 entity C0015607 family medicine
bmod family practice 901 81 95
SE 00000000 tx 1 entity C0442592 Clinic hcro,mnob clinics 901 97 103
SE 00000000 tx 1 entity C1514720 Randomized ftcn randomized 851 108 117
SE 00000000 tx 1 entity C0702113 Controlled ftcn controlled 851 119 128
SE 00000000 tx 1 entity C0008976 Clinical Trials resa trial 851 130 134
SE|00000000||tx|1|relation|3|1|C0149931|Migraine Disorders|dsyn|dsyn|||migraine|||1000|44|51|
索引字段 字段描述 字段功能
S 三元组主语 检索查询
P 三元组谓语 检索查询
O 三元组宾语 检索查询
S+P 主语与谓词拼接组合 分面揭示
P+O 谓词与宾语拼接组合 分面揭示
[1] U.S.National Library of Medicine. Semantic Knowledge Representation [EB/OL].[2016-01-13].
[2] Wikipedia. Knowledge Graph [EB/OL].[2016-02-10].
[3] Google Inside Search [EB/OL]. [2016-02-10].
[4] Wolframalpha. Computational Knowledge Engine [EB/OL].[2015-03-10].
[5] Kngine. The Most Intelligent Engine [EB/OL]. [2015-03-10].
[6] SindiceTech. Enterprise Knowledge Graphs [EB/OL]. [2015- 03-10].
[7] W3C Semantic Web. RDF [EB/OL].[2015-06-05].
[8] SindiceTech. FreeBase Distribution [EB/OL]. [2015-03-10].
[9] Apache Solr [EB/OL]. [2015-06-05].
[10] PubMed [EB/OL]. [2015-10-11].
[11] U.S.National Library of Medicine. SemRep [EB/OL].[2015-10-22].
[12] Del Corro L, Gemulla R.ClausIE: Clause-Open Information Extraction[C]//Proceedings of the the 22nd International Conference on World Wide Web. 2013:355-366.
[13] Merrill M D.Knowledge Objects[R]. USA: CBT Solutions, 1998: 1-11.
[14] U.S.National Library of Medicine. Unified Medical Language System (UMLS) [EB/OL].[2016-01-13]. .
[15] 王颖, 张智雄, 李传席, 等. 科技知识组织体系开放引擎系统的设计与实现[J]. 现代图书情报技术,2015 (10): 95-101.
[15] (Wang Ying, Zhang Zhixiong, Li Chuanxi, et al.The Design and Implementation of Open Engine System for Scientific & Technological Knowledge Organization Systems[J]. New Technology of Library and Information Service, 2015 (10): 95-101.)
[16] UMLS. Semantic Relationships [EB/OL].[2015-10-17].
[17] Chakraborty A, Munshi S, Mukhopadhyay D.Searching and Establishment of S-P-O Relationships for Linked RDF Graphs: An Adaptive Approach[C]//Proceedings of International Conference on Cloud & Ubiquitous Computing & Emerging Technologies (CUBE). 2013.
[18] Matthews P H.Syntactic Relations:A Critical Survey[M]. University of CambridgePress, 2007: 3-10.
[19] U.S.National Library of Medicine. Medical Subject Headings (MeSH) [EB/OL].[2015-06-05].
[20] U.S.National Library of Medicine. MeSH Category Tree View [EB/OL].[2015-06-05].
[21] MetaMap - A Tool For Recognizing UMLS Concepts in Text [EB/OL]. [2015-06-20].
[22] The Stanford Natural Language Processing Group. Stanford Part of Speech Tagger [EB/OL].[2015-08-24].
[23] SPECIALIST dTagger [EB/OL]. [2015-06-20].
[24] 孙坦, 刘峥. 面向外文科技文献信息的知识组织体系建设思路[J]. 图书与情报, 2013 (1): 2-7.
doi: 10.3969/j.issn.1003-6938.2013.01.001
[24] (Sun Tan, Liu Zheng.Methodology Framework of Knowledge Organization System for Scientific & Technological Literature[J]. Library & Information, 2013(1): 2-7.)
doi: 10.3969/j.issn.1003-6938.2013.01.001
[25] Rindflesch T C, Fiszman M.The Interaction of Domain Knowledge and Linguistic Structure in Natural Language Pprocessing: Interpreting Hypernymic Propositions in Biomedical Text[J]. Journal of Biomedical Informatics, 2003, 36(6): 462-477.
doi: 10.1016/j.jbi.2003.11.003 pmid: 14759819
[1] Wang Sili, Zhu Zhongming, Yang Heng, Liu Wei. Research on Automatic Identification of Hypernym-Hyponym Relations of Domain Concepts Based on Pattern and Projection Learning [J]. 数据分析与知识发现, 0, (): 1-.
[2] Weng Mengjuan,Yao Changqing,Han Hongqi,Wang Lijun,Ran Yaxin. Classification and Indexing Method with CNN for Imbalanced Datasets[J]. 数据分析与知识发现, 2020, 4(7): 87-95.
[3] Tang Xiaobo,Gao Hexuan. Classification of Health Questions Based on Vector Extension of Keywords[J]. 数据分析与知识发现, 2020, 4(7): 66-75.
[4] Qiu Erli,He Hongwei,Yi Chengqi,Li Huiying. Research on Public Policy Support Based on Character-level CNN Technology[J]. 数据分析与知识发现, 2020, 4(7): 28-37.
[5] Wang Jiandong,Yu Shiyang. Principles on Constructing National Economic Brain[J]. 数据分析与知识发现, 2020, 4(7): 2-17.
[6] Xu Hongxia,Yu Qianqian,Qian Li. Studying Content Interaction Data with Topic Model and Sentiment Analysis[J]. 数据分析与知识发现, 2020, 4(7): 110-117.
[7] Li Keyu,Wang Hao,Gong Lijuan,Tang Huihui. Measurement and Distribution of Index Quality in Research Topics from Academic Databases[J]. 数据分析与知识发现, 2020, 4(6): 91-108.
[8] Wei Tingxin,Bai Wenlei,Qu Weiguang. Sense Prediction for Chinese OOV Based on Word Embedding and Semantic Knowledge[J]. 数据分析与知识发现, 2020, 4(6): 109-117.
[9] Yang Heng,Wang Sili,Zhu Zhongming,Liu Wei,Wang Nan. Recommending Domain Knowledge Based on Parallel Collaborative Filtering Algorithm[J]. 数据分析与知识发现, 2020, 4(6): 15-21.
[10] Jiao Qihang,Le Xiaoqiu. Generating Sentences of Contrast Relationship[J]. 数据分析与知识发现, 2020, 4(6): 43-50.
[11] Cai Yongming,Liu Lu,Wang Kewei. Identifying Key Users and Topics from Online Learning Community[J]. 数据分析与知识发现, 2020, 4(6): 69-79.
[12] Wang Mo,Cui Yunpeng,Chen Li,Li Huan. A Deep Learning-based Method of Argumentative Zoning for Research Articles[J]. 数据分析与知识发现, 2020, 4(6): 60-68.
[13] Ye Guanghui, Xu Tong. Research on Dynamic City Profile Based on Evolutionary Analysis [J]. 数据分析与知识发现, 0, (): 1-.
[14] Li Junlian,Wu Yingjie,Deng Panpan,Leng Fuhai. Automatic Data Processing Strategy of Citation Anomie Based on Feature Fusion[J]. 数据分析与知识发现, 2020, 4(5): 38-45.
[15] Liu Ping,Peng Xiaofang. Calculating Word Similarities Based on Formal Concept Analysis[J]. 数据分析与知识发现, 2020, 4(5): 66-74.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938