Please wait a minute...
Data Analysis and Knowledge Discovery  2020, Vol. 4 Issue (1): 121-130    DOI: 10.11925/infotech.2096-3467.2019.0955
Current Issue | Archive | Adv Search |
Automatic Concept Update Strategy Towards Heterogeneous Terminology Integration
Haixia Sun1,2,Panpan Deng2,Jiao Li2,Liu Shen2,Qing Qian2()
1School of Information Management, Nanjing University, Nanjing 210093, China
2Institute of Medical Information,Chinese Academy of Medical Sciences, Beijing 100020, China
Download: PDF(819 KB)   HTML ( 8
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper proposes a method updating integrated concept for the version evolution of source Knowledge Organization Systems (KOSs), aiming to promote the dynamic development of the heterogeneous terminology integration system.[Methods] Our model focuses on terms, synonym sets and preferred terms of concepts. Firstly, we identified terms changing types and preferred terms changing modes of concepts in source KOSs by exact string matching. Then, we recognized their synonym sets changing patterns through concept vector space. Finally, we updated synonym sets and preferred terms of integrated concepts fusion rule and similarity. We also assessed the results yielded by our method using medical integration concept set of STKOS and its important sources, MeSH and HUGO.[Results] The synonymous merging rate of new term from source KOSs reached 94.96%, and the update accuracy of preferred term of changed integrated concepts reached 99.91%.[Limitations] We did not consider ambiguity of the terms and the results were affected by the number of vocabulary and update order.[Conclusions] The proposed method can be applied to update concepts of synonymous knowledge organization systems because of their source KOSs evolution.

Key wordsSynonymous Interoperability      Interoperability Maintenance      Integrated Concept Updating      Knowledge Organization Systems     
Received: 20 August 2019      Published: 14 March 2020
ZTFLH:  TP393  
Corresponding Authors: Qing Qian     E-mail: qing@imicams.ac.cn

Cite this article:

Haixia Sun,Panpan Deng,Jiao Li,Liu Shen,Qing Qian. Automatic Concept Update Strategy Towards Heterogeneous Terminology Integration. Data Analysis and Knowledge Discovery, 2020, 4(1): 121-130.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2019.0955     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2020/V4/I1/121

Automatic Update of Integrated Concepts in Synonymous Interoperability System According to Source Vocabularies Evolution
Extension Algorithm for New Terms
新旧版本来源词表概念变化类别 变更数量
MeSH 2017 HUGO 2016
术语变更 删除术语DT 6873 20 383
新增术语NT 29 209 52 002
未变化术语UT 205 760 115 646
同义词集合变更 模式“0” 删除概念同义词集合Cdel 228 1 522
模式“1” 完全新增概念同义词集合Cnew 4 341 10 186
模式“2” 未变化同义词集合Sunc 44 883 16 766
拆分同义词集合Sut-split 135 0
合并同义词集合Sut-merge 91 2
复合操作变更同义词集合Sut-change 15 0
模式“3” 变化概念同义词集合Sc 5 246 12 862
优选术语变更 优选术语未变化PTunc 49 804 1 522
优选术语变化PTc 566 17 968
新概念优选术语PTnew 4 314 10 186
Version Change of MeSH and HUGO
整合概念变更类别 实验1 实验2 实验3 实验4
删除概念Cdel 127 1 575 1 649 1 693
新增概念Cnew Ptnew 2 272 9 446 7 187 11 704
同义词集合不变Sunc 优选术语未变Ptunc 388 867 16 856 55 037 55 489
同义词集合不变Sunc 优选术语改变Ptc 2 0 2 2
同义词集合改变Sc 优选术语未变Ptunc 4 438 11 638 16 492 16 049
同义词集合改变Sc 优选术语改变Ptc 114 1 411 1 565 1 521
Change of Integrated Concepts After 4 Experiments
评价数据集 实验1 实验2 实验3 实验4
序号 整合概念变更类别 整合概念数 新增术语数 整合概念数 新增术语数 整合概念数 新增术语数 整合概念数 新增术语数
1 Cnew 114 565 472 1 502 359 1 338 585 1 942
2 Sunc Ptc 2 0 0 0 2 0 2 0
3 Sc Ptunc 222 940 581 861 824 1 763 802 1 840
4 Sc Ptc 6 22 70 244 78 628 76 244
合计 344 1 527 1 123 2 607 1 263 3 729 1 465 4 026
Numbers of Concepts and New Terms in 4 Sampled Data Sets for Evaluation
同义词集合变更类别 实验1 实验2 实验3 实验4
Cnew 100.00% 100.00% 100.00% 100.00%
Sc 92.00% 87.33% 72.48% 85.41%
合计 94.96% 94.63% 82.35% 92.45%
Merging Precion of New Terms
整合概念变更类别 实验1 实验2 实验3 实验4
Cnew Ptnew 98.25% 100.00% 98.33% 99.66%
Sunc Ptc 100.00% - 100.00% 100.00%
Sc Ptunc 97.30% 99.83% 99.88% 100.00%
Sc Ptc 50.00% 100.00% 94.87% 92.11%
合计 96.80% 99.91% 99.13% 99.45%
Precion of Preferred Terms Recommendation for Changed Concepts
[1] 司莉 . 知识组织系统的互操作及其实现[J]. 现代图书情报技术, 2007(3):29-34.
[1] ( Si Li . Interoperability and Its Implementation Among Knowledge Organization Systems[J]. New Technology of Library and Information Service, 2007(3):29-34.)
[2] 宋文 . 知识组织体系语义互操作研究[J]. 图书馆论坛, 2012,32(6):117-121.
[2] ( Song Wen . Research on Interoperation of Knowledge Organization System[J]. Library Tribune, 2012,32(6):117-121.)
[3] 孙坦, 刘峥 . 面向外文科技文献信息的知识组织体系建设思路[J]. 图书与情报, 2013(1):2-7.
[3] ( Sun Tan, Liu Zheng . Methodology Framework of Knowledge Organization System for Scientific & Technological Literature[J]. Library and Information, 2013(1):2-7.)
[4] Lindberg C . The Unified Medical Language System (UMLS) of the National Library of Medicine[J]. Journal (American Medical Record Association), 1990,61(5):40-42.
[5] 潘洪建 . 知识本质: 内在、开放、动态——新知识观的思考[J]. 教育理论与实践, 2003,23(2):1-6.
[5] ( Pan Hongjian . The Essence of Knowledge: Inner, Open and Dynamic—On the Outlook on Knowledge[J]. Theory and Practice of Education, 2003,23(2):1-6.)
[6] MeSH [DB/OL]. [2019-03-28]..
[7] SNOMEDCT International [DB/OL]. [2019-03-28]..
[8] LOINC[DB/OL]. [2019-03-28]..
[9] HGNC [DB/OL]. [2019-03-28]..
[10] 中医药主题词表 [DB/OL]. [ 2019- 03- 28]. .
[10] ( Traditional Chinese Medicine Thesaurus [DB/OL]. [ 2019- 03- 28]. )
[11] 中文医学主题词表[DB/OL][ 2019- 03- 28]. .
[11] ( Chinese Medical Subject Headings)[DB/OL] [ 2019- 03- 28]. .)
[12] Saitwal H, Qing D, Jones S , et al. Cross-terminology Mapping Challenges: A Demonstration Using Medication Terminological Systems[J]. Journal of Biomedical Informatics, 2012,45(4):613-625.
[13] Dos Reis J C, Pruski C, Da SilveiraM , et al. Understanding Semantic Mapping Evolution by Observing Changes in Biomedical Ontologies[J]. Journal of Biomedical Informatics, 2014,47:71-82.
[14] The National Library of Medicine. UMLS [DB/OL].[2019-03-28]. .
[15] Da Silveira M, Dos Reis J C, Pruski C, . Management of Dynamic Biomedical Terminologies: Current Status and Future Challenges[J]. Yearbook of Medical Informatics, 2015,24(1):125-133.
[16] Noy N F, Musen M A. PROMPTDIFF: A Fixed-Point Algorithm for Comparing Ontology Versions [C]//Proceedings of the 18th National Conference on Artificial Intelligence,Edmonton. 2002: 744-750.
[17] Hartung M, Kirsten T, Rahm E. Analyzing the Evolution of Life Science Ontologies and Mappings [C]// Proceedings of the 5th International Workshop on Data Integration in the Life Sciences.Berlin: Springer, 2008: 11-27.
[18] Hartung M, Kirsten T, Gross A , et al. OnEX: Exploring Changes in Life Science Ontologies[J]. BMC Bioinformatics, 2009, 10:Article No. 250.
[19] Hartung M, Gross A, Rahm E . COnto-Diff: Generation of Complex Evolution Mappings for Life Science Ontologies[J]. Journal of Biomedical Informatics, 2013,46(1):15-32.
[20] Gross A, Hartung M, Thor A, et al. How do Computed Ontology Mappings Evolve? A Case Study for Life Science Ontologies [C]// Proceedings of the 2012 Joint Workshop on Knowledge Evolution and Ontology Dynamics, Boston. 2012: 1-12.
[21] Dos Reis J C, Dinh D, Da Silveira M , et al. Recognizing Lexical and Semantic Change Patterns in Evolving Life Science Ontologies to Inform Mapping Adaptation[J]. Artificial Intelligence in Medicine, 2015,63(3):153-170.
[22] Dinh D, Dos Reis J C, Pruski C , et al. Identifying Relevant Concept Attributes to Support Mapping Maintenance Under Ontology Evolution[J]. Web Semantics: Science, Services and Agents on the World Wide Web, 2014,29:53-66.
[23] Meilicke C, Stuckenschmidt H, Tamilin A . Reasoning Support for Mapping Revision[J]. Journal of Logic and Computation, 2009,19(5):807-829.
[24] Castano S, Ferrara A, Lorusso D . Mapping Validation by Probabilistic Reasoning [C]// Proceedings of the 5th European Semantic Web Conference on the Semantic Web: Research & Applications, Canary Islands, Spain. Berlin, Heidelberg: Springer, 2008: 170-184.
[25] Meyniel F, Schlunegger D, Dehaene S . The Sense of Confidence During Probabilistic Learning: A Normative Account[J]. PLoS Computational Biology, 2015,11(6):e1004305.
[26] Khattak A M, Pervez Z, Latif K , et al. Time Efficient Reconciliation of Mappings in Dynamic Web Ontologies[J]. Knowledge-Based Systems, 2012,35:369-374.
[27] Tang F, Tang R. Minimizing Influence of Ontology Evolution In Ontology-based Data Access System [C]// Proceedings of the 2010 IEEE International Conference on Progress in Informatics and Computing, Shanghai, China. 2010: 10-14.
[28] Martins H, Silva N. A User-driven and a Semantic-based Ontology Mapping Evolution Approach [C]// Proceedings of the 11th International Conference on Enterprise Information Systems, Milan, Italy. 2009: 214-221.
[29] Dos Reis J C, Pruski C, Da Silveira M , et al. DyKOSMap: A Framework for Mapping Adaptation Between Biomedical Knowledge Organization Systems[J]. Journal of Biomedical Informatics, 2015,55:153-173.
[30] 孙海霞, 成颖 . 信息集成中的字符串匹配技术研究[J]. 现代图书情报技术, 2007(7):22-26.
[30] ( Sun Haixia, Cheng Ying . Study on String-based Matching of Information Integration[J]. New Technology of Library and Information Service, 2007(7):22-26.)
[31] 孙海霞, 李军莲, 华薇娜 , 等. 科技知识组织体系语义互操作网络协同工作平台设计与实现[J]. 农业图书情报, 2019,31(1):23-34.
[31] ( Sun Haixia, Li Junlian, Hua Weina , et al. Design and Implementation of Network Collaborative Work Platform for Semantic Interoperability of Science and Technology Knowledge Organization Systems[J]. Agricultural Library and Information, 2019,31(1):23-34.)
[32] STKOS超级科技词表协同构建与管理系统[DB/OL]. [ 2019- 04- 28]. .
[32] ( The Collaborative Building and Management System for Super Scientific and Technological Thesaurus [DB/OL]. [ 2019- 04- 28]. .)
[1] Qu Jianfeng, Li Fang, Zhang Yihua, Li Bao. Study and Implementation on the Automatic Mapping Rules Between Knowledge Organization Systems——The Case of the Dewey Decimal Classification and the Chinese Library Classification[J]. 现代图书情报技术, 2012, (10): 83-88.
[2] Si Li . Interoperability and Its Implementation Among Knowledge Organization Systems[J]. 现代图书情报技术, 2007, 2(3): 29-34.
[3] Alistair Miles,Brian Matthews,Michael Wilson,Dan Brickley. SKOS Core: Simple Knowledge Organisation for the Web[J]. 现代图书情报技术, 2006, 22(1): 3-9.
[4] Linda Hill,Olha Buchel,Greg Janée, Zeng Lei. Integration of Knowledge Organization Systems into Digital Library Architectures[J]. 现代图书情报技术, 2004, 20(1): 4-8.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn