|
|
Fusion of Organization Authority Files from Multiple Sources |
Fan Yunman,Chen Ying,Tang Xiaoli() |
Institute of Medical Information, Chinese Academy of Medical Sciences, Beijing 100020, China |
|
|
Abstract [Objective] This paper aims to improve the selection and evaluation of the organization authority files (OAF) and address the mapping issues between OAF and redundant relationships. [Methods] First, we examined the existing OAF and related studies. Then, we constructed a fusion model with six steps: data collection and analysis, metadata framework fusion, organization relationship fusion, alias fusion, OAF data model construction, and verification of fusion results. Finally, we examined the new model using data from Dimensions, Scopus, and Web of Science. [Results] Our new model’s F1 value reached 0.97 or above in the first, second, and third-level organizations, and the Dimensions made the most significant contribution. We constructed an OAF containing 5,128 organizations. [Limitations] The organization relationship only included the parent-child relations. Cross-reference relations and the choice of standard organization names need to be studied. We also need to verify the proposed model with more data. [Conclusions] The new model could effectively integrate OAF from multiple sources.
|
Received: 19 May 2023
Published: 15 March 2024
|
|
Fund:Chinese Academy of Medical Sciences Medical and Health Science and Technology Innovation Project (Major Collaborative Innovation Project)(2021-I2M-1-033) |
Corresponding Authors:
Tang Xiaoli,ORCID: 0000-0001-6946-3482,E-mail:tang.xiaoli@imicams.ac.cn。
|
[1] |
薛明, 王丽萍. 我国规范文档研究综述[J]. 图书馆学刊, 1999(1): 28-30.
|
[1] |
(Xue Ming, Wang Liping. The Review of the Research on Authority Documents in China[J]. Journal of Library Science, 1999(1): 28-30.)
|
[2] |
MacEwan A, Angjeli A, Gatenby J. The International Standard Name Identifier (ISNI): The Evolving Future of Name Authority Control[J]. Cataloging & Classification Quarterly, 2013, 51(1-3): 55-71.
|
[3] |
DataSalon. OrgRef[EB/OL].[2022-08-22]. https://web.archive.org/web/20140912085615/http://www.orgref.org/web/index.htm.
|
[4] |
Loesch M F. VIAF (The Virtual International Authority File)-http://viaf.org[J]. Technical Services Quarterly, 2011, 28(2): 255-256.
|
[5] |
Burnham J F. Scopus Database: A Review[J]. Biomedical Digital Libraries, 2006, 3: Article No.1.
|
[6] |
Clarivate Analytics. Web of Science Core Collection Help-Corporate and Institution Abbreviations[EB/OL].[2022-06-08]. https://images.webofknowledge.com/WOKRS58B4/help/WOS/hs_corporate_abbreviations.html.
|
[7] |
Digital Science. Dimensions[EB/OL].[2022-08-22]. https://app.dimensions.ai/.
|
[8] |
Lammey R. Solutions for Identification Problems: A Look at the Research Organization Registry[J]. Science Editing, 2020, 7(1): 65-69.
|
[9] |
贾君枝, 石燕青. 中文名称规范文档与VIAF的关联[J]. 国家图书馆学刊, 2014, 23(6): 85-90.
|
[9] |
(Jia Junzhi, Shi Yanqing. The Association of Chinese Name Authority File with VIAF[J]. Journal of the National Library of China, 2014, 23(6): 85-90.)
|
[10] |
胡媛. 中文名称规范文档与VIAF共享问题分析[J]. 河南图书馆学刊, 2018, 38(2): 111-113.
|
[10] |
(Hu Yuan. Analysis of Sharing Problems Between Chinese Name Authority Document and VIAF[J]. The Library Journal of Henan, 2018, 38(2): 111-113.)
|
[11] |
王锦华, 陈锐, 冯占英, 等. 基于多源数据融合的军事医学机构名称规范研究[J]. 中华医学图书情报杂志, 2020, 29(2): 52-57.
|
[11] |
(Wang Jinhua, Chen Rui, Feng Zhanying, et al. Multisource Data Fusion-Based Normalization of Military Medical Institution Names[J]. Chinese Journal of Medical Library and Information Science, 2020, 29(2): 52-57.)
|
[12] |
王星, 曾建勋, 苏静, 等. 机构规范文档构建方式研究[J]. 数字图书馆论坛, 2015(7): 2-8.
|
[12] |
(Wang Xing, Zeng Jianxun, Su Jing, et al. Research on the Construction of Institutional Authority File[J]. Digital Library Forum, 2015(7): 2-8.)
|
[13] |
Huang Y W, Li J, Sun T, et al. Institution Information Specification and Correlation Based on Institutional PIDs and IND Tool[J]. Scientometrics, 2020, 122(1): 381-396.
|
[14] |
王瑞云, 贾君枝. 基于外部ID的中文实体对齐分析——以中国科学院院士Wikidata数据子集为例[J]. 国家图书馆学刊, 2020, 29(2): 102-113.
|
[14] |
(Wang Ruiyun, Jia Junzhi. Analysis of Named Entity Alignment Based on External-ID—Taking Data Subset of Wikidata for Academician of Chinese Academy of Sciences as an Example[J]. Journal of the National Library of China, 2020, 29(2): 102-113.)
|
[15] |
刘翔, 黄晨. 基于ISNI的学术应用生态构建[J]. 数字图书馆论坛, 2020(5): 49-53.
|
[15] |
(Liu Xiang, Huang Chen. Construction of Academic Application Ecology Based on ISNI[J]. Digital Library Forum, 2020(5): 49-53.)
|
[16] |
Huang S Q, Yang B, Yan S L, et al. Institution Name Disambiguation for Research Assessment[J]. Scientometrics, 2014, 99: 823-838.
|
[17] |
孙海霞, 王蕾, 吴英杰, 等. 科技文献数据库中机构名称匹配策略研究[J]. 数据分析与知识发现, 2018, 2(8): 88-97.
|
[17] |
(Sun Haixia, Wang Lei, Wu Yingjie, et al. Matching Strategies for Institution Names in Literature Database[J]. Data Analysis and Knowledge Discovery, 2018, 2(8): 88-97.)
|
[18] |
苏娜, 张志强. 科学计量学中多重关系融合方法研究进展及分析[J]. 情报科学, 2010, 28(9): 1309-1313.
|
[18] |
(Su Na, Zhang Zhiqiang. On the Multiple Relation Fusion Research in Scientometrics[J]. Information Science, 2010, 28(9): 1309-1313.)
|
[19] |
Xu H Y, Dong K, Luo R, et al. Research on Topic Recognition Based on Multivariate Relation Fusion[C]// Proceedings of the 23rd International Conference on Science and Technology Indicators. 2018: 378-384.
|
[20] |
周毅, 张建勇, 刘峥, 等. 科研实体名称规范的关联数据模型构建[J]. 图书情报工作, 2020, 64(10): 109-117.
doi: 10.13266/j.issn.0252-3116.2020.10.012
|
[20] |
(Zhou Yi, Zhang Jianyong, Liu Zheng, et al. Research on the Construction of Linked Data Model for Research Entity’s Name Authority Data[J]. Library and Information Service, 2020, 64(10): 109-117.)
doi: 10.13266/j.issn.0252-3116.2020.10.012
|
[21] |
陈辰, 周莉, 王璐, 等. 科研实体唯一标识符互操作研究[J]. 情报理论与实践, 2018, 41(12): 99-103.
doi: 10.16353/j.cnki.1000-7490.2018.12.018
|
[21] |
(Chen Chen, Zhou Li, Wang Lu, et al. Interoperability of Scientific Research Entity Unique Identifier[J]. Information Studies: Theory & Application, 2018, 41(12): 99-103.)
doi: 10.16353/j.cnki.1000-7490.2018.12.018
|
[22] |
ISNI. FAQs[EB/OL]. [2023-07-12]. https://isni.org/page/faqs.
|
[23] |
Wikidata. VIAF ID[EB/OL]. [2023-07-13]. https://www.wikidata.org/wiki/Q19832964.
|
[24] |
贤信, 曾建勋. 科研实体唯一标识系统研究[J]. 图书情报工作, 2015, 59(12): 113-119.
doi: 10.13266/j.issn.0252-3116.2015.12.017
|
[24] |
(Xian Xin, Zeng Jianxun. Research on Identification Systems of Scientific Research Entity[J]. Library and Information Service, 2015, 59(12): 113-119.)
doi: 10.13266/j.issn.0252-3116.2015.12.017
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|