Automatic Expression of Co-occurrence Clustering Based on Indexing Rules of Medical Subject Headings
Wu Jinming1, Hou Yuefang2, Cui Lei2
1Institute of Medical Information/Medical Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, China
2College of Medical Informatics, China Medical University, Shenyang 110122, China
Abstract [Objective] This study proposes an automatic procedure for expressing the results of co-occurrence clustering, with the aim of advancing co-word clustering analysis. [Methods] First, we examined the indexing rules for neoplastic diagnosis and chose 10 common neoplasms as sample sets for co-occurrence clustering analysis. Second, we reviewed the clustering results against the indexing rules to identify the semantic type/subheading combination patterns of high-frequency subject headings. Third, we developed a Python application that automatically interprets the clustering results for four groups of neoplasms. Finally, we invited 12 experts to evaluate the accuracy, comprehensiveness, practicality, comprehensibility and simplicity of the presentation. [Results] We identified 30 indexing patterns for neoplastic diagnosis and 98 semantic combination patterns. The scores for accuracy, comprehensiveness, practicality, comprehensibility and simplicity were 4.282, 4.435, 4.209, 4.457, and 4.206 out of 5, respectively. [Limitations] The proposed method has difficulty revealing the “hidden relations” among subject headings. [Conclusions] The proposed method can effectively present the results of co-occurrence clustering analysis for medical records.
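The two ideas named in the Methods (counting co-occurring subject headings, then mapping a semantic type/subheading combination to a readable statement) can be illustrated with a minimal Python sketch. This is not the authors' application: the sample records, the `SEMANTIC_TYPE` lookup, the `TEMPLATES` table and the function names are all hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import combinations

# Hypothetical records: each set holds the "heading/subheading" terms
# indexed on one article. These are illustrative, not data from the study.
records = [
    {"Lung Neoplasms/diagnosis", "Tomography, X-Ray Computed/methods"},
    {"Lung Neoplasms/diagnosis", "Tomography, X-Ray Computed/methods",
     "Biomarkers, Tumor/blood"},
    {"Lung Neoplasms/diagnosis", "Biomarkers, Tumor/blood"},
]

def cooccurrence_counts(records):
    """Count how often each unordered pair of terms shares a record."""
    counts = Counter()
    for terms in records:
        # Sorting gives every pair a single canonical (a, b) ordering.
        for a, b in combinations(sorted(terms), 2):
            counts[(a, b)] += 1
    return counts

# Hypothetical lookup from subject heading to semantic type.
SEMANTIC_TYPE = {
    "Lung Neoplasms": "Neoplastic Process",
    "Tomography, X-Ray Computed": "Diagnostic Procedure",
}

# Hypothetical template keyed by two (semantic type, subheading) pairs.
TEMPLATES = {
    (("Neoplastic Process", "diagnosis"), ("Diagnostic Procedure", "methods")):
        "Use of {b} in the diagnosis of {a}",
}

def describe_pair(term_a, term_b):
    """Return a readable statement for a term pair, or None if no template fits."""
    for first, second in ((term_a, term_b), (term_b, term_a)):
        h1, s1 = first.rsplit("/", 1)
        h2, s2 = second.rsplit("/", 1)
        key = ((SEMANTIC_TYPE.get(h1), s1), (SEMANTIC_TYPE.get(h2), s2))
        template = TEMPLATES.get(key)
        if template:
            return template.format(a=h1, b=h2)
    return None

if __name__ == "__main__":
    for (a, b), n in cooccurrence_counts(records).most_common():
        print(n, a, "|", b, "->", describe_pair(a, b))
```

In this sketch a pair with no matching template returns `None`, which loosely mirrors the stated limitation that relations not covered by an explicit pattern remain unexpressed.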
Received: 16 March 2020
Published: 17 June 2020
Corresponding Author: Cui Lei (E-mail: lcui@cmu.edu.cn)