Automatic Expression of Co-occurrence Clustering Based on Indexing Rules of Medical Subject Headings
Wu Jinming1,Hou Yuefang2,Cui Lei2()
1Institute of Medical Information/Medical Library, Chinese Academy of Medical Science & Peking Union Medical College, Beijing 100020, China 2College of Medical Informatics, China Medical University, Shenyang 110122, China
[Objective] This study proposes an automatic procedure to present the clustering results, aiming to promote the development of co-word clustering analysis.[Methods] First, we examined the indexing rules of neoplastic diagnosis and chose 10 common neoplasms as sample sets for co-occurrence clustering analysis. Then, we reviewed the results and combined the indexing rules to identify the semantic types / subheading combination patterns of high-frequency subject headings. Third, we developed a python application to automatically interpret the clustering results for four groups of neoplasms. Finally, we invited 12 experts to evaluate the accuracy, comprehensiveness, practicality, comprehensibility and simplicity of the presentation.[Results] We found 30 indexing patterns of neoplastic diagnosis as well as 98 combination semantic patterns. The scores of the accuracy, comprehensiveness, practicality, comprehensibility and simplicity were 4.282, 4.435, 4.209, 4.457, and 4.206 out of 5.[Limitations] It was difficult to reveal the “hidden relations” among the subject headings with the proposed method.[Conclusions] Our new method could effectively present results of co-occurrence clustering analysis for medical records.
邬金鸣,侯跃芳,崔雷. 基于医学主题词标引规则的词共现聚类分析结果自动判读和表达的研究[J]. 数据分析与知识发现, 2020, 4(9): 133-144.
Wu Jinming,Hou Yuefang,Cui Lei. Automatic Expression of Co-occurrence Clustering Based on Indexing Rules of Medical Subject Headings. Data Analysis and Knowledge Discovery, 2020, 4(9): 133-144.
( Cui Lei, Sui Mingshuang. Study on an Approach to Presenting the Co-word Clustering Analysis Results[J]. Journal of Library Science in China, 2015,34(12):1270-1277.)
Zhou Q J, Leng F H, Leydesdorff L. The Reflection of Hierarchical Cluster Analysis of Co-occurrence Matrices in SPSS[J]. Chinese Journal of Library and Information Science, 2015,8(2):11-24.
Rasmussen M, Karypis G. gCLUTO-An Interactive Clustering, Visualization, and Analysis System[R]. UMN-CS TR-04-021, 2004.
Song Y, Liu B, Chen X, et al. Atmospheric Pollution Mapping of the Yangtze River Basin: An AQI-based Weighted Co-word Analysis[J]. International Journal of Environmental Research and Public Health, 2020,17(3):817.
Xing Y N, Wang Y B, Zhang W, et al. The Hotspots Analysis of Education and Management of Childhood Asthma Based on Cluster Analysis Method[J]. Studies in Health Technology and Informatics, 2019,264:1618-1619.
Yang A L, Lv Q Q, Chen F, et al. Identification of Recent Trends in Research on Vitamin D: A Quantitative and Co-word Analysis[J]. Medical Science Monitor: International Medical Journal of Experimental and Clinical Research, 2019,25:643-655.
Callon M, Courtial J P, Turner W A, et al. From Translations to Problematic Networks: An Introduction to Co-word Analysis[J]. Social Science Information, 1983,22(2):191-235.
Ding Y, Chowdhury G G, Foo S. Bibliometric Cartography of Information Retrieval Research by Using Co-word Analysis[J]. Information Processing & Management, 2001,37(6):817-842.
( Cui Lei, Liu Wei, Yan Lei, et al. Development of a Text Mining System Based on the Co-occurrence of Bibliographic Items in Literature Databases[J]. New Technology of Library and Information Service, 2008(8):70-75.)
( Li Fan, Li Min, Wang Li, et al. Mining Research Hotpots of International Nursing Informatics by Co-word Analysis[J]. Journal of Medical Informatics, 2014,35(9):48-53.)
Miñarro-Giménez J A, Kreuzthaler M, Schulz S. Knowledge Extraction from MEDLINE by Combining Clustering with Natural Language Processing[J]. AMIA Annual Symposium Proceedings, 2015: 915-924.
( Qian Qing, Li Junlian. Knowledge Management of Chinese Biomedical Literature Database[J]. Journal of Medical Intelligence, 2004,25(5):347-349.)
Cimino J J, Barnett G O. Automatic Knowledge Acquisition from MEDLINE[J]. Methods of Information in Medicine, 1993,32(2):120-130.
Wang L Q, Del Fiol G, Bray B E, et al. Generating Disease-pertinent Treatment Vocabularies from MEDLINE Citations[J]. Journal of Biomedical Informatics, 2017,65:46-57.