1 School of Management, Shanxi Medical University, Taiyuan 030000, China
2 School of Humanities and Social Sciences, Shanxi Medical University, Taiyuan 030000, China
[Objective] This paper proposes a new knowledge discovery method for social media data, aiming to predict topic association opportunities and emerging topics in medicine. [Methods] We combined the Co-LDA topic model with link prediction algorithms to identify topic association opportunities, and tested the method on social media data about diabetes drugs. [Results] The AUC of link prediction on the unweighted topic co-occurrence network was higher than that on the weighted network, and the Katz index performed best. Future research on diabetes drugs is most likely to concern improvements in pharmacodynamic research and treatment plans. Emerging topics were related to the development of the pharmaceutical industry and to new drug indications. [Limitations] We did not conduct multi-level analysis along the sentiment and time dimensions; the algorithm is also complex and performed poorly on networks with poor connectivity. [Conclusions] The proposed method can effectively predict topic association opportunities in the field of medicine.
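To make the link prediction step concrete, the following is a minimal sketch of the Katz index and the AUC evaluation named in the abstract, assuming a toy five-topic co-occurrence network. The function names `katz_index` and `link_auc`, the default damping factor, and the toy edges are illustrative assumptions, not the authors' implementation or data.

```python
import numpy as np

def katz_index(adj, beta=None):
    """Katz similarity S = (I - beta*A)^-1 - I for a symmetric adjacency matrix."""
    adj = np.asarray(adj, dtype=float)
    n = adj.shape[0]
    # The Katz series converges only when beta < 1 / (largest eigenvalue of A).
    lam_max = np.max(np.abs(np.linalg.eigvalsh(adj)))
    if beta is None:
        # Assumed default: half of the convergence bound.
        beta = 0.5 / lam_max if lam_max > 0 else 0.0
    elif beta * lam_max >= 1:
        raise ValueError("beta must be smaller than 1 / lambda_max")
    return np.linalg.inv(np.eye(n) - beta * adj) - np.eye(n)

def link_auc(scores, held_out, non_edges):
    """Share of (held-out edge, non-edge) pairs where the held-out edge
    scores higher; ties count as 0.5."""
    wins = 0.0
    for i, j in held_out:
        for u, v in non_edges:
            if scores[i, j] > scores[u, v]:
                wins += 1.0
            elif scores[i, j] == scores[u, v]:
                wins += 0.5
    return wins / (len(held_out) * len(non_edges))

# Toy training network: five topics linked in a chain 0-1-2-3-4 (hypothetical).
A = np.zeros((5, 5))
for a, b in [(0, 1), (1, 2), (2, 3), (3, 4)]:
    A[a, b] = A[b, a] = 1.0

S = katz_index(A)
# Pretend the link (0, 2) was held out for testing and (0, 4) never existed.
auc = link_auc(S, held_out=[(0, 2)], non_edges=[(0, 4)])
print(f"Katz score (0,2) = {S[0, 2]:.4f}, (0,4) = {S[0, 4]:.4f}, AUC = {auc:.2f}")
```

The closed-form inverse is practical only for small networks. A weighted variant would replace the 0/1 entries of `A` with co-occurrence strengths, which is the comparison the results describe.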