Automatic Question-Answering in Chinese Medical Q & A Community with Knowledge Graph
Wang Yinqiu1,2,Yu Wei1,2(),Chen Junpeng3
1Jiangsu Key Laboratory of Data Engineering and Knowledge Service, Nanjing 210023, China 2School of Information Management, Nanjing University, Nanjing 210023, China 3College of Information Engineering, Nanjing University of Finance & Economics, Nanjing 210023, China
[Objective] This paper proposes a new method to determine the reliability of answers from the online Chinese medical question and answer (Q&A) community, aiming to enhance the accuracy of answer selection models for medical Q&A recognition with the help of professional medical knowledge graphs. [Methods] Based on the answer selection model using a hybrid neural network (fusing RNN and multi-scale CNN to capture context and local information), we constructed a professional medical knowledge graph that integrated entity and relationship embeddings to enrich the semantic information of the Q&A text. Combined with the Q&A pair attention mechanism, we obtained the final similarity of the pairs and selected candidate answers with the highest scores. [Results] We examined the proposed model on the cMedQA2.0 dataset. Compared to the hybrid neural network model without incorporating knowledge graph entity relationship, the Top-1 accuracy of the answer selection in our new model increased by 2.3% (to 62.2%), demonstrating its effectiveness for improving answer selection. [Limitations] The medical knowledge graph used is of small size, only including the common entities in the medical community Q&A. The incomplete relationship between medical entities may affect the answer selection effectiveness when facing niche questions. [Conclusions] Combining professional Chinese medical knowledge graphs and deep learning models could improve the answer selection technology. It helps people with medical consultation needs obtain reliable medical advice in the Q & A community. Our model also monitors the online medical community’s information quality and reduces the burden of hospital outpatient service.
王寅秋, 虞为, 陈俊鹏. 融合知识图谱的中文医疗问答社区自动问答研究*[J]. 数据分析与知识发现, 2023, 7(3): 97-109.
Wang Yinqiu, Yu Wei, Chen Junpeng. Automatic Question-Answering in Chinese Medical Q & A Community with Knowledge Graph. Data Analysis and Knowledge Discovery, 2023, 7(3): 97-109.
( Li Ming, Li Ying, Zhou Qing, et al. Analyzing Knowledge Demand and Supply of Community Question Answering with TF-PIDF[J]. Data Analysis and Knowledge Discovery, 2021, 5(2):106-115.)
( Shi Jing, Li Chenlu, Qian Yuxing, et al. Information Needs of Domestic and International HCQA Users——An Empirical Analysis[J]. Data Analysis and Knowledge Discovery, 2019, 3(5):1-10.)
[4]
Kalchbrenner N, Grefenstette E, Blunsom P. A Convolutional Neural Network for Modelling Sentences[C]// Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1:Long Papers). 2014: 655-665.
[5]
Tan M, dos Santos C, Xiang B, et al. LSTM-Based Deep Learning Models for Non-Factoid Answer Selection[OL]. arXiv Preprint, arXiv:1511.04108.
[6]
Zhang S, Zhang X, Wang H, et al. Multi-scale Attentive Interaction Networks for Chinese Medical Question Answer Selection[J]. IEEE Access, 2018, 6:74061-74071.
doi: 10.1109/Access.6287639
[7]
Deng Y, Xie Y X, Li Y L, et al. Multi-task Learning with Multi-View Attention for Answer Selection and Knowledge Base Question Answering[C]// Proceedings of the AAAI Conference on Artificial Intelligence. 2019:6318-6325.
[8]
Bilotti M W, Ogilvie P, Callan J, et al. Structured Retrieval for Question Answering[C]// Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2007:351-358.
[9]
Shen D, Lapata M. Using Semantic Roles to Improve Question Answering[C]// Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 2007:12-21.
[10]
Heilman M, Smith N A. Tree Edit Models for Recognizing Textual Entailments, Paraphrases, and Answers to Questions[C]// Proceedings of the 2010 Annual Conference of the North American Chapter of the Association of Computational Linguistics. 2010:1011-1019.
[11]
Wang M Q, Smith N A, Mitamura T. What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA[C]// Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 2007:22-32.
[12]
Lai A, Hockenmaier J. Illinois-LH: A Denotational andDistributional Approach to Semantics[C]// Proceedings of the 8th International Workshop on Semantic Evaluation. 2014: 329-334.
[13]
Yao X C, van Durme B, Callison-Burch C, et al. Semi-Markov Phrase-Based Monolingual Alignment[C]// Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. 2013:590-600.
[14]
Feng M W, Xiang B, Glass M R, et al. Applying Deep Learning to Answer Selection: A Study and an Open Task[C]// Proceedings of 2015 IEEE Workshop on Automatic Speech Recognition and Understanding. 2015:813-820.
[15]
Qiu X P, Huang X J. Convolutional Neural Tensor Network Architecture for Community-Based Question Answering[C]// Proceedings of the 24th International Conference on Artificial Intelligence. 2015:1305-1311.
[16]
Hochreiter S, Schmidhuber J. Long Short-Term Memory[J]. Neural Computation, 1997, 9(8):1735-1780.
doi: 10.1162/neco.1997.9.8.1735
pmid: 9377276
[17]
Zhang T X, Ren Y Q, Tadessem M M, et al. Bi-directional CapsuleNetwork Model for Chinese Biomedical Community Question Answering[C]// Proceedings of the 8th CCF International Conference on Natural Language Processing and Chinese Computing. 2019:105-116.
[18]
Xiang Y, Chen Q C, Wang X L, et al. Answer Selection in Community Question Answering via Attentive Neural Networks[J]. IEEE Signal Processing Letters, 2017, 24(4):505-509.
doi: 10.1109/LSP.2017.2673123
[19]
Song Y, Hu Q V, He L. P-CNN: Enhancing Text Matching with Positional Convolutional Neural Network[J]. Knowledge-Based Systems, 2019, 169(C):67-79.
doi: 10.1016/j.knosys.2019.01.028
[20]
Chen X C, Yang Z Y, Liang N Y, et al. Co-Attention Fusion Based Deep Neural Network for Chinese Medical Answer Selection[J]. Applied Intelligence, 2021, 51(10):6633-6646.
doi: 10.1007/s10489-021-02212-w
[21]
Deng Y, Xie Y X, Li Y L, et al. Contextualized Knowledge-Aware Attentive Neural Network: Enhancing Answer Selection with Knowledge[J]. ACM Transactions on Information Systems, 2022, 40(1):Article No.2.
( Li He, Liu Jiayu, Li Shiyu, et al. Optimizing Automatic Question Answering System Based on Disease Knowledge Graph[J]. Data Analysis and Knowledge Discovery, 2021, 5(5):115-126.)
( Hu Zhengyin, Liu Leilei, Dai Bing, et al. Discovering Subject Knowledge in Life and Medical Sciences with Knowledge Graph[J]. Data Analysis and Knowledge Discovery, 2020, 4(11):1-14.)