|
|
Recognizing Chinese Medical Literature Entities Based on Multi-Task and Transfer Learning |
Han Pu1,2(),Gu Liang1,Ye Dongyu1,Chen Wenqi1 |
1School of Management, Nanjing University of Posts & Telecommunications, Nanjing 210003, China 2Jiangsu Provincial Key Laboratory of Data Engineering and Knowledge Service, Nanjing 210023, China |
|
|
Abstract [Objective] This paper uses transfer learning and multi-task learning to solve the problems of cold start and boundary in Chinese medical literature entity recognition, and further improve the recognition accuracy. [Methods] Firstly, we constructed a hybrid deep learning BERT-BiLSTM-IDCNN-CRF medical literature entity recognition model. Secondly, based on transfer learning, the medical semantic features were enriched through instance, model and feature transfer. Thirdly, we constructed a coarse-grained three-classification task through multi-task learning to assist the main task in utilizing the entity boundary information effectively. Finally, we introduced the self-attention mechanism and highway network to capture global information, optimize deep network training and establish the TLMT-BBIC-HS model. [Results] The model had an F1 value of 92.98% on the Chinese diabetes medical literature dataset, which is 15.99% and 16.44% higher than the benchmark models BERT-BiLSTM-CRF and BERT-IDCNN-CRF. [Limitations] The domain suitability of this model needs to be verified. [Conclusions] The TLMT-BBIC-HS model can transfer and share medical knowledge, which is more suitable for Chinese medical Literature entity recognition. It could effectively extract medical information and construct knowledge graphs and question answering systems.
|
Received: 03 August 2022
Published: 21 March 2023
|
|
Fund:The National Social Science Fund of China(22BTQ096) |
Corresponding Authors:
Han Pu, ORCID:0000-0001-5867-4292,E-mail: hanpu@njupt.edu.cn。
|
[1] |
赵旸, 张智雄, 刘欢, 等. 基于BERT模型的中文医学文献分类研究[J]. 数据分析与知识发现, 2020, 4(8): 41-49.
|
[1] |
(Zhao Yang, Zhang Zhixiong, Liu Huan, et al. Classification of Chinese Medical Literature with BERT Model[J]. Data Analysis and Knowledge Discovery, 2020, 4(8): 41-49.)
|
[2] |
李跃艳, 王昊, 邓三鸿, 等. 面向事件本体的医学文本语义关联化研究[J]. 情报学报, 2022, 41(5): 497-511.
|
[2] |
(Li Yueyan, Wang Hao, Deng Sanhong, et al. Research on Semantic Relevance of Medical Text Oriented to Event Ontology[J]. Journal of the China Society for Scientific and Technical Information, 2022, 41(5): 497-511.)
|
[3] |
Coden A, Savova G, Sominsky I, et al. Automatically Extracting Cancer Disease Characteristics from Pathology Reports into a Disease Knowledge Representation Model[J]. Journal of Biomedical Informatics, 2009, 42(5): 937-949.
doi: 10.1016/j.jbi.2008.12.005
pmid: 19135551
|
[4] |
Jiang M, Chen Y K, Liu M, et al. A Study of Machine-Learning-Based Approaches to Extract Clinical Entities and Their Assertions from Discharge Summaries[J]. Journal of the American Medical Informatics Association, 2011, 18(5): 601-606.
doi: 10.1136/amiajnl-2011-000163
pmid: 21508414
|
[5] |
Liu Z J, Yang M, Wang X L, et al. Entity Recognition from Clinical Texts via Recurrent Neural Network[J]. BMC Medical Informatics and Decision Making, 2017, 17(Suppl 2): Article No.67.
|
[6] |
Gajendran S, Manjula D, Sugumaran V. Character Level and Word Level Embedding with Bidirectional LSTM—Dynamic Recurrent Neural Network for Biomedical Named Entity Recognition from Literature[J]. Journal of Biomedical Informatics, 2020, 112: Article No.103609.
|
[7] |
Li X Y, Zhang H, Zhou X H. Chinese Clinical Named Entity Recognition with Variant Neural Structures Based on BERT Methods[J]. Journal of Biomedical Informatics, 2020, 107: Article No.103422.
|
[8] |
吕江海, 杜军平, 周南, 等. 基于膨胀卷积迭代与注意力机制的实体名识别方法[J]. 计算机工程, 2021, 47(1): 58-65.
doi: 10.19678/j.issn.1000-3428.0055986
|
[8] |
(Lü Jianghai, Du Junping, Zhou Nan, et al. Entity Name Recognition Method Based on Dilated Convolutional Iterative and Attention Mechanism[J]. Computer Engineering, 2021, 47(1): 58-65.)
doi: 10.19678/j.issn.1000-3428.0055986
|
[9] |
Giorgi J M, Bader G D. Transfer Learning for Biomedical Named Entity Recognition with Neural Networks[J]. Bioinformatics, 2018, 34(23): 4087-4094.
doi: 10.1093/bioinformatics/bty449
pmid: 29868832
|
[10] |
Smetanin S, Komarov M. Deep Transfer Learning Baselines for Sentiment Analysis in Russian[J]. Information Processing & Management, 2021, 58(3): Article No.102484.
|
[11] |
Fu W L, Xue B, Gao X Y, et al. Transductive Transfer Learning Based Genetic Programming for Balanced and Unbalanced Document Classification Using Different Types of Features[J]. Applied Soft Computing, 2021, 103: Article No.107172.
|
[12] |
Mignone P, Pio G, D’Elia D, et al. Exploiting Transfer Learning for the Reconstruction of the Human Gene Regulatory Network[J]. Bioinformatics, 2020, 36(5): 1553-1561.
doi: 10.1093/bioinformatics/btz781
pmid: 31608946
|
[13] |
熊欣, 王昊, 邓三鸿. 面向方志知识图谱的术语抽取模型迁移学习研究[J]. 情报理论与实践, 2021, 44(4): 176-184.
|
[13] |
(Xiong Xin, Wang Hao, Deng Sanhong. A Study on Term Extraction Model with Transfer Learning for Knowledge Graph of Local Chronicles[J]. Information Studies: Theory & Application, 2021, 44(4): 176-184.)
|
[14] |
韩普, 张展鹏, 张伟. 基于多任务学习和多态语义特征的中文疾病名称归一化研究[J]. 情报学报, 2021, 40(11): 1234-1244.
|
[14] |
(Han Pu, Zhang Zhanpeng, Zhang Wei. Chinese Disease Name Normalization Based on Multi-Task Learning and Polymorphic Semantic Features[J]. Journal of the China Society for Scientific and Technical Information, 2021, 40(11): 1234-1244.)
|
[15] |
Crichton G, Pyysalo S, Chiu B, et al. A Neural Network Multi-Task Learning Approach to Biomedical Named Entity Recognition[J]. BMC Bioinformatics, 2017, 18(1): 1-14.
|
[16] |
Wu C C, Luo G, Guo C, et al. An Attention-Based Multi-Task Model for Named Entity Recognition and Intent Analysis of Chinese Online Medical Questions[J]. Journal of Biomedical Informatics, 2020, 108: Article No.103511.
|
[17] |
Aguilar G, Maharjan S, López-Monroy A P, et al. A Multi-Task Approach for Named Entity Recognition in Social Media Data[OL]. arXiv Preprint, arXiv: 1906.04135.
|
[18] |
Wang D S, Fan H J, Liu J F. Learning with Joint Cross-Document Information via Multi-Task Learning for Named Entity Recognition[J]. Information Sciences, 2021, 579: 454-467.
doi: 10.1016/j.ins.2021.08.015
|
[19] |
Srivastava R K, Greff K, Schmidhuber J. Highway Networks[OL]. arXiv Preprint, arXiv: 1505.00387.
|
[20] |
Zuo M, Zhang Y. Dataset-Aware Multi-Task Learning Approaches for Biomedical Named Entity Recognition[J]. Bioinformatics, 2020, 36(15): 4331-4338.
doi: 10.1093/bioinformatics/btaa515
pmid: 32415963
|
[21] |
Liu L Y, Shang J B, Ren X A, et al. Empower Sequence Labeling with Task-Aware Neural Language Model[C]// Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018, 32(1): 5253-5260.
|
[22] |
Narayanan S, Achan P, Rangan P V, et al. Unified Concept and Assertion Detection Using Contextual Multi-Task Learning in a Clinical Decision Support System[J]. Journal of Biomedical Informatics, 2021, 122: Article No.103898.
|
[23] |
王东波, 刘畅, 朱子赫, 等. SikuBERT与SikuRoBERTa:面向数字人文的《四库全书》预训练模型构建及应用研究[J]. 图书馆论坛, 2022, 42(6): 31-43.
|
[23] |
(Wang Dongbo, Liu Chang, Zhu Zihe, et al. Construction and Application of Pre-Trained Models of Siku Quanshu in Orientation to Digital Humanities[J]. Library Tribune, 2022, 42(6): 31-43.)
|
[24] |
Aliyun. A Labeled Chinese Dataset for Diabetes[EB/OL]. [2022-06-28]. https://tianchi.aliyun.com/competition/entrance/231687/information.
|
[25] |
Aya Mohamed Abdelaty Elkased. 面向生物医学文献的基于BioBERT的药品相互作用抽取增强模型[D]. 哈尔滨: 哈尔滨工业大学, 2021.
|
[25] |
(Aya Mohamed Abdelaty Elkased. Enhanced Drug-Drug Interaction Extraction Model from Biomedical Text Using BioBERT[D]. Harbin: Harbin Institute of Technology, 2021.)
|
[26] |
何春辉, 王梦贤, 何小波. 基于双层Bi-LSTM-CRF模型的糖尿病领域命名实体识别[J]. 邵阳学院学报(自然科学版), 2020, 17(1): 21-26.
|
[26] |
He Chunhui, Wang Mengxian, He Xiaobo. Named Entity Recognition in the Field of Diabetes Based on Double-layer Bi-LSTM-CRF Model[J]. Journal of Shaoyang University (Natural Science Edition), 2020, 17(1): 21-26.)
|
[27] |
Shang F J, Ran C F. An Entity Recognition Model Based on Deep Learning Fusion of Text Feature[J]. Information Processing & Management, 2022, 59(2): Article No.102841.
|
[28] |
Deng J F, Cheng L L, Wang Z W. Self-Attention-Based BiGRU and Capsule Network for Named Entity Recognition[OL]. arXiv Preprint, arXiv: 2002.00735.
|
[29] |
Wang Y, Sun Y N, Ma Z C, et al. Named Entity Recognition in Chinese Medical Literature Using Pretraining Models[J]. Scientific Programming, 2020, 2020: Article No.8812754.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|