Data Analysis and Knowledge Discovery
Research on Medical Named Entity Recognition with Word Information
Ben Yanyan, Pang Xueqin
(School of Mathematics and Statistics, Huazhong University of Science and Technology, 430074, China) (Archives of Wuhan University of Science and Technology, 430081, China)
Abstract  

[Objective] The boundaries of named entities are difficult to identify. This study integrates word information to improve the recognition and inference of key clinical features in online consultation records.

[Methods] The model is built on MacBERT and a conditional random field (CRF). Word information, such as word position and part of speech, is incorporated through positional "soft" embeddings, and dialogue information is introduced through speaker role embeddings. A weighted multi-class cross-entropy loss is used to address the imbalance among entity categories.
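The weighted multi-class cross-entropy mentioned above can be illustrated with a minimal sketch. The abstract does not state how the class weights are chosen or how this loss is combined with the CRF layer, so the inverse-frequency weighting, the function names, and the toy tag ids below are assumptions made for illustration only, not the authors' implementation.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels, num_classes):
    """Assumed weighting scheme: weight_c = N / (K * count_c).

    Rare classes get larger weights; classes absent from `labels` get 0.
    """
    counts = Counter(labels)
    total = len(labels)
    return [
        total / (num_classes * counts[c]) if counts[c] else 0.0
        for c in range(num_classes)
    ]


def weighted_cross_entropy(logits, labels, weights):
    """Weighted mean of per-token negative log-likelihoods.

    logits:  list of per-token score lists, one score per class
    labels:  list of gold class ids, one per token
    weights: list of per-class weights
    """
    loss_sum = 0.0
    weight_sum = 0.0
    for row, gold in zip(logits, labels):
        # Numerically stable log-sum-exp over the class scores.
        top = max(row)
        log_z = top + math.log(sum(math.exp(v - top) for v in row))
        nll = log_z - row[gold]
        loss_sum += weights[gold] * nll
        weight_sum += weights[gold]
    return loss_sum / weight_sum


# Toy example: class 0 stands for a frequent tag (such as "O"),
# class 1 for a rare entity tag. Both ids are hypothetical.
toy_labels = [0, 0, 0, 1]
toy_logits = [[2.0, 0.0], [1.5, 0.5], [1.0, 0.0], [0.0, 1.0]]
toy_weights = inverse_frequency_weights(toy_labels, num_classes=2)
toy_loss = weighted_cross_entropy(toy_logits, toy_labels, toy_weights)
```

With these weights, an error on the single rare-class token contributes as much to the loss as errors on all three frequent-class tokens together, which is the intended effect of reweighting under class imbalance.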

[Results] In an empirical study on online consultation records from Chunyu Doctor, the proposed model achieved an F1 score of 74.35% on the named entity recognition task, an increase of nearly 2%.

[Limitations] No model was designed specifically for Chinese word segmentation.

[Conclusions] Compared with using the MacBERT model directly, incorporating additional feature dimensions such as word information effectively improves the model's ability to recognize key features of clinical findings.


Key words: Chinese entity recognition; Online medical question answering; Word information embedding; MacBERT; Weighted cross entropy
Published: 29 July 2022
ZTFLH:  TP393,G250  

Cite this article:

Ben Yanyan, Pang Xueqin. Research on Medical Named Entity Recognition with Word Information. Data Analysis and Knowledge Discovery, 0, (): 1-.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2022-0547     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y0/V/I/1

[1] Ben Yanyan, Pang Xueqin. Identifying Medical Named Entities with Word Information[J]. Data Analysis and Knowledge Discovery, 2023, 7(5): 123-132.