Data Analysis and Knowledge Discovery  2019, Vol. 3 Issue (2): 90-97    DOI: 10.11925/infotech.2096-3467.2018.0617
Construction of an Adverse Drug Reaction Extraction Model Based on Bi-LSTM and CRF
Xiaoxiao Zhu, Zunqi Yang, Jing Liu
Department of Management Information System, Tianjin University of Finance and Economics, Tianjin 300222, China
Abstract  

[Objective] This study proposes a method for handling non-standard social media text in order to improve the extraction of adverse drug reactions from social media. [Methods] The proposed Bi-LSTM-CRF method combines a bidirectional LSTM with a CRF and was implemented in TensorFlow. The LSTM exploits context information, while the CRF models the dependencies between output tags. An adverse drug reaction extraction model was built on Bi-LSTM-CRF. [Results] A series of experiments was carried out on a Twitter dataset. The proposed Bi-LSTM-CRF method achieved the highest F-measure (0.7963) for adverse drug reaction extraction among the compared methods, which also included CRF, forward LSTM, backward LSTM, and Bi-LSTM. [Limitations] The experiments used only one corpus, and the validity of the proposed method needs to be verified on other data sources. [Conclusions] Combining Bi-LSTM and CRF can effectively handle non-standard texts in social media. The model constructed in this paper can identify adverse drug reactions effectively and support relevant departments in decision-making.
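
The role the abstract assigns to the CRF layer, taking dependencies between output tags into account, can be illustrated with a minimal Viterbi decoding sketch. This is not the authors' TensorFlow implementation: the tag set (O, B-ADR, I-ADR), the emission scores standing in for Bi-LSTM outputs, and the transition scores are all hypothetical values chosen for illustration.

```python
from typing import List, Sequence


def viterbi_decode(emissions: Sequence[Sequence[float]],
                   transitions: Sequence[Sequence[float]]) -> List[int]:
    """Return the highest-scoring tag index sequence.

    emissions[t][j]   : score of tag j at token t (stand-in for Bi-LSTM output)
    transitions[i][j] : score of moving from tag i to tag j (the CRF part)
    """
    if not emissions:
        return []
    n_tags = len(emissions[0])
    scores = list(emissions[0])   # best score of any path ending in each tag
    backpointers = []             # backpointers[t-1][j] = best previous tag
    for t in range(1, len(emissions)):
        new_scores, pointers = [], []
        for j in range(n_tags):
            best_prev = max(range(n_tags),
                            key=lambda i: scores[i] + transitions[i][j])
            new_scores.append(scores[best_prev] + transitions[best_prev][j]
                              + emissions[t][j])
            pointers.append(best_prev)
        scores = new_scores
        backpointers.append(pointers)
    # Follow the backpointers from the best final tag to recover the path.
    path = [max(range(n_tags), key=lambda j: scores[j])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


# Hypothetical tag set and scores (assumptions, not taken from the paper).
TAGS = ["O", "B-ADR", "I-ADR"]
TRANSITIONS = [
    [0.0, 0.0, -10.0],  # from O: jumping straight to I-ADR is penalised
    [0.0, -1.0, 1.0],   # from B-ADR
    [0.0, -1.0, 1.0],   # from I-ADR
]
EMISSIONS = [           # three tokens, e.g. "got", "bad", "headache"
    [2.0, 0.1, 0.3],
    [0.5, 0.4, 1.0],
    [0.2, 0.3, 1.5],
]

# Per-token argmax ignores tag dependencies; Viterbi does not.
greedy = [max(range(len(TAGS)), key=lambda j: row[j]) for row in EMISSIONS]
decoded = viterbi_decode(EMISSIONS, TRANSITIONS)
print([TAGS[i] for i in greedy])
print([TAGS[i] for i in decoded])
```

With these made-up scores, choosing the best tag per token independently yields an I-ADR tag directly after O, which is not a valid span, whereas decoding with the transition scores yields a well-formed B-ADR, I-ADR span. In a trained model both score tables would be learned rather than set by hand.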

Key words: Social Media; Adverse Drug Reactions; CRF; LSTM
Received: 04 June 2018      Published: 27 March 2019

Cite this article:

Xiaoxiao Zhu, Zunqi Yang, Jing Liu. Construction of an Adverse Drug Reaction Extraction Model Based on Bi-LSTM and CRF. Data Analysis and Knowledge Discovery, 2019, 3(2): 90-97.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2018.0617     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2019/V3/I2/90
