An Integrated Platform for Food Safety Incident Entities Based on Deep Learning
Hu Haotian1,2,Ji Jinfeng3,Wang Dongbo3,4(),Deng Sanhong1,2
1School of Information Management, Nanjing University, Nanjing 210023, China 2Jiangsu Key Laboratory of Data Engineering and Knowledge Service, Nanjing 210023, China 3School of Information Management, Nanjing Agricultural University, Nanjing 210095, China 4Research Center for Correlation of Domain Knowledge, Nanjing Agricultural University,Nanjing 210095, China
[Objective] This paper tries to promote the national administration of food safety, and strengthen the prediction, warning and response of related emergencies. It not only facilitates research, but also informs the public on food safety issues concisely and intuitively. [Methods] We collected news reports on food safety incidents from leading websites and constructed a corpus for the food safety incident entities through data cleansing, annotation, and organization. Then, we compared performance of Bi-LSTM, Bi-LSTM-CRF, IDCNN, IDCNN-CRF and BERT models on entity recognition. [Results] In the 10-fold cross validation, the highest F-score of the BERT model reached 81.39%, while its average F-score was 5.50% and 2.58% higher than those of IDCNN-CRF and Bi-LSTM-CRF models respectively. We built the integrated presentation platform for food safety incident entities based on the Bi-LSTM-CRF model. [Limitations] More research is needed to identify location entities from complex administrative regions. [Conclusions] The constructed platform supports policy formulation and food industry administration.
胡昊天,吉晋锋,王东波,邓三鸿. 基于深度学习的食品安全事件实体一体化呈现平台构建*[J]. 数据分析与知识发现, 2021, 5(3): 12-24.
Hu Haotian,Ji Jinfeng,Wang Dongbo,Deng Sanhong. An Integrated Platform for Food Safety Incident Entities Based on Deep Learning. Data Analysis and Knowledge Discovery, 2021, 5(3): 12-24.
( Huang Shuiqing, Wang Dongbo, He Lin. Research on Constructing Automatic Recognition Model for Ancient Chinese Place Names Based on Pre-Qi Corpus[J]. Library and Information Service, 2015,59(12):135-140.)
( Jiang Meihui, An Haizhong, Gao Xiangyun, et al. The Visualization and Analysis of News Texts About Food Safety Incidents Based on Complex Networks[J]. Journal of Intelligence, 2015,34(12):121-127.)
( Wang Dongbo, Wu Yi, Ye Wenhao, et al. Extracting Events of Food Safety Emergencies with Characteristics Knowledge[J]. Data Analysis and Knowledge Discovery, 2017,1(3):54-61.)
( Xiang Xiaowen, Shi Xiaodong, Zeng Hualin. Chinese Named Entity Recognition System Using Statistics-Based and Rules-Based Method[J]. Journal of Computer Applications, 2005,25(10):2404-2406.)
[6]
赵军. 命名实体识别、排歧和跨语言关联[J]. 中文信息学报, 2009,23(2):3-17.
[6]
( Zhao Jun. A Survey on Named Entity Recognition, Disambiguation and Cross-Lingual Co-reference Resolution[J]. Journal of Chinese Information Processing, 2009,23(2):3-17.)
( Zhang Jian, Wu Qing, Yang Xinyi, et al. Chinese Agricultural Named Entity Recognition Based on Conditional Random Fields[J]. Computer and Modernization, 2018(1):123-126.)
( Qiao Wei, Sun Maosong. Joint Chinese Word Segmentation and Named Entity Recognition Based on Max-Margin Markov Networks[J]. Journal of Tsinghua University (Science and Technology), 2010,50(5):758-762, 767.)
( Huang Shilin, Zheng Xiaolin, Chen Deren. A Semi-Supervised Learning Method for Product Named Entity Recognition[J]. Journal of Beijing University of Posts and Telecommunications, 2013,36(2):20-23, 54.)
[10]
王国昱. 基于深度学习的中文命名实体识别研究[D]. 北京: 北京工业大学, 2015.
[10]
( Wang Guoyu. Research of Chinese Named Entity Recognition Based on Deep Learning[D]. Beijing: Beijing University of Technology, 2015.)
( Xu Chenfei, Ye Haiying, Bao Ping. Automatic Recognition of Produce Entities from Local Chronicles with Deep Learning[J]. Data Analysis and Knowledge Discovery, 2020,4(8):86-97.)
( Shen Si, Zhu Danhao. Chinese Place Name Recognition Based on Deep Learning[J]. Transactions of Beijing Institute of Technology, 2017,37(11):1150-1155.)
[14]
Lafferty J D, McCallum A, Pereira F C N. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data[C]// Proceedings of the 18th International Conference on Machine Learning. 2001: 282-289.
[15]
Pham T H, Phuong L H . End-to-End Recurrent Neural Network Models for Vietnamese Named Entity Recognition: Word-level vs. Character-level[C]// Proceedings of the 15th International Conference of the Pacific Association for Computational Linguistics. Springer, 2017: 534-542.
[16]
Lample G, Ballesteros M, Subramanian S, et al. Neural Architectures for Named Entity Recognition[C]// Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.
[17]
Strubell E, Verga P, Belanger D, et al. Fast and Accurate Entity Recognition with Iterated Dilated Convolutions[OL]. arXiv Preprint, arXiv: 1702.02098.
[18]
Collobert R, Weston J, Bottou L, et al. Natural Language Processing (Almost) from Scratch[J]. Journal of Machine Learning Research, 2011,12(1):2493-2537.
[19]
Devlin J, Chang M W, Lee K, et al. Bert: Pre-training of Deep Bidirectional Transformers for Language Understanding[C]// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019.
[20]
Vaswani A, Shazeer N, Parmar N, et al. Attention is All You Need[C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017: 5998-6008.