Please wait a minute...
Data Analysis and Knowledge Discovery  2023, Vol. 7 Issue (12): 64-74    DOI: 10.11925/infotech.2096-3467.2022.0549
Current Issue | Archive | Adv Search |
Event Detection Model Based on Semantic Information Fusion
Wei Jianxiang1(),Lu Qian2,Han Pu1,Huang Weidong1,3
1School of Management, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
2School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
3Key Research Base of Philosophy and Social Sciences in Jiangsu-Information Industry Integration Innovation and Emergency Management Research Center, Nanjing 210003, China
Download: PDF (1702 KB)   HTML ( 11
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper aims to improve the accuracy of event detection tasks by fusing semantic information. [Methods] First, we stored the non-relational semantic information with an initial word vector and encoded them with the Bi-LSTM model to aggregate their contexts. Then, we developed a relation graph based on relational semantic information. Third, we used a multi-scale convolutional neural network to capture the spatial information from the adjacency matrix and fuse it with the word vector. Finally, we built a Gate-GCN model to aggregate relational semantic information between adjacent word vectors to enhance their representation ability. [Results] We examined the new model with the ACE05 benchmark dataset. Our method’s F1 value reached 76.3%, which was 1.2% higher than the existing mainstream models. [Limitations] The proposed model needs to be validated with general datasets. [Conclusions] Fusion of multiple types of semantic information can effectively improve the event detection performance.

Key wordsEvent Detection      Information Extraction      Multi-Semantic Fusion      Gated Linear Unit      Graph Convolutional Neural Network     
Received: 30 May 2022      Published: 30 March 2023
ZTFLH:  TP391  
  G35  
Fund:National Social Science Fund of China(17CTQ022);National Natural Science Foundation of China(7227011403);Major Project of Philosophy and Social Science Research of Jiangsu Universities(2020SJZDA102)
Corresponding Authors: Wei Jianxiang,ORCID:0000-0001-9052-9212,E-mail:jxwei@njupt.edu.cn。   

Cite this article:

Wei Jianxiang, Lu Qian, Han Pu, Huang Weidong. Event Detection Model Based on Semantic Information Fusion. Data Analysis and Knowledge Discovery, 2023, 7(12): 64-74.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2022.0549     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2023/V7/I12/64

S1:North Korea’s military may have fired a loser at a U.S helicopter.
Words
North Korea’s military
U.S
Entity types
PER:Group
GPE:Nation
Words
North Korea’s military
Loser
helicopter
Event arguments
Attacker
Instrument
Target
Trigger:fired
Event type:Attack
Event Detection Example Sentences Containing Multiple Types of Semantic Information
Event Detection Model Based on Multi-Semantic Information Fusion
Relationship Graph and Adjacency Matrix
超参数名 参数值 超参数名 参数值
初始词向量维度 100 批量大小 30
事件要素向量维度 100 最大句子长度 50
实体类型向量维度 25 Bi-LSTM层数 1
关系类型向量维度 50 GCN层数 2
Bi-LSTM隐藏层维度 100 学习率 0.1
GCN隐藏层维度 200 偏置项参数α 5
Hyperparameter Settings of the Model
网络层 FLOPs Params
CNN 9.02MMac 22.55k
Bi-LSTM 6.07MMac 301.6k
Gated-GCN 11.6MMac 52.75k
Model Complexity Analysis
模型 P/% R/% F1/%
Cross-entity 72.9 64.3 68.3
MaxEntropy 74.5 59.1 65.9
PSL 75.3 64.4 69.4
DMCNN 75.6 63.6 69.1
JRNN 66.0 73.0 69.3
ANN 78.0 66.3 71.7
GCN-ED 77.9 68.8 73.1
JMEE 76.3 71.3 73.7
SELF 71.3 74.7 73.0
BGCN
AD-DMBERT
77.5
77.9
72.4
72.5
74.2
75.1
本文 76.0 76.7 76.3
Comparison of Model Performance
特征组合 P/% R/% F1/%
词向量表示: [ w ]
关系表示: [ d e p _ r ]
69.7 70.6 70.1
词向量表示: [ w ; ? s ]
关系表示: [ d e p _ r ]
71.1 74.1 72.5
词向量表示: [ w ; ? s ; ? a ]
关系表示: [ d e p _ r ]
76.6 73.5 75.0
Performance Comparison of Different Feature Combinations
模型 P/% R/% F1/%
Bi-LSTM+GCN 76.6 73.5 75.0
CNN+Bi-LSTM+GCN 77.0 74.3 75.6
Model Performance for Fusing Spatial Information
模型 P/% R/% F1/%
CNN+Bi-LSTM+GCN 77.0 74.3 75.6
CNN+Bi-LSTM+Gated-GCN 76.0 76.7 76.3
Model Performance by Using Gated-GCN
Confusion Matrix Based on 8 Categories
Confusion Matrix Based on 34 Subclass Classification (Partial)
Confusion Matrix of Test Results of Benchmark Model
Confusion Matrix of the Test Results of the Proposed Model
[1] Doddington G, Mitchell A, Przybocki M. The Automatic Content Extraction (ACE) Program—Tasks, Data, and Evaluation[C]// Proceedings of the 4th International Conference on Language Resources and Evaluation. 2004: 837-840.
[2] Li L, Jin L, Zhang Z Q, et al. Graph Convolution over Multiple Latent Context-Aware Graph Structures for Event Detection[J]. IEEE Access, 2020, 8: 171435-171446.
doi: 10.1109/Access.6287639
[3] Dutta S, Ma L, Saha T K, et al. GTN-ED: Event Detection Using Graph Transformer Networks[C]// Proceedings of the 15th Workshop on Graph-Based Methods for Natural Language Processing. 2021: 132-137.
[4] 万齐智, 万常选, 胡蓉, 等. 基于句法语义依存分析的中文金融事件抽取[J]. 计算机学报, 2021, 44 (3): 508-530.
[4] (Wan Qizhi, Wan Changxuan, Hu Rong, et al. Chinese Financial Event Extraction Base on Syntactic and Semantic Dependency Parsing[J]. Chinese Journal of Computers, 2021, 44 (3): 508-530.)
[5] 陈佳丽, 洪宇, 王捷, 等. 利用门控机制融合依存与语义信息的事件检测方法[J]. 中文信息学报, 2020, 34(8): 51-60.
[5] (Chen Jiali, Hong Yu, Wang Jie, et al. Combination of Dependency and Semantic Information via Gated Mechanism for Event Detection[J]. Journal of Chinese Information Processing, 2020, 34(8): 51-60.)
[6] Cui Z C, Chen W L, Chen Y X. Multi-Scale Convolutional Neural Networks for Time Series Classification[OL]. arXiv Preprint, arXiv: 1603.06995.
[7] Kipf T N, Welling M. Semi-Supervised Classification with Graph Convolutional Networks[OL]. arXiv Preprint, arXiv: 1609.02907.
[8] Dauphin Y N, Fan A, Auli M, et al. Language Modeling with Gated Convolutional Networks[C]// Proceedings of the 34th International Conference on Machine Learning - Volume 70. 2017: 933-941.
[9] Grishman R, Westbrook D, Meyers A. NYU’s English ACE 2005 System Description[C]// Proceedings of ACE 2005 Evaluation Workshop. 2005.
[10] Ji H, Grishman R. Refining Event Extraction Through Cross-Document Inference[C]// Proceedings of ACL-08:HLT. 2008: 254-262.
[11] Liao S S, Grishman R. Using Document Level Cross-Event Inference to Improve Event Extraction[C]// Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. 2010: 789-797.
[12] Hong Y, Zhang J F, Ma B, et al. Using Cross-Entity Inference to Improve Event Extraction[C]// Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. 2011: 1127-1136.
[13] Nguyen T H, Grishman R. Event Detection and Domain Adaptation with Convolutional Neural Networks[C]// Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2:Short Papers). 2015: 365-371.
[14] Chen Y B, Xu L H, Liu K, et al. Event Extraction via Dynamic Multi-Pooling Convolutional Neural Networks[C]// Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1:Long Papers). 2015: 167-176.
[15] Nguyen T H, Cho K, Grishman R. Joint Event Extraction via Recurrent Neural Networks[C]// Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. 2016: 300-309.
[16] Feng X C, Qin B, Liu T. A Language-Independent Neural Network for Event Detection[J]. Science China Information Sciences, 2018, 61(9): 092106.
doi: 10.1007/s11432-017-9359-x
[17] Sha L, Qian F, Chang B B, et al. Jointly Extracting Event Triggers and Arguments by Dependency-Bridge RNN and Tensor-Based Argument Interaction[C]// Proceedings of the 32nd AAAI Conference on Artificial Intelligence and the 30th Innovative Applications of Artificial Intelligence Conference and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence. 2018: 5916-5923.
[18] Orr J W, Tadepalli P, Fern X. Event Detection with Neural Networks: A Rigorous Empirical Evaluation[OL]. arXiv Preprint, arXiv:1808.08504.
[19] Nguyen T H, Grishman R. Graph Convolutional Networks with Argument-Aware Pooling for Event Detection[C]// Proceedings of the 32nd AAAI Conference on Artificial Intelligence and the 30th Innovative Applications of Artificial Intelligence Conference and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence. 2018: 5900-5907.
[20] Liu X, Luo Z C, Huang H Y. Jointly Multiple Events Extraction via Attention-Based Graph Information Aggregation [OL]. arXiv Preprint, arXiv:1809.09078.
[21] Yan H R, Jin X L, Meng X B, et al. Event Detection with Multi-Order Graph Convolution and Aggregated Attention[C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. 2019: 5766-5770.
[22] Cui S Y, Yu B W, Liu T W, et al. Edge-Enhanced Graph Convolution Networks for Event Detection with Syntactic Relation[OL]. arXiv Preprint, arXiv: 2020.10757.
[23] Devlin J, Chang M W, Lee K, et al. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding[OL]. arXiv Preprint, arXiv: 1810.04805.
[24] Wang X Z, Wang Z Q, Han X, et al. HMEAE: Hierarchical Modular Event Argument Extraction[C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. 2019: 5777-5783.
[25] Walker C, Strassel S, Medero J, et al. ACE 2005 Multilingual Training Corpus[OL]. Philadelphia: Linguistic Data Consortium, 2006. https://doi.org/10.35111/mwxc-vh88.
[26] LeCun Y, Bottou L, Bengio Y, et al. Gradient-Based Learning Applied to Document Recognition[J]. Proceedings of the IEEE, 1998, 86(11): 2278-2324.
doi: 10.1109/5.726791
[27] Mikolov T, Chen K, Corrado G, et al. Efficient Estimation of Word Representations in Vector Space[OL]. arXiv Preprint, arXiv:1301.3781.
[28] Zhang S, Zheng D Q, Hu X C, et al. Bidirectional Long Short-Term Memory Networks for Relation Classification[C]// Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation. 2015: 73-78.
[29] Chen Y B, Yang H, Liu K, et al. Collective Event Detection via a Hierarchical and Bias Tagging Networks with Gated Multi-Level Attention Mechanisms[C]// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 2018: 1267-1276.
[30] Li Q, Ji H, Huang L. Joint Event Extraction via Structured Prediction with Global Features[C]// Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1:Long Papers). 2013: 73-82.
[31] Liu S L, Liu K, He S Z, et al. A Probabilistic Soft Logic Based Approach to Exploiting Latent and Global Information in Event Classification[C]// Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2016: 2993-2999.
[32] Liu S L, Chen Y B, Liu K, et al. Exploiting Argument Information to Improve Event Detection via Supervised Attention Mechanisms[C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1:Long Papers). 2017: 1789-1798.
[33] Hong Y, Zhou W X, Zhang J L, et al. Self-Regulation: Employing a Generative Adversarial Network to Improve Event Detection[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1:Long Papers). 2018: 515-526.
[34] 程思伟, 葛唯益, 王羽, 等. BGCN: 基于BERT和图卷积网络的触发词检测[J]. 计算机科学, 2021, 48(7): 292-298.
doi: 10.11896/jsjkx.200500133
[34] (Cheng Siwei, Ge Weiyi, Wang Yu, et al. BGCN: Trigger Detection Based on BERT and Graph Convolution Network[J]. Computer Science, 2021, 48(7): 292-298.)
doi: 10.11896/jsjkx.200500133
[35] Wang X Z, Han X, Liu Z Y, et al. Adversarial Training for Weakly Supervised Event Detection[C]// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies, Volume 1 (Long and Short Papers). 2019: 998-1008.
[1] He Li, Yang Meihua, Liu Luyao. Detecting Events with SPO Semantic and Syntactic Information[J]. 数据分析与知识发现, 2023, 7(9): 114-124.
[2] Bao Tong, Zhang Chengzhi. Extracting Chinese Information with ChatGPT:An Empirical Study by Three Typical Tasks[J]. 数据分析与知识发现, 2023, 7(9): 1-11.
[3] Zhang Yingyi, Zhang Chengzhi, Zhou Yi, Chen Bikun. ChatGPT-Based Scientific Paper Entity Recognition: Performance Measurement and Availability Research[J]. 数据分析与知识发现, 2023, 7(9): 12-24.
[4] Yu Bengong, Ji Xiaohan. Detecting Multimodal Sarcasm Based on ADGCN-MFM[J]. 数据分析与知识发现, 2023, 7(10): 85-94.
[5] Guo Fanrong, Huang Xiaoxi, Wang Rongbo, Chen Zhiqun, Hu Chuang, Xie Yimin, Si Boyu. Identifying Metaphor with Transformer and Graph Convolutional Network[J]. 数据分析与知识发现, 2022, 6(4): 120-129.
[6] Yu Chuanming, Lin Hongjun, Zhang Zhengang. Joint Extraction Model for Entities and Events with Multi-task Deep Learning[J]. 数据分析与知识发现, 2022, 6(2/3): 117-128.
[7] Tan Ying, Tang Yifei. Extracting Citation Contents with Coreference Resolution[J]. 数据分析与知识发现, 2021, 5(8): 25-33.
[8] Wang Yi,Shen Zhe,Yao Yifan,Cheng Ying. Domain-Specific Event Graph Construction Methods:A Review[J]. 数据分析与知识发现, 2020, 4(10): 1-13.
[9] Tao Yue,Yu Li,Zhang Runjie. Active Learning Strategies for Extracting Phrase-Level Topics from Scientific Literature[J]. 数据分析与知识发现, 2020, 4(10): 134-143.
[10] Zhiqiang Liu,Yuncheng Du,Shuicai Shi. Extraction of Key Information in Web News Based on Improved Hidden Markov Model[J]. 数据分析与知识发现, 2019, 3(3): 120-128.
[11] Chengzhi Zhang,Zheng Li. Extracting Sentences of Research Originality from Full Text Academic Articles[J]. 数据分析与知识发现, 2019, 3(10): 12-18.
[12] Mu Dongmei,Jin Shan,Ju Yuanhong. Finding Association Between Diseases and Genes from Literature Abstracts[J]. 数据分析与知识发现, 2018, 2(8): 98-106.
[13] Li Jinhua,An Zhongjie. Analyzing Geographical Coordinates Data for Micro-blog Trending Events[J]. 现代图书情报技术, 2016, 32(2): 90-101.
[14] Yufeng Duan,Sisi Huang. Information Extraction from Chinese Plant Species Diversity Description Text[J]. 现代图书情报技术, 2016, 32(1): 87-96.
[15] Liu Wei, Wang Xing, Song Peiyan. A Noise Cleaning Method for Synonym Extraction Results[J]. 现代图书情报技术, 2015, 31(6): 64-70.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn