|
|
Identifying Relationship of Chinese Characters with Attention Mechanism and Convolutional Neural Network |
Zhao Pengwu1,Li Zhiyi2(),Lin Xiaoqi2 |
1Institute of Scientific and Technical Information of China, Beijing 100038, China 2School of Economics & Management, South China Normal University, Guangzhou 510006, China |
|
|
Abstract [Objective] The paper tries to identify the features and relationship of dynamic semantic information from the Chinese character entities. [Methods] First, we used the attention mechanism and improved convolution neural network model to automatically extract features from the training data of public corpus with character entity relationship. Then, we compared our model’s performance with the existing ones from the perspectives of entity relationship recognition efficiency, as well as entity relationship extraction effects and efficiency. [Results] The performance of CNN+Attention model is better than those of the SVM, LR, LSTM, BiLSTM and CNN model in prediction accuracy. Our new model is 0.92% higher in accuracy, 0.80% higher in recall and 0.86% higher in F1 value than the BiLSTM model with relatively better extraction effect. [Limitations] We need to examine our model with more sample data sets. [Conclusions] The proposed model could effectively improve the accuracy and recall of entity relationship extraction for Chinese characters.
|
Received: 22 September 2021
Published: 23 September 2022
|
|
Fund:National Social Science Fund of China(17BTQ062) |
Corresponding Authors:
Li Zhiyi,ORCID: 0000-0001-6407-2554
E-mail: leeds@scnu.edu.cn
|
[1] |
Bekoulis G, Deleu J, Demeester T, et al. Joint Entity Recognition and Relation Extraction as A Multi-Head Selection Problem[J]. Expert Systems with Applications, 2018, 114: 34-45.
doi: 10.1016/j.eswa.2018.07.032
|
[2] |
Kim E K, Choi K S. Improving Distantly Supervised Relation Extraction by Knowledge Base-Driven Zero Subject Resolution[J]. IEICE Transactions on Information and Systems, 2018, E101.D(10): 2551-2558.
doi: 10.1587/transinf.2017EDL8270
|
[3] |
贾真, 冶忠林, 尹红风, 等. 基于Tri-training与噪声过滤的弱监督关系抽取[J]. 中文信息学报, 2016, 30(4): 142-149, 158.
|
[3] |
(Jia Zhen, Ye Zhonglin, Yin Hongfeng, et al. Weakly Supervised Relation Extraction Based on Tri-training and Noise Filtering[J]. Journal of Chinese Information Processing, 2016, 30(4): 142-149, 158.)
|
[4] |
甘丽新, 万常选, 刘德喜, 等. 基于句法语义特征的中文实体关系抽取[J]. 计算机研究与发展, 2016, 53(2): 284-302.
|
[4] |
(Gan Lixin, Wan Changxuan, Liu Dexi, et al. Chinese Named Entity Relation Extraction Based on Syntactic and Semantic Features[J]. Journal of Computer Research and Development, 2016, 53(2): 284-302.)
|
[5] |
吴胜, 刘茂福, 胡慧君, 等. 中文文本中实体数值型关系无监督抽取方法[J]. 武汉大学学报(理学版), 2016, 62(6): 552-560.
|
[5] |
(Wu Sheng, Liu Maofu, Hu Huijun, et al. Unsupervised Extraction of Attribute-Value Entity Relation from Chinese Texts[J]. Journal of Wuhan University (Natural Science Edition), 2016, 62(6): 552-560.)
|
[6] |
肜博辉, 付琨, 黄宇, 等. 基于多通道卷积神经网的实体关系抽取[J]. 计算机应用研究, 2017, 34(3): 689-692.
|
[6] |
(Rong Bohui, Fu Kun, Huang Yu, et al. Relation Extraction Based on Multi-Channel Convolutional Neural Network[J]. Application Research of Computers, 2017, 34(3): 689-692.)
|
[7] |
张晓斌, 陈福才, 黄瑞阳. 基于CNN和双向LSTM融合的实体关系抽取[J]. 网络与信息安全学报, 2018, 4(9): 44-51.
|
[7] |
(Zhang Xiaobin, Chen Fucai, Huang Ruiyang. Relation Extraction Based on CNN and Bi-LSTM[J]. Chinese Journal of Network and Information Security, 2018, 4(9): 44-51.)
|
[8] |
延浩然, 靳小龙, 贾岩涛, 等. 一种改进的实体关系抽取算法——OptMultiR[J]. 中文信息学报, 2018, 32(9): 66-74.
|
[8] |
(Yan Haoran, Jin Xiaolong, Jia Yantao, et al. An Improved Entity Relation Extraction Algorithm——OptMultiR[J]. Journal of Chinese Information Processing, 2018, 32(9): 66-74.)
|
[9] |
车金立, 唐力伟, 邓士杰, 等. 基于双重注意力机制的远程监督中文关系抽取[J]. 计算机工程与应用, 2019, 55(20): 107-113.
doi: 10.3778/j.issn.1002-8331.1806-0438
|
[9] |
(Che Jinli, Tang Liwei, Deng Shijie, et al. Distant Supervision Chinese Relation Extraction Based on Dual Attention Mechanism[J]. Computer Engineering and Applications, 2019, 55(20): 107-113.)
doi: 10.3778/j.issn.1002-8331.1806-0438
|
[10] |
Cui Z Y, Pan L, Liu S J. Hybrid BiLSTM-Siamese Network for Relation Extraction[C]// Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems. 2019: 1907-1909.
|
[11] |
闫跃, 霍其润, 李天昊, 等. 融合多重注意力机制的卷积神经网络文本分类设计与实现[J]. 小型微型计算机系统, 2021, 42(2): 362-367.
|
[11] |
(Yan Yue, Huo Qirun, Li Tianhao, et al. Design and Implementation of Text Classification Based on Convolutional Neural Network with Multiple Attention Mechanisms[J]. Journal of Chinese Computer Systems, 2021, 42(2): 362-367.)
|
[12] |
张婷婷, 李卫疆, 李涛. 基于自注意力卷积神经网络的实体关系抽取[J]. 信息技术, 2022, 46(1): 11-15.
|
[12] |
(Zhang Tingting, Li Weijiang, Li Tao. Entity Relation Extraction Based on Self-Attention Convolutional Neural Network[J]. Information Technology, 2022, 46(1): 11-15.)
|
[13] |
冯建周, 宋沙沙, 王元卓, 等. 基于改进注意力机制的实体关系抽取方法[J]. 电子学报, 2019, 47(8): 1692-1700.
doi: 10.3969/j.issn.0372-2112.2019.08.012
|
[13] |
(Feng Jianzhou, Song Shasha, Wang Yuanzhuo, et al. Entity Relation Extraction Based on Improved Attention Mechanism[J]. Acta Electronica Sinica, 2019, 47(8): 1692-1700.)
doi: 10.3969/j.issn.0372-2112.2019.08.012
|
[14] |
张兰霞, 胡文心. 基于双向GRU神经网络和双层注意力机制的中文文本中人物关系抽取研究[J]. 计算机应用与软件, 2018, 35(11): 130-135, 189.
|
[14] |
(Zhang Lanxia, Hu Wenxin. Character Relation Extraction in Chinese Text Based on Bidirectional GRU Neural Network and Dual-attention Mechanism[J]. Computer Applications and Software, 2018, 35(11): 130-135, 189.)
|
[15] |
黄杨琛, 贾焰, 甘亮, 等. 基于远程监督的多因子人物关系抽取模型[J]. 通信学报, 2018, 39(7): 103-112.
|
[15] |
(Huang Yangchen, Jia Yan, Gan Liang, et al. Multi-Factor Person Entity Relation Extraction Model Based on Distant Supervision[J]. Journal on Communications, 2018, 39(7): 103-112.)
|
[16] |
Lv C Y, Pan D, Li Y X, et al. A Novel Chinese Entity Relationship Extraction Method Based on the Bidirectional Maximum Entropy Markov Model[J]. Complexity, 2021, 2021: 6610965.
|
[17] |
Simonyan K, Zisserman A. Very Deep Convolutional Networks for Large-Scale Image Recognition[OL]. arXiv Preprint, arXiv: 1409.1556.
|
[18] |
Mikolov T, Chen K, Corrado G, et al. Efficient Estimation of Word Representations in Vector Space[OL]. arXiv Preprint, arXiv: 1301.3781
|
[19] |
Lecun Y, Bottou L, Bengio Y, et al. Gradient-Based Learning Applied to Document Recognition[J]. Proceedings of the IEEE, 1998, 86(11): 2278-2324.
doi: 10.1109/5.726791
|
[20] |
Mnih V, Heess N, Graves A. Recurrent Models of Visual Attention[C]// Proceedings of the 27th Annual Conference on Neural Information Processing Systems. 2014: 2204-2212.
|
[21] |
Chen X, Hsieh C J, Gong B. When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations[OL]. arXiv Preprint, arXiv: 2106.01548.
|
[22] |
Elsayed G F, Kornblith S, Le Q V. Saccader: Improving Accuracy of Hard Attention Models for Vision[OL]. arXiv Preprint, arXiv: 1908.07644.
|
[23] |
Luong T, Pham H, Manning C D. Effective Approaches to Attention-based Neural Machine Translation[C]// Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 2015: 2412-2421.
|
[24] |
Chinchor N, Sundheim B. MUC-5 Evaluation Metrics[C]// Proceedings of the 5th Conference on Message Understanding. 1993: 69-78.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|