|
|
Chinese Text Sentiment Analysis Based on Dual Channel Attention Network with Hybrid Word Embedding |
Zhou Ning(),Zhong Na,Jin Gaoya,Liu Bin |
School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China |
|
|
Abstract [Objective] This paper addresses the challenges facing the traditional static word vector embedding method, aiming to handle polysemy in Chinese texts effectively. It also excavates the contextual emotional features and internal semantic association structure. [Methods] In one channel, we integrated the sentiment elements related to the text into Word2Vec and FastText word vectors through rough data reasoning. We also used CNN to extract the local features of the text. In the other channel, we employed BERT for word embedding supplement and used BiLSTM to obtain the global features of the texts. Finally, we added the attention calculation module for the deep interaction of dual channel features. [Results] The experiment on three Chinese datasets achieved the highest accuracy of 92.43%, representing an improvement of 0.81% over the best value of the benchmark model. [Limitations] The selected datasets are only for modelling coarse-grained sentiment classification. We did not conduct experiments in the fine-grained domain. [Conclusions] The proposed model could effectively improve the performance of Chinese text sentiment classification.
|
Received: 11 April 2022
Published: 13 April 2023
|
|
Fund:Research Results of National Natural Science Foundation of China(61650207);Research Results of National Natural Science Foundation of China(61963023);Tianyou Innovation Team of Lanzhou Jiaotong University(TY202003) |
Corresponding Authors:
Zhou Ning,ORCID:0000-0001-7466-8925,E-mail:zhouning@mail.lzjtu.cn。
|
[1] |
Hearst M A. Direction-Based Text Interpretation as an Information Access Refinement[A]// Text-Based Intelligent Systems: Current Research and Practice in Information Extraction and Retrieval[M]. 1992: 257-274.
|
[2] |
Xie R B, Yuan X C, Liu Z Y, et al. Lexical Sememe Prediction via Word Embeddings and Matrix Factorization[C]// Proceedings of the 26th International Joint Conference on Artificial Intelligence. ACM, 2017: 4200-4206.
|
[3] |
Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment Classification Using Machine Learning Techniques[C]// Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing. ACM, 2002: 79-86.
|
[4] |
陈钊, 徐睿峰, 桂林, 等. 结合卷积神经网络和词语情感序列特征的中文情感分析[J]. 中文信息学报, 2015, 29(6): 172-178.
|
[4] |
( Chen Zhao, Xu Ruifeng, Gui Lin, et al. Combining Convolutional Neural Networks and Word Sentiment Sequence Features for Chinese Text Sentiment Analysis[J]. Journal of Chinese Information Processing, 2015, 29(6): 172-178.)
|
[5] |
Tang D Y, Qin B, Feng X C, et al. Effective LSTMS for Target-Dependent Sentiment Classification[OL]. arXiv Preprint, arXiv: 1512.01100.
|
[6] |
Lu W J, Duan Y, Song Y T. Self-Attention-Based Convolutional Neural Networks for Sentence Classification[C]// Proceedings of the 6th International Conference on Computer and Communications. IEEE, 2021: 2065-2069.
|
[7] |
Mikolov T, Chen K, Corrado G, et al. Efficient Estimation of Word Representations in Vector Space[OL]. arXiv Preprint, arXiv: 1301.3781.
|
[8] |
Pennington J, Socher R, Manning C. GloVe: Global Vectors for Word Representation[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014: 1532-1543.
|
[9] |
Bojanowski P, Grave E, Joulin A, et al. Enriching Word Vectors with Subword Information[J]. Transactions of the Association for Computational Linguistics, 2017, 5: 135-146.
doi: 10.1162/tacl_a_00051
|
[10] |
Choi J, Lee S W. Improving FastText with Inverse Document Frequency of Subwords[J]. Pattern Recognition Letters, 2020, 133: 165-172.
doi: 10.1016/j.patrec.2020.03.003
|
[11] |
Kim Y. Convolutional Neural Networks for Sentence Classification[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014: 1746-1751.
|
[12] |
Dong M, Li Y F, Tang X, et al. Variable Convolution and Pooling Convolutional Neural Network for Text Sentiment Classification[J]. IEEE Access, 2020, 8: 16174-16186.
doi: 10.1109/Access.6287639
|
[13] |
Li L Q, Yang L, Zeng Y Y. Improving Sentiment Classification of Restaurant Reviews with Attention-Based Bi-GRU Neural Network[J]. Symmetry, 2021, 13(8): 1517.
doi: 10.3390/sym13081517
|
[14] |
程艳, 孙欢, 陈豪迈, 等. 融合卷积神经网络与双向GRU的文本情感分析胶囊模型[J]. 中文信息学报, 2021, 35(5): 118-129.
|
[14] |
( Cheng Yan, Sun Huan, Chen Haomai, et al. Text Sentiment Analysis Capsule Model Combining Convolutional Neural Network and Bidirectional GRU[J]. Journal of Chinese Information Processing, 2021, 35(5): 118-129.)
|
[15] |
韩普, 张伟, 张展鹏, 等. 基于特征融合和多通道的突发公共卫生事件微博情感分析[J]. 数据分析与知识发现, 2021, 5(11): 68-79.
|
[15] |
( Han Pu, Zhang Wei, Zhang Zhanpeng, et al. Sentiment Analysis of Weibo Posts on Public Health Emergency with Feature Fusion and Multi-Channel[J]. Data Analysis and Knowledge Discovery, 2021, 5(11): 68-79.)
|
[16] |
Cheng Y, Yao L B, Xiang G X, et al. Text Sentiment Orientation Analysis Based on Multi-Channel CNN and Bidirectional GRU with Attention Mechanism[J]. IEEE Access, 2020, 8: 134964-134975.
doi: 10.1109/Access.6287639
|
[17] |
Peters M, Neumann M, Iyyer M, et al. Deep Contextualized Word Representations[C]// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. 2018: 2227-2237.
|
[18] |
Devlin J, Chang M W, Lee K, et al. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding[OL]. arXiv Preprint, arXiv: 1810.04805.
|
[19] |
Man R, Lin K. Sentiment Analysis Algorithm Based on BERT and Convolutional Neural Network[C]// Proceedings of the 2021 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers. IEEE, 2021: 769-772.
|
[20] |
Mihalcea R, Tarau P. TextRank: Bringing Order into Text[C]// Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing. 2004: 404-411.
|
[21] |
Yan S, Yan L, Wu J Z. Rough Data-Deduction Based on the Upper Approximation[J]. Information Sciences, 2016, 373: 308-320.
doi: 10.1016/j.ins.2016.09.011
|
[22] |
闫硕. 基于上近似的粗糙数据推理研究及应用[D]. 北京: 北京交通大学, 2017.
|
[22] |
Yan Shuo. Rough Data-Deduction Based on the Upper Approximation and Its Applications[D]. Beijing: Beijing Jiaotong University, 2017.)
|
[23] |
周宁, 石雯茜, 朱昭昭. 基于粗糙数据推理的TextRank关键词提取算法[J]. 中文信息学报, 2020, 34(9): 44-52.
|
[23] |
( Zhou Ning, Shi Wenqian, Zhu Zhaozhao. TextRank Keyword Extraction Algorithm Based on Rough Data-Deduction[J]. Journal of Chinese Information Processing, 2020, 34(9): 44-52.)
|
[24] |
Turney P D, Littman M L. Unsupervised Learning of Semantic Orientation from a Hundred-Billion-Word Corpus[OL]. arXiv Preprint, arXiv: 0212012.
|
[25] |
朱新华, 马润聪, 孙柳, 等. 基于知网与词林的词语语义相似度计算[J]. 中文信息学报, 2016, 30(4): 29-36.
|
[25] |
( Zhu Xinhua, Ma Runcong, Sun Liu, et al. Word Semantic Similarity Computation Based on HowNet and CiLin[J]. Journal of Chinese Information Processing, 2016, 30(4): 29-36.)
|
[26] |
谢思雅, 施一萍, 胡佳玲, 等. 基于BiLSTM-ATT的微博用户情感分类研究[J]. 传感器与微系统, 2021, 40(2): 26-29.
|
[26] |
( Xie Siya, Shi Yiping, Hu Jialing, et al. Research on Emotional Classification of Weibo Users Based on BiLSTM-ATT[J]. Transducer and Microsystem Technologies, 2021, 40(2): 26-29.)
|
[27] |
Tan S B, Zhang J. An Empirical Study of Sentiment Analysis for Chinese Documents[J]. Expert Systems with Applications, 2008, 34(4): 2622-2629.
doi: 10.1016/j.eswa.2007.05.028
|
[28] |
陈波, 谢珺, 苗夺谦, 等. 基于粗糙集和多通道词向量的中文文本情感特征分析[J]. 中文信息学报, 2020, 34(8): 94-104.
|
[28] |
( Chen Bo, Xie Jun, Miao Duoqian, et al. Chinese Text Sentiment Feature Analysis Based on Rough Set and Multi Channel Word Vector[J]. Journal of Chinese Information Processing, 2020, 34(8): 94-104.)
|
[29] |
Feng Y, Cheng Y. Short Text Sentiment Analysis Based on Multi-Channel CNN with Multi-Head Attention Mechanism[J]. IEEE Access, 2021, 9: 19854-19863.
doi: 10.1109/Access.6287639
|
[30] |
杨杰, 杨文军. 基于BERT模型的文本评论情感分析[J]. 天津理工大学学报, 2021, 37(2): 12-16.
|
[30] |
( Yang Jie, Yang Wenjun. Text Comment Sentiment Analysis Based on BERT Model[J]. Journal of Tianjin University of Technology, 2021, 37(2): 12-16.)
|
[31] |
杨璐, 何明祥. 基于门控机制和卷积神经网络的中文文本情感分析模型[J]. 计算机应用, 2021, 41(10): 2842-2848.
doi: 10.11772/j.issn.1001-9081.2020122043
|
[31] |
( Yang Lu, He Mingxiang. Chinese Text Sentiment Analysis Model Based on Gated Mechanism and Convolutional Neural Network[J]. Journal of Computer Applications, 2021, 41(10): 2842-2848.)
doi: 10.11772/j.issn.1001-9081.2020122043
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|