[Objective] This paper builds a model to monitor the trending topics from microblogs, aiming to deal with the issues of text drifting and quantitation of sentimental polarity. [Methods] First, we proposed a public opinion analysis model based on topic clustering and sentiment intensity. Then, we used the time series regression analysis to predict the sentimental changes among the trending topics. [Results] The prediction accuracy of our model reached 88.97%, which was about 7% higher than the iLab-Edinburgh model. [Limitations] More research is needed to study the early warning mechanisms for emergency events. [Conclusions] The proposed model could improve the prediction accuracy of sentimental changes, which provides an effective way to analyze the public opinion from microblogs.
王秀芳, 盛姝, 路燕. 一种基于话题聚类及情感强度的微博舆情分析模型*[J]. 数据分析与知识发现, 2018, 2(6): 37-47.
Wang Xiufang,Sheng Shu,Lu Yan. Analyzing Public Opinion from Microblog with Topic Clustering and Sentiment Intensity. Data Analysis and Knowledge Discovery, 2018, 2(6): 37-47.
(Ma Xiaoling, Jin Biyi, Fan Bingsi.An Analysis of Chinese Text Emotional Tendency[J]. Information and Documentation Service, 2013(1): 52-56.)
[2]
Vaibhavi N, Patodkar N P, Shaikh I R.Sentimental Analysis on Twitter Data Using Naive Bayes[C]// Proceedings of the 6th Post Graduate Conference for Computer Engineering. 2017.
(Tang Xiaobo, Luo Yingli.Integrating Emotional Divergence and User Interests into the Prediction of Microblog Retweeting[J]. Library and Information Service, 2017, 61(9): 102-110.)
[4]
Ingle M M, Emmanues M.Evaluations on Sentiment Analysis of Micro Blogging Site Using Topic Modeling[C]// Proceedings of the 2016 International Conference on Signal Processing, Communication, Power and Embedded System. 2016.
[5]
Giatsoglou M, Vozalis M G, Diamantaras K, et al.Sentiment Analysis Leveraging Emotions and Word Embeddings[J]. Expert Systems with Applications, 2017, 69: 214-224.
doi: 10.1016/j.eswa.2016.10.043
(Han Zhongming, Chen Ni, Le Jiajin, et al.An Efficient and Effective Clustering Algorithm for Time Series of Hot Topics[J]. Chinese Journal of Computers, 2012, 35(11): 2337-2347.)
doi: 10.3724/SP.J.1016.2012.02337
(Wu Qinglin, Zhou Tianhong.Public Opinion Analysis of Chinese Microblog Based on Topic Clustering and Emotion Intensity[J]. Information Studies: Theory & Application, 2016, 39(1): 109-112.)
doi: 10.16353/j.cnki.1000-7490.2016.01.019
(He Yue, Xiao Min, Zhang Yue.Sentiment Analysis of Trending Topics Based on Relevance[J]. Data Analysis and Knowledge Discovery, 2017, 1(3): 46-53.)
[9]
Sotiropoulos D N, Kounavis C D, Kourouthanassis P, et al.What Drives Social Sentiment? An Entropic Measure-based Clustering Approach Towards Identifying Factors that Influence Social Sentiment Polarity[C]// Proceedings of the 5th International Conference on Information, Intelligence, Systems and Applications. 2014.
[10]
Manek A S, Shenoy P D, Mohan M C, et al.Aspect Term Extraction for Sentiment Analysis in Large Movie Reviews Using Gini Index Feature Selection Method and SVM Classifier[J]. World Wide Web, 2017, 20(2): 135-154.
doi: 10.1007/s11280-015-0381-x
(Li Hui, Chai Yaqing.Analysis Sentiment Polarity of Comments Based on Attributes[J]. Data Analysis and Knowledge Discovery, 2017, 1(10): 1-11.)
[12]
Meisheri H, Saha R, Sinha P, et al.Textmining at EmoInt-2017: A Deep Learning Approach to Sentiment Intensity Scoring of English Tweets[C]//Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Copenhagen, Denmark. 2017.
(Zheng Lijuan, Wang Hongwei, Guo Kaiqiang.Sentiment Intensity of Online Reviews Based on Fuzzy-Statistics of Sentiment Words[J]. Journal of Systems & Management, 2014, 23(3): 324-330.)
[14]
Pérez-Ortiz M, Gutiérrez P A, Carbonero-Ruz M, et al.Semi-supervised Learning for Ordinal Kernel Discriminant Analysis[J]. Neural Networks, 2016, 84: 57-66.
doi: 10.1016/j.neunet.2016.08.004
pmid: 27639724
(Zhong Minjuan, Wan Changxuan, Liu Dexi.Opinion Lexicon Construction Based on Association Rule and Orientation Analysis for Production Review[J]. Journal of the China Society for Scientific and Technical Information, 2016, 35(5): 501-509.)
(Liu Dexi.Effect of Sentimental Word Expansion on the Performance of Microblog Sentiment Classification Task[J]. Journal of Chinese Computer Systems, 2016, 37(5): 957-965.)
(Yang Lin.Emotional Term Weight Research and Application to Emotional Polarity Analysis[J]. Journal of Computer Applications, 2015, 35(S2): 125-127.)
[21]
Van Arthur G, Staals F, Löffler M, et al.Multi-Granular Trend Detection for Time-Series Analysis[J]. IEEE Transactions on Visualization and Computer Graphics, 2017, 23(1): 661-670.
doi: 10.1109/TVCG.2016.2598619
(Tang Xiaobo, Tong Haiyan, Yan Chengxi.Microblogging Public Opinion Analysis Based on Emotional Intensity of the Topic[J]. Research on Libray Science, 2014(17): 85-93.)
[23]
Refaee E, Rieser V. iLab-Edinburgh at SemEval-2016 Task 7: A Hybrid Approach for Determining Sentiment Intensity of Arabic Twitter Phrases[C]// Proceedings of the 10th International Workshop on Semantic Evaluation. 2016.