Research on Public Policy Support Based on Character-level CNN Technology
Qiu Erli1(),He Hongwei2,Yi Chengqi1,Li Huiying1
1Big Data Development Department, State Information Center, Beijing 100045, China 2Department of Information Management, Peking University, Beijing 100871, China
[Objective] This paper proposed an index of Internet users’ sentiment classification which is more suitable for public policy evaluation, and explored the automatic method for Internet users’ stance detection based on the deep learning technology.[Methods] Three important public policies of different types and in different fields were selected as research objects. After collecting, cleaning and labeling the related data of Sina Weibo, this paper analyzed the three policies’ support on Internet, and constructed a text classification model based on the character-level convolutional neural network (CNN) technology. Meanwhile this paper compared and interpretd the effectiveness and efficiency of the experimental results.[Results] The results showed that our model can achieve good performance on the indicators of the accuracy and recall rate of the three datasets.There were two datasets with F1 value above 0.8 and one dataset with F1 value above 0.6. Meanwhile the model took less time than the recurrent neural network (RNN) model, and the training time gap is dozens of times.[Limitations] The data sample size and policy coverage are limited, and the calculation method for Internet users’ support needs to be further studied.[Conclusions] The stance classification method and the character-level CNN technology perform well in the effectiveness and efficiency of public policy evaluation, and may play a significant role especially in the evaluation of emergency policies.
邱尔丽,何鸿魏,易成岐,李慧颖. 基于字符级CNN技术的公共政策网民支持度研究 *[J]. 数据分析与知识发现, 2020, 4(7): 28-37.
Qiu Erli,He Hongwei,Yi Chengqi,Li Huiying. Research on Public Policy Support Based on Character-level CNN Technology. Data Analysis and Knowledge Discovery, 2020, 4(7): 28-37.
Bollen J, Pepe A, Mao H. Modeling Public Mood and Emotion: Twitter Sentiment and Socio-economic Phenomena[C] // Proceedings of the 5th International AAAI Conference on Weblogs and Social Media (ICWSM). 2011: 450-453.
( Dong Yinghong, Chen Hao, Lai Kaisheng, et al. Weibo Social Moods Measurement and Validation[J]. Journal of Psychological Science, 2015,38(5):1141-1146.)
[3]
Bermingham A, Smeaton A F. On Using Twitter to Monitor Political Sentiment and Predict Election Results[C] // Proceedings of the 2011 Workshop on Sentiment Analysis Where AI Meets Psychology (SAAIP). 2011: 2-10.
[4]
Agarwal A, Xie B, Vovsha I, et al. Sentiment Analysis of Twitter Data[C] // Proceedings of the 2011 Workshop on Language in Social Media (LSM). 2011: 30-38.
( Zhou Yanfang, Zhou Gang, Lu Zhonglei. Approach of Stance Detection in Micro-blog Based on Transfer Learning and Multi-representation[J]. Computer Science, 2018,45(9):243-247.)
[6]
Durahim A O, Coşkun M. #iamhappybecause: Gross National Happiness Through Twitter Analysis and Big Data[J]. Technological Forecasting & Social Change, 2015,99:92-105.
[7]
朱廷劭. 大数据时代的心理学研究及应用[M]. 北京: 科学出版社, 2016.
[7]
( Zhu Tingshao. Psychological Research and Application in the Age of Big data[M]. Beijing: Science Press, 2016.)
( Wei Ying, Zhang Jing, Qiu Erli. Evaluation of the Effect of Polices on Promoting Mass Entrepreneurship and Innovation Based on Big Data[J]. China Economic & Trade Herald, 2019(16):12-14.)
( Wang Jiandong, Tong Nannan, Yi Chengqi. The Transformation of Public Policy Assessment in the Age of Big Data[M]. Beijing: Social Sciences Academic Press, 2019.)
( Wang Yamin, Ning Jing, Ma Xubu. Research on Public Opinion Support of Public Policy Based on Social Media[J]. Information Studies: Theory & Application, 2018,41(3):95-100.)
[11]
Hu G, Yang Y, Yi D, et al. When Face Recognition Meets with Deep Learning: An Evaluation of Convolutional Neural Networks for Face Recognition[C] // Proceedings of the 2015 IEEE International Conference on Computer Vision Workshops. 2015: 142-150.
[12]
Nian F, Li T, Wang Y, et al. Pornographic Image Detection Utilizing Deep Convolutional Neural Networks[J]. Neurocomputing, 2016,210:283-293.
[13]
Liu Z, Wu Z, Li T, et al. GMM and CNN Hybrid Method for Short Utterance Speaker Recognition[J]. IEEE Transactions on Industrial Informatics, 2018,14(7):3244-3252.
[14]
Athiwaratkun B, Stokes J W. Malware Classification with LSTM and GRU Language Models and a Character-level CNN[C] // Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2017: 2482-2486.
[15]
Kim Y. Convolutional Neural Networks for Sentence Classification[C] // Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) . 2014: 1746-1751.
[16]
Zhang X, Zhao J, LeCun Y. Character-level Convolutional Networks for Text Classification[C] // Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS) . 2015: 649-657.
[17]
Bojanowski P, Grave E, Joulin A, et al. Enriching Word Vectors with Subword Information[J]. Transactions of the Association for Computational Linguistics (TACL), 2017,5:135-146.
[18]
Yang Z, Yang D, Dyer C, et al. Hierarchical Attention Networks for Document Classification[C] // Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL). 2016: 1480-1489.
[19]
Kim Y, Jernite Y, Sontag D, et al. Character-aware Neural Language Models[C] // Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2016: 2741-2749.
( Liu Wanjun, Liang Xuejian, Qu Haicheng. Learning Performance of Convolutional Neural Networks with Different Pooling Models[J]. Journal of Image and Graphics, 2016,21(9):1178-1190.)
[21]
Wang X, Jiang W, Luo Z. Combination of Convolutional and Recurrent Neural Network for Sentiment Analysis of Short Texts[C] // Proceedings of the 26th International Conference on Computational Linguistics (COLING). 2016: 2428-2437.
[22]
Manning C, Raghavan P, Schütze H. Book Review: Introduction to Information Retrieval[J]. Natural Language Engineering, 2010,16(1):100-103.
[23]
Brown C D, Davis H T. Receiver Operating Characteristics Curves and Related Decision Measures: A Tutorial[J]. Chemometrics and Intelligent Laboratory Systems, 2006,80(1):24-38.
[24]
Pascanu R, Mikolov T, Bengio Y. On the Difficulty of Training Recurrent Neural Networks[C] // Proceedings of the 30th International Conference on Machine Learning (ICML). 2013: 1310-1318.