1School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing 100876, China 2Key Laboratory of Trustworthy Distributed Computing and Service (BUPT), Ministry of Education, Beijing 100876, China 3Beijing University of Posts and Telecommunications Library, Beijing 100876, China
[Objective] This paper studies the domain discrimination for public opinions of online communities, aiming to improve knowledge base, as well as the effectiveness of the machine learning models.[Methods] We retrieved 478,303 pieces of textual data from multiple online communities for college students. Then, we created a semantic relationship graph with a total of 5,248 nodes and 16,488 edges, which could also be extended automatically. Finally, we proposed a short text analysis model to conduct domain analysis for the texts.[Results] The F value of the proposed model reached 83.94%, which was 8.56%, 5.97% and 4.27% higher than those of the SVM, NB and CNN methods.[Limitations] The sample size needs to be expanded and the parameter feedback mechanism needs to be modified.[Conclusions] Compared with methods based on machine learning, the proposed model’s accuracy is improved. It could also conduct real-time analysis.
田钟林,吴旭,颉夏青,许晋,陆月明. 一种基于领域语义关系图的短文本实时分析模型*[J]. 数据分析与知识发现, 2020, 4(2/3): 239-248.
Tian Zhonglin,Wu Xu,Xie Xiaqing,Xu Jin,Lu Yueming. Real-time Analysis Model for Short Texts with Relationship Graph of Domain Semantics. Data Analysis and Knowledge Discovery, 2020, 4(2/3): 239-248.
( Zuo Meng, Li Changzu . A Review of Network Public Opinion: from Theoretical Research to Practical Application[J]. Journal of Intelligence, 2017,36(10):71-78, 140.)
丁诗晴 . 基于在线网站评论的中文文本挖掘[D]. 武汉:华中科技大学, 2016.
( Ding Shiqing . Chinese Text Mining Based on Online Customer Review[D]. Wuhan: Huazhong University of Science & Technology, 2016.)
张璐 . 基于情感计算的网络社区舆情分析预警技术研究[D]. 北京:北京邮电大学, 2018.
( Zhang Lu . Analysis and Early Warning Technology Research Based on Affective Computing in Online Community[D]. Beijing: Beijing University of Posts and Telecommunications, 2018.)
严仲培 . 面向旅游在线评论的文本挖掘方法研究[D]. 合肥:合肥工业大学, 2018.
( Yan Zhongpei . Research on the Method of Text Mining for Travel Online Comments[D]. Hefei: Hefei University of Technology, 2018.)
杨郁琪 . 基于文本挖掘的用户满意度影响因素研究[D]. 太原:中北大学, 2018.
( Yang Yuqi . Study on the Influencing Factors of User Satisfaction Based on Text Mining[D]. Taiyuan: North University of China, 2018.)
范宁 . 基于文本挖掘在民宿满意度中的研究[D]. 桂林:广西师范大学, 2019.
( Fan Ning . Research on Satisfaction of Homestay Based on Text Mining[D]. Guilin: Guangxi Normal University, 2018.)
Ramanathan V, Meyyappan T . Twitter Text Mining for Sentiment Analysis on People’s Feedback About Oman Tourism [C]// Proceedings of the 4th MEC International Conference on Big Data and Smart City (ICBDSC), Muscat, Oman. 2019.
( Li Lirong . Research on Key Technologies in Network Public Opinion Analysis System[J]. Journal of Shanxi Police College, 2019,27(1):43-46.)
Ramadhani A M, Goo H S . Twitter Sentiment Analysis Using Deep Learning Methods [C]// Proceedings of the 7th International Annual Engineering Seminar (InAES), Yogyakarta, Indonesia. 2017.
Halibas A S, Shaffi A S, Mohamed M A K V . Application of Text Classification and Clustering of Twitter Data for Business Analytics [C]// Proceedings of the 2018 Majan International Conference (MIC), Muscat, Oman. 2018.
张祥 . 面向政务需求的网络舆情分析方法研究[D]. 成都:电子科技大学, 2017.
( Zhang Xiang . Research on Public Opinion Analysis Method of the Network for the Needs of Government[D]. Chengdu: University of Electronic Science and Technology of China, 2017.)