|
|
Real-time Analysis Model for Short Texts with Relationship Graph of Domain Semantics |
Tian Zhonglin1,2,Wu Xu1,2,3(),Xie Xiaqing1,2,Xu Jin1,2,Lu Yueming1,2 |
1School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing 100876, China 2Key Laboratory of Trustworthy Distributed Computing and Service (BUPT), Ministry of Education, Beijing 100876, China 3Beijing University of Posts and Telecommunications Library, Beijing 100876, China |
|
|
Abstract [Objective] This paper studies the domain discrimination for public opinions of online communities, aiming to improve knowledge base, as well as the effectiveness of the machine learning models.[Methods] We retrieved 478,303 pieces of textual data from multiple online communities for college students. Then, we created a semantic relationship graph with a total of 5,248 nodes and 16,488 edges, which could also be extended automatically. Finally, we proposed a short text analysis model to conduct domain analysis for the texts.[Results] The F value of the proposed model reached 83.94%, which was 8.56%, 5.97% and 4.27% higher than those of the SVM, NB and CNN methods.[Limitations] The sample size needs to be expanded and the parameter feedback mechanism needs to be modified.[Conclusions] Compared with methods based on machine learning, the proposed model’s accuracy is improved. It could also conduct real-time analysis.
|
Received: 24 May 2019
Published: 26 April 2020
|
|
Corresponding Authors:
Xu Wu
E-mail: wux@bupt.edu.cn
|
[1] |
左蒙, 李昌祖 . 网络舆情研究综述:从理论研究到实践应用[J]. 情报杂志, 2017,36(10):71-78,140.
|
[1] |
( Zuo Meng, Li Changzu . A Review of Network Public Opinion: from Theoretical Research to Practical Application[J]. Journal of Intelligence, 2017,36(10):71-78, 140.)
|
[2] |
丁诗晴 . 基于在线网站评论的中文文本挖掘[D]. 武汉:华中科技大学, 2016.
|
[2] |
( Ding Shiqing . Chinese Text Mining Based on Online Customer Review[D]. Wuhan: Huazhong University of Science & Technology, 2016.)
|
[3] |
张璐 . 基于情感计算的网络社区舆情分析预警技术研究[D]. 北京:北京邮电大学, 2018.
|
[3] |
( Zhang Lu . Analysis and Early Warning Technology Research Based on Affective Computing in Online Community[D]. Beijing: Beijing University of Posts and Telecommunications, 2018.)
|
[4] |
严仲培 . 面向旅游在线评论的文本挖掘方法研究[D]. 合肥:合肥工业大学, 2018.
|
[4] |
( Yan Zhongpei . Research on the Method of Text Mining for Travel Online Comments[D]. Hefei: Hefei University of Technology, 2018.)
|
[5] |
杨郁琪 . 基于文本挖掘的用户满意度影响因素研究[D]. 太原:中北大学, 2018.
|
[5] |
( Yang Yuqi . Study on the Influencing Factors of User Satisfaction Based on Text Mining[D]. Taiyuan: North University of China, 2018.)
|
[6] |
范宁 . 基于文本挖掘在民宿满意度中的研究[D]. 桂林:广西师范大学, 2019.
|
[6] |
( Fan Ning . Research on Satisfaction of Homestay Based on Text Mining[D]. Guilin: Guangxi Normal University, 2018.)
|
[7] |
Ramanathan V, Meyyappan T . Twitter Text Mining for Sentiment Analysis on People’s Feedback About Oman Tourism [C]// Proceedings of the 4th MEC International Conference on Big Data and Smart City (ICBDSC), Muscat, Oman. 2019.
|
[8] |
李丽蓉 . 网络舆情分析系统中关键技术研究[J]. 山西警察学院学报, 2019,27(1):43-46.
|
[8] |
( Li Lirong . Research on Key Technologies in Network Public Opinion Analysis System[J]. Journal of Shanxi Police College, 2019,27(1):43-46.)
|
[9] |
Ramadhani A M, Goo H S . Twitter Sentiment Analysis Using Deep Learning Methods [C]// Proceedings of the 7th International Annual Engineering Seminar (InAES), Yogyakarta, Indonesia. 2017.
|
[10] |
Halibas A S, Shaffi A S, Mohamed M A K V . Application of Text Classification and Clustering of Twitter Data for Business Analytics [C]// Proceedings of the 2018 Majan International Conference (MIC), Muscat, Oman. 2018.
|
[11] |
张祥 . 面向政务需求的网络舆情分析方法研究[D]. 成都:电子科技大学, 2017.
|
[11] |
( Zhang Xiang . Research on Public Opinion Analysis Method of the Network for the Needs of Government[D]. Chengdu: University of Electronic Science and Technology of China, 2017.)
|
[12] |
张健立 . 一种基于语义关系图的词义消歧算法[J]. 科技通报, 2015,31(3):228-232,257.
|
[12] |
( Zhang Jianli . Word Sense Disambiguation Algorithm Based on Semantic Relation Graph[J]. Bulletin of Science and Technology, 2015,31(3):228-232,257.)
|
[13] |
张仰森, 郑佳, 李佳媛 . 一种基于语义关系图的词语语义相关度计算模型[J]. 自动化学报, 2018,44(1):87-98.
|
[13] |
( Zhang Yangsen, Zheng Jia, Li Jiayuan . A Model for Calculating Semantic Relatedness of Words Considering Semantic Relationship Graph[J]. Acta Automatica Sinica, 2018,44(1):87-98.)
|
[14] |
王宏显, 周强, 邬晓钧 . 《知网》语义关系图的自动构建[J]. 中文信息学报, 2008,22(5):90-96.
|
[14] |
( Wang Hongxian, Zhou Qiang, Wu Xiaojun . The Automatic Construction of Lexical Semantic Graph Based on HowNet[J]. Journal of Chinese Information Processing, 2008,22(5):90-96.)
|
[15] |
王知津, 郑悦萍 . 图书馆工作与研究[J].图书馆工作与研究, 2013(11):13-19.
|
[15] |
( Wang Zhijin, Zheng Yueping . The Concepts and Types of Semantic Relations in Information Organization[J]. Library Work and Study,2013(11):13-19.)
|
[16] |
Xie W, Zhu F, Jiang J , et al. Topicsketch: Real-time Bursty Topic Detection from Twitter[J]. IEEE Transactions on Knowledge and Data Engineering, 2016,28(8):2216-2229.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|