Finding Geographic Locations of Popular Online Topics
Liu Yuwen1,2(),Wang Kai1
1School of Health Management, Bengbu Medical College, Bengbu 233030, China 2College of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China
[Objective] This paper analyzes the geographic distributions of popular online topics, aiming to provide decision-making support for public opinion management and social governance.[Methods] First, we introduced location parameters of comments into the LDA model, and proposed a region-oriented topic recognition model (RO-LDA). Then, we used this model to label texts, topics, locations and vocabularies with location tags. Third, we created text-topics, topic-words and topic-locations matrices. Finally, we identified trending topics and their geographic distributions with the help of topic-words and topic-locations distributions.[Results] We examined the proposed model with real data set. The F value reached 80.05%, which is higher than the existing models.[Limitations] The location tags were set manually, which impacted the accuracy of region recognition.[Conclusions] The proposed method could identify geographic features of trending topics effectively.
刘玉文,王凯. 面向地域的网络话题识别方法*[J]. 数据分析与知识发现, 2020, 4(2/3): 173-181.
Liu Yuwen,Wang Kai. Finding Geographic Locations of Popular Online Topics. Data Analysis and Knowledge Discovery, 2020, 4(2/3): 173-181.
Momtazi S . Unsupervised Latent Dirichlet Allocation for Supervised Question Classification[J]. Information Processing and Management, 2018,54(3):380-393.
( Xu Yuemei, Lv Sining, Cai Lianqiao , et al. Analyzing News Topic Evolution with Convolutional Neural Networks and Topic2Vec[J]. Data Analysis and Knowledge Discovery, 2018,2(9):31-41.)
[3]
Chen L, Zhang H Z, Jose J M , et al. Topic Detection and Tracking on Heterogeneous Information[J]. Journal of Intelligent Information Systems, 2018,51(1):115-137.
( Fu Peng, Lin Zheng, Yuan Fengcheng , et al. Convolutional Neural Network and User Information Based Model for Microblog Topic Tracking[J]. Pattern Recognition and Artificial Intelligence, 2017,30(1):73-80.)
( Zhou Yadong, Liu Xiaoming, Du Youtian , et al. A Method for Identifying the Evolutionary Focuses of Online Social Topics[J]. Chinese Journal of Computers, 2015,38(2):261-271.)
( He Yue, Zhu Can, Zhu Tingting , et al. Research on the Emotional Tendency of Hot Topics in Micro-blogs[J]. Information Studies: Theory & Application, 2018,41(7):155-160.)
( Liao Haihan, Wang Yuefen, Guan Peng . Topic Mining and Viewpoint Recognition of Different Communicators in the Transmission Cycle of Micro-blog Public Opinion[J]. Library and Information Service, 2018,62(19):77-85.)
( Yu Chong, Li Jing, Sun Xudong , et al. Social Media Topic Recognition Based on Word Embedding and Probabilistic Topic Model[J]. Computer Engineering, 2017,43(12):184-191.)
( Fang Xiaofei, Huang Xiaoxi, Wang Rongbo , et al. Identifying Hot Topics from Mobile Complaint Texts[J]. Data Analysis and Knowledge Discovery, 2017,1(2):19-27.)
[10]
Blei D M, Ng A Y, Jordan M I . Latent Dirichlet Allocation[J]. Journal of Machine Learning Research, 2003,3:993-1022.
( Li Ping, Zhang Luyao, Cao Xia , et al. Hybrid Context Recommendation Algorithm Based on Latent Topic[J]. Journal of Electronics and Information Technology, 2018,40(4):957-963.)
[12]
Zou Y P, Ouyang J H, Li X M . Supervised Topic Models with Weighted Words: Multi-Label Document Classification[J]. Frontiers of Information Technology & Electronic Engineering, 2018,19(4):513-523.
( Xian Xuefeng, Cui Zhiming, Zhao Pengpeng , et al. Location-awareness Publication Subscription System Based on Topic Model[J]. Computer Science, 2018,45(3):167-172.)
[15]
Twinandilla S, Adhy S, Surarso B , et al. Multi-Document Summarization Using K-Means and Latent Dirichlet Allocation (LDA)-Significance Sentences[J]. Procedia Computer Science, 2018,135:663-670.
[16]
Chen M L, Wang Q, Li X L . Patch-based Topic Model for Group Detection[J]. Science China Information Sciences, 2017, 60: Article No. 113101.