[Objective] This study aims to detect trending events on a micro-blog platform using data mining algorithms. [Methods] First, we collected micro-blog messages with geographic coordinates from the most popular platform, Sina Weibo, through its API service. Second, we used the K-means, KNN, and decision tree algorithms to construct the geographical patterns of the collected posts. The number of published posts, re-tweets, and comments, as well as user activity and movement strength, were also examined. Third, we compared these geographical patterns with the daily regional micro-blog data to identify breaking news in each area. [Results] Applying the proposed model to data collected on April 15 and 16, 2015, we detected the trending event “Beijing Sandstorm”. [Limitations] The sample size was small, which might influence the results. [Conclusions] Geographic coordinates can help detect trending events on Sina Weibo, and the new method can also support the government’s crisis management strategy and decision-making process.
李进华, 安仲杰. 基于地理坐标的微博事件检测与分析[J]. 现代图书情报技术, 2016, 32(2): 90-101.
Li Jinhua, An Zhongjie. Analyzing Geographical Coordinates Data for Micro-blog Trending Events[J]. New Technology of Library and Information Service, 2016, 32(2): 90-101.
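The Methods summarized in the abstract (clustering geotagged posts, then comparing regional patterns against daily data) can be illustrated with a minimal Python sketch. It is not the authors' implementation: it shows only the K-means step (KNN and decision trees are omitted), it treats longitude and latitude as planar coordinates, and the function names `kmeans`, `count_by_cluster`, and `flag_bursts`, as well as the `ratio` threshold, are hypothetical choices made for illustration.

```python
import math
import random


def kmeans(points, k, iters=50, seed=0):
    """Plain K-means over (lon, lat) tuples; returns a list of k centroids.

    Assumption: Euclidean distance on raw coordinates, which is only a
    rough approximation of ground distance.
    """
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Recompute each centroid as the mean of its members;
        # an empty cluster keeps its previous centroid.
        updated = [
            tuple(sum(axis) / len(members) for axis in zip(*members))
            if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
        if updated == centroids:
            break
        centroids = updated
    return centroids


def count_by_cluster(points, centroids):
    """Count how many posts fall nearest to each centroid."""
    counts = [0] * len(centroids)
    for p in points:
        nearest = min(range(len(centroids)),
                      key=lambda i: math.dist(p, centroids[i]))
        counts[nearest] += 1
    return counts


def flag_bursts(baseline_counts, today_counts, ratio=3.0):
    """Return indices of regions whose post count today exceeds `ratio`
    times the baseline. The threshold rule is an assumption, not the
    comparison used in the paper."""
    return [
        i for i, (base, today) in enumerate(zip(baseline_counts, today_counts))
        if today > ratio * max(base, 1)
    ]


# Hypothetical geotagged posts near two cities (lon, lat).
history = [(116.38, 39.90), (116.40, 39.92), (116.42, 39.88), (116.40, 39.90),
           (121.46, 31.22), (121.48, 31.24), (121.50, 31.20), (121.48, 31.22)]
regions = sorted(kmeans(history, k=2))          # sort by longitude
baseline = count_by_cluster(history, regions)   # typical volume per region
today = history + [(116.41, 39.91)] * 20        # simulated surge in one region
print(flag_bursts(baseline, count_by_cluster(today, regions)))
```

A flagged region index marks where post volume departs sharply from its usual level; in practice the posts from that region would then be inspected to see what event they describe.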