Please wait a minute...
Data Analysis and Knowledge Discovery  2021, Vol. 5 Issue (2): 32-42    DOI: 10.11925/infotech.2096-3467.2020.1027
Current Issue | Archive | Adv Search |
Identifying Leaders and Dissemination Paths of Public Opinion
Xu Yabin1,2(),Sun Qiutian2
1Beijing Key Laboratory of Internet Culture and Digital Dissemination Research, Beijing Information Science and Technology University, Beijing 100101, China
2School of Computer, Beijing Information Science and Technology University, Beijing 100101, China
Download: PDF (1281 KB)   HTML ( 34
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This study proposes new method to monitor social media, aiming to limit or guide the spread of public opinion. [Methods] First, we constructed an OLMT model to identify opinion leaders based on the dissemination force and topological potential. Then, we modified the Transformer model to build a social media behavior prediction model (MF-Transformer) with high parallelism and attention mechanism. [Results] The proposed models identified opinion leaders and their retweeting behaviors, as well as the main dissemination paths of online public opinion. The recall and accuracy of the predicted results were 92.17% and 99.07%, respectively, which were higher than those of the existing methods. [Limitations] We only examined our new models with data from Sina Weibo. [Conclusions] The proposed models could effectively identify online opinion leaders, as well as predict the dissemination paths of their comments and retweets.

Key wordsPublic Opinion      Opinion Leader      Social Network Behavior Prediction      Dissemination Paths Identification     
Received: 21 October 2020      Published: 24 November 2020
ZTFLH:  TP393  
Fund:National Natural Science Foundation of China(61672101);Open Project of Beijing Key Laboratory of Network Culture and Digital Communication(ICDDXN004);Open Project of Key Laboratory of Information Network Security of Ministry of Public Security(C18601)
Corresponding Authors: Xu Yabin ORCID:0000-0003-2727-3773     E-mail: xyb@bistu.edu.cn

Cite this article:

Xu Yabin, Sun Qiutian. Identifying Leaders and Dissemination Paths of Public Opinion. Data Analysis and Knowledge Discovery, 2021, 5(2): 32-42.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2020.1027     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2021/V5/I2/32

Framework of Propagation Behavior Prediction Model Based on MF-Transformer
Implementation Mechanism of Self-Attention
名称 含义 获取方式
userfols 用户粉丝数 网络爬虫技术爬取用户的基本资料
userinfl 用户影响力 通过意见领袖挖掘算法OLMT
usercati 用户活跃度 原创微博数、转发微博数、评论微博数加权求和
proauth 上游用户的认证情况 爬取用户的认证情况,认证用户记做1,非认证用户记做0
averretw 上游用户的平均被转发次数 上游用户历史微博被转发次数之和与历史微博条数的比值
commmoti 用户的传播积极性 用户最近转发的微博数量与发布微博的总数量的比值
textleng 文本长度 计算需预测被转发与否的微博文本长度
whetherpict 是否包含图片 通过微博文本判断是否包含图片,包含记做1,不包含记做0
whetherurl 是否包含URL 通过微博文本判断是否包含URL,包含记做1,不包含记做0
publishtime 文本发布时间段 爬取文本发布的时间,如5:23记做5
publishacti 发布时间段用户的活跃度 发布时间段内的原创微博数、转发微博数、评论微博数加权求和
textinte 用户对微博文本的兴趣度 计算方法见4.2节
forwardfreq 对上游用户的转发频率 用户转发上游用户的微博数与用户转发的微博总数的比值
transpower 传播源的传播力 计算方法见公式(2)
intersimi 与上游用户的兴趣相似度 计算方法见4.2节
Transmission Behavior Characteristics and Acquisition Methods
Accuracy of Different Mining Algorithms
Coverage of Different Mining Algorithms
Performance of Propagation Prediction Models
The Relationship Between Path Length and Path Number with Covering Ability
Forecasting Result of Public Opinion Communication
[1] Pal A, Counts S. Identifying Topical Authorities in Microblogs[C]//Proceedings of the 4th ACM International Conference on Web Search and Data Mining. ACM, 2011.
[2] 彭丽徽, 李贺, 张艳丰. 基于灰色关联分析的网络舆情意见领袖识别及影响力排序研究——以新浪微博“8·12滨海爆炸事件”为例[J]. 情报理论与实践, 2017,40(9):90-94.
[2] ( Peng Lihui, Li He, Zhang Yanfeng. Research on the Identification and Influence Ranking of Network Public Opinion Leaders Based on Grey Relational Analysis[J]. Information Studies: Theory & Application, 2017,40(9):90-94.)
[3] 陈芬, 付希, 何源, 等. 融合社会网络分析与影响力扩散模型的微博意见领袖发现研究[J]. 数据分析与知识发现, 2018,2(12):60-67.
[3] ( Chen Fen, Fu Xi, He Yuan, et al. Identifying Weibo Opinion Leaders with Social Network Analysis and Influence Diffusion Model[J]. Data Analysis and Knowledge Discovery, 2018,2(12):60-67.)
[4] Matsumura N, Ohsawa Y, Ishizuka M. Influence Diffusion Model in Text-Based Communication[J]. Transactions of the Japanese Society for Artificial Intelligence, 2002,17(3):259-267.
doi: 10.1527/tjsai.17.259
[5] 徐慧, 冯雪晴, 施磊磊, 等. 基于影响力扩散内容模型的舆论领袖识别方法[J]. 软件导刊, 2016,15(3):9-11.
[5] ( Xu Hui, Feng Xueqing, Shi Leilei, et al. An Opinion Leader Identification Method Based on Influence Diffusion Content Model[J]. Software Guide, 2016,15(3):9-11.)
[6] 樊兴华, 赵静, 方滨兴, 等. 影响力扩散概率模型及其用于意见领袖发现研究[J]. 计算机学报, 2013,36(2):360-367.
[6] ( Fan Xinghua, Zhao Jing, Fang Binxing, et al. Influence Diffusion Probability Model and Utilizing It to Identify Network Opinion Leader[J]. Chinese Journal of Computers, 2013,36(2):360-367.)
[7] 曹玖新, 陈高君, 吴江林, 等. 基于多维特征分析的社交网络意见领袖挖掘[J]. 电子学报, 2016,44(4):898-905.
doi: 10.3969/j.issn.0372-2112.2016.04.021
[7] ( Cao Jiuxin, Chen Gaojun, Wu Jianglin, et al. Multi-Feature Based Opinion Leader Mining in Social Networks[J]. Acta Electronica Sinica, 2016,44(4):898-905.)
doi: 10.3969/j.issn.0372-2112.2016.04.021
[8] 黄贤英, 阳安志, 刘小洋, 等. 一种改进的微博用户影响力评估算法[J]. 计算机工程, 2019,45(12):294-299.
[8] ( Huang Xianying, Yang Anzhi, Liu Xiaoyang, et al. An Improved Algorithm for Microblog User Influence Evaluation[J]. Computer Engineering, 2019,45(12):294-299.)
[9] 曹苗苗. 社交网络中的意见领袖挖掘方法研究[D]. 重庆: 重庆邮电大学, 2016.
[9] ( Cao Miaomiao. Research of Opinion Leaders Mining Method in Social Network[D]. Chongqing: Chongqing University of Posts and Telecommunications, 2016.)
[10] Li H, Zhao Y, Bai J, et al. Comparative Study in Complex Network: Node Degree and Topological Potential[C]//Proceedings of the 2nd International Conference on Image, Vision and Computing (ICIVC). IEEE, 2017.
[11] Sun R, Luo W. Using Topological Potential Method to Evaluate Node Importance in Public Opinion[C]//Proceedings of the 2017 International Conference on Electronic Industry & Automation. EIA, 2017.
[12] 肖俐平, 孟晖, 李德毅. 基于拓扑势的网络节点重要性排序及评价方法[J]. 武汉大学学报(信息科学版), 2008,33(4):379-383.
[12] ( Xiao Liping, Meng Hui, Li Deyi. Approach to Node Ranking in a Network Based on Topology Potential[J]. Geomatics and Information Science of Wuhan University, 2008,33(4):379-383.)
[13] 霍明奎, 竺佳琪, 赵丹. 移动环境下微博舆情信息传播网络结构研究[J]. 情报科学, 2019,37(5):98-102, 107.
[13] ( Huo Mingkui, Zhu Jiaqi, Zhao Dan. Propagation Characteristics and Network Structure of Micro-blog Public Opinion Information in Mobile Environment[J]. Information Science, 2019,37(5):98-102, 107.)
[14] Liu L, Zheng M, Xie Y. Information Propagation Model for Social Network Based on Information Characteristic and Social Status[C]//Proceedings of the 2016 International Conference on Human Centered Computing. Springer International Publishing, 2016.
[15] 石小月. 微博中环境风险舆情的传播机制研究——以“泉港碳九泄露事件”为例[D]. 保定: 河北大学, 2019.
[15] ( Shi Xiaoyue. Research on the Spreading Mechanism of Environmental Risk and Public Opinion in Weibo —Taking the Pollution of Carbon Harbor in Quangang as an Example[D]. Baoding: Hebei University, 2019.)
[16] Li D H, Zhang Y Q, Chen X, et al. Propagation Regularity of Hot Topics in Sina Weibo Based on SIR Model—A Simulation Research[C]//Proceedings of the 2014 Communications and IT Applications Conference. IEEE, 2014.
[17] Zhou X, Hu Y, Wu Y, et al. Influence Analysis of Information Erupted on Social Networks Based on SIR Model[J]. International Journal of Modern Physics C, 2015,26(2):1550018.
doi: 10.1142/S0129183115500187
[18] Chuan A, Chen B, Liu L, et al. Design and Implementation of Information Dissemination Simulation Algorithm in Large-Scale Complex Network Based on Spark[C]//Proceedings of the 3rd International Conference on Data Science in Cyberspace (DSC). IEEE, 2018.
[19] Kim J, Lee W, Yu H. CT-IC: Continuously Activated and Time-restricted Independent Cascade Model for Viral Marketing[C]//Proceedings of the 12th International Conference on Data Mining. IEEE, 2012.
[20] 王家坤, 王新华. 一种基于线性阈值的网络谣言离散传播模型[J]. 情报科学, 2019,37(6):163-169.
[20] ( Wang Jiakun, Wang Xinhua. A Discrete Propagation Model of Internet Rumor Based on Linear Threshold[J]. Information Science, 2019,37(6):163-169.)
[21] 曹玖新, 吴江林, 石伟, 等. 新浪微博网信息传播分析与预测[J]. 计算机学报, 2014,37(4):779-790.
[21] ( Cao Jiuxin, Wu Jianglin, Shi Wei, et al. Sina Weibo Information Diffusion Analysis and Prediction[J]. Chinese Journal of Computers, 2014,37(4):779-790.)
[22] Zhang L, Li H, Zhao C, et al. Social Network Information Propagation Model Based on Individual Behavior[J]. China Communications, 2017,14(7):1-15.
doi: 10.1109/CC.2017.8246482
[23] 刘玮, 贺敏, 王丽宏, 等. 基于用户行为特征的微博转发预测研究[J]. 计算机学报, 2016,39(10):1992-2006.
[23] ( Liu Wei, He Min, Wang Lihong, et al. Research on Microblog Retweeting Prediction Based on User Behavior Features[J]. Chinese Journal of Computers, 2016,39(10):1992-2006.)
[24] Zhang L, Zhao W, Zhao C. Research on Information Propagation Method Based on Individual User Characteristics[C]//Proceedings of the 13th International Conference on Computational Intelligence and Security (CIS). IEEE Computer Society, 2017.
[25] Li Z R, Lu T J, Shi W H, et al. Predicting the Scale of Information Diffusion in Social Network Services[J]. The Journal of China Universities of Posts and Telecommunications, 2013,20(S1):100-104.
doi: 10.1016/S1005-8885(13)60239-3
[26] 陈煜森. 基于表示学习的网络文本谣言的传播预测[D]. 武汉: 武汉大学, 2018.
[26] ( Chen Yusen. Propagation Prediction of Network Text Rumors Based on Representation Learning[D]. Wuhan: Wuhan University, 2018.)
[27] 辛悦. 社交网络信息传播及预测算法研究[D]. 西安:西安电子科技大学, 2018.
[27] ( Xin Yue. Information Dissemination and Prediction Algorithms of Social Network[D]. Xi’an: Xidian University, 2018.)
[28] 张健沛, 李泓波, 杨静, 等. 基于拓扑势的网络社区结点重要度排序算法[J]. 哈尔滨工程大学学报, 2012,33(6):745-752.
[28] ( Zhang Jianpei, Li Hongbo, Yang Jing, et al. An Importance-Sorting Algorithm of Network Community Nodes Based on Topological Potential[J]. Journal of Harbin Engineering University, 2012,33(6):745-752.)
[29] 齐超, 陈鸿昶, 于洪涛. 基于用户行为综合分析的微博用户影响力评价方法[J]. 计算机应用研究, 2014,31(7):2004-2007.
[29] ( Qi Chao, Chen Hongchang, Yu Hongtao. Method of Evaluating Micro-blog Users’ Influence Based on Comprehensive Analysis of User Behavior[J]. Application Research of Computers, 2014,31(7):2004-2007.)
[1] Fan Tao,Wang Hao,Wu Peng. Sentiment Analysis of Online Users' Negative Emotions Based on Graph Convolutional Network and Dependency Parsing[J]. 数据分析与知识发现, 2021, 5(9): 97-106.
[2] Wang Xiwei,Jia Ruonan,Wei Yanan,Zhang Liu. Clustering User Groups of Public Opinion Events from Multi-dimensional Social Network[J]. 数据分析与知识发现, 2021, 5(6): 25-35.
[3] Ma Yingxue,Zhao Jichang. Patterns and Evolution of Public Opinion on Weibo During Natural Disasters: Case Study of Typhoons and Rainstorms[J]. 数据分析与知识发现, 2021, 5(6): 66-79.
[4] Wang Nan,Li Hairong,Tan Shuru. Predicting of Public Opinion Reversal with Improved SMOTE Algorithm and Ensemble Learning[J]. 数据分析与知识发现, 2021, 5(4): 37-48.
[5] Cheng Tiejun, Wang Man, Huang Baofeng, Feng Lanping. Predicting Online Public Opinion in Emergencies Based on CEEMDAN-BP[J]. 数据分析与知识发现, 2021, 5(11): 59-67.
[6] Shao Qi,Mu Dongmei,Wang Ping,Jin Chunyan. Identifying Subjects of Online Opinion from Public Health Emergencies[J]. 数据分析与知识发现, 2020, 4(9): 68-80.
[7] Liang Ye,Li Xiaoyuan,Xu Hang,Hu Yiran. CLOpin: A Cross-Lingual Knowledge Graph Framework for Public Opinion Analysis and Early Warning[J]. 数据分析与知识发现, 2020, 4(6): 1-14.
[8] Deng Jiangao,Zhang Xuan,Fu Zhu,Wei Qingming. Tracking Online Public Opinion Based on System Dynamics: Case Study of “Xiangshui Explosion Accident”[J]. 数据分析与知识发现, 2020, 4(2/3): 110-121.
[9] Liang Yanping,An Lu,Liu Jing. Topic Resonance of Micro-blogs on Similar Public Health Emergencies[J]. 数据分析与知识发现, 2020, 4(2/3): 122-133.
[10] Ding Shengchun,Yu Fengyang,Li Zhen. Identifying Potential Trending Topics of Online Public Opinion[J]. 数据分析与知识发现, 2020, 4(2/3): 29-38.
[11] Huang Wei,Zhao Jiangyuan,Yan Lu. Empirical Research on Topic Drift Index for Trending Network Events[J]. 数据分析与知识发现, 2020, 4(11): 92-101.
[12] Lin Wang,Ke Wang,Jiang Wu. Public Opinion Propagation and Evolution of Public Health Emergencies in Social Media Era: A Case Study of 2018 Vaccine Event[J]. 数据分析与知识发现, 2019, 3(4): 42-52.
[13] Jiang Wu,Yinghui Zhao,Jiahui Gao. Research on Weibo Opinion Leaders Identification and Analysis in Medical Public Opinion Incidents[J]. 数据分析与知识发现, 2019, 3(4): 53-62.
[14] Yanshuang Mei,Hengmin Zhu,Jing Wei. A Study on the Mechanism of Media Collaboration on the Spread of Internet Public Opinion[J]. 数据分析与知识发现, 2019, 3(2): 65-71.
[15] Fen Chen,Xiaohuan Gao,Yue Peng,Yuan He,Chunxiang Xue. Identifying Weibo Opinion Leaders with Text Sentiment Analysis[J]. 数据分析与知识发现, 2019, 3(11): 120-128.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn