Please wait a minute...
Data Analysis and Knowledge Discovery  2017, Vol. 1 Issue (8): 18-30    DOI: 10.11925/infotech.2096-3467.2017.08.03
Orginal Article Current Issue | Archive | Adv Search |
Identifying Topics of Online Public Opinion
Li Zhen, Ding Shengchun(), Wang Nan
Department of Information Management, Nanjing University of Science and Technology, Nanjing 210094, China
Download: PDF (5068 KB)   HTML ( 5
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper aims to identify the topics of online public opinion. [Methods] We constructed a model to extract public opinion based on the information content of the Weibo posts, the relationship among the users, and user behaviors. [Results] We built a public opinion network, extracted and clustered relevant topics, constructed a two-mode network of “user-topic” and evolution of the opinion topics. The proposed method could identify topics of online public opinion effectively. [Limitations] The influence of users’ attributes on topic identification needed to be investigated. [Conclusions] We could identify the topics of online public opinion based on the social network analysis with the help of LDA model.

Key wordsNetwork Public Opinion      Social Network      LDA Model      Topic Identification      Opinion Topic     
Received: 31 May 2017      Published: 28 September 2017
ZTFLH:  TP391 G350  

Cite this article:

Li Zhen,Ding Shengchun,Wang Nan. Identifying Topics of Online Public Opinion. Data Analysis and Knowledge Discovery, 2017, 1(8): 18-30.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2017.08.03     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2017/V1/I8/18

观点主题
1. 猪肉国内贵, 国外便宜
2. 进口猪肉就像日本买电饭煲, 香港买奶粉, 不是卖国贼
3. (卖国贼说法)哗众取宠, 不代表政府和社会主流价值观, 不值得关注
4. 政府应该对农业进行补贴, 控制市场
5. 国内物价都比国外高, 愿意去国外生活
6. 双汇采用真猪肉
7. 国内物价高, 东西造的质量差
8. 双汇收购是因为美帝生猪有价格优势
9. 买便宜东西是人性使然, 是爱国行为
10. 国外农产品远渡重洋, 经海关收税后还比国内便宜, 值得深思
微博编号 所属观点
主题编号
微博编号 所属观点
主题编号
1 Topic2 115 Topic2
2 Topic1 116 Topic7
3 Topic2 117 Topic6
4 Topic9 118 Topic8
5 Topic6 119 Topic5
6 Topic10 120 Topic4
日期 4月
2日
4月
7日
4月
8日
4月
9日
4月9日
之后
平均
准确率
准确率 0.66 0.53 0.71 0.47 0.6 0.56
观点主题编号 节点入度归一化 观点主题
6 0.19 Topic5
7 0.15 Topic1
8 0.13 Topic9
9 0.13 Topic8
10 0.11 Topic6
11 0.10 Topic2
12 0.06 Topic4
13 0.06 Topic0
14 0.04 Topic3
15 0.03 Topic7
[1] Wu W, Zhang B, Ostendorf M.Automatic Generation of Personalized Annotation Tags for Twitter Users[C]//// Proceedings of the 2010 Annual Conference of the North American Chapter of Association for Computational Linguistics, Los Angeles, California, USA. Association for Computational Linguistics, 2010: 689-692.
[2] Narang K, Nagar S, Mehta S, et al.Discovery and Analysis of Evolving Topical Social Discussions on Unstructured Microblogs[A]// Advances in Information Retrieval[M]. Berlin, Heidelberg: Springer, 2013: 545-556.
[3] Kim H G, Lee S, Kyeong S.Discovering Hot Topics Using Twitter Streaming Data Social Topic Detection and Geographic Clustering[C]////Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis & Mining, Niagara, Ontario, Canada. New York, USA: ACM, 2013: 1215-1220.
[4] Nguyen D T, Jung J E.Privacy-preserving Discovery of Topic-based Events from Social Sensor Signals: An Experimental Study on Twitter[J]. Scientific World Journal, 2014, 67(3): 435-444.
doi: 10.1155/2014/204785 pmid: 24955388
[5] Guo J, Zhang P, Tan J L, et al.Mining Hot Topics from Twitter Streams[J]. Procedia Computer Science, 2012, 9(11): 2008-2011.
doi: 10.1016/j.procs.2012.04.224
[6] 叶川, 马静. 多媒体微博评论信息的主题发现算法研究[J]. 现代图书情报技术, 2015(11): 51-59.
[6] (Ye Chuan, Ma Jing.Topic Discovery Algorithm for Multimedia Microblog Comments Information[J]. New Technology of Library and Information Service, 2015(11): 51-59.)
[7] 唐晓波, 房小可. 基于文本聚类与LDA相融合的微博主题检索模型研究[J]. 情报理论与实践, 2013, 36(8): 85-90.
[7] (Tang Xiaobo, Fang Xiaoke.Micro Blog Topic Retrieval Model Research Based on Text Clustering and LDA[J]. Information Studies: Theory & Application, 2013, 36(8): 85-90.)
[8] 伍万坤, 吴清烈, 顾锦江. 基于EM-LDA综合模型的电商微博热点话题发现[J]. 现代图书情报技术, 2015(11): 33-40.
[8] (Wu Wankun, Wu Qinglie, Gu Jinjiang.Research on Hot Topic Discovery of Microblog Based on EM-LDA Comprehensive Model[J]. New Technology of Library and Information Service, 2015(11): 33-40.)
[9] 黄炜, 程宝生, 杨青. 基于本体的网络群体性事件主题发现研究[J]. 图书情报工作, 2012, 56(20): 47-52.
[9] (Huang Wei, Cheng Baosheng, Yang Qing.Topic Discovery of Network Group Events Based on Ontology[J]. Library and Information Service, 2012, 56(20): 47-52.)
[10] Huang S, Liu Y, Dang D.Burst Topic Discovery and Trend Tracing Based on Storm[J]. Physica A: Statistical Mechanics and Its Applications, 2014, 416: 331-339.
doi: 10.1016/j.physa.2014.08.059
[11] 夏梦南, 杜永萍, 左本欣. 基于依存分析与特征组合的微博情感分析[J]. 山东大学学报: 理学版, 2014, 49(11): 22-30.
doi: 10.6040/j.issn.1671-9352.3.2014.074
[11] (Xia Mengnan, Du Yongping, Zuo Benxin.Micro-blog Opinion Analysis Based on Syntactic Dependency and Feature Combination[J]. Journal of Shandong University: Natural Science, 2014, 49(11): 22-30.)
doi: 10.6040/j.issn.1671-9352.3.2014.074
[12] Deng J, Deng K, Li Y, et al.Hot Topic Detection Based on Complex Networks[C]////Proceedings of the 10th International Conference on Fuzzy Systems and Knowledge Discovery. 2013.
[13] Yin Z, Cao L, Gu Q, et al.Latent Community Topic Analysis: Integration of Community Discovery with Topic Modeling[J]. ACM Transactions on Intelligent Systems and Technology, 2012, 3(4): 67-83.
doi: 10.1145/2337542.2337548
[14] 王曰芬, 杭伟梁, 丁洁. 微博舆情社会网络关键节点识别与应用研究[J]. 情报资料工作, 2016(3): 6-11.
[14] (Wang Yuefen, Hang Weiliang, Ding Jie.Identification and Application of Microblog Public Opinion Social Network Critical Node[J]. Information and Documentation Services, 2016(3): 6-11.)
[15] 周杰, 林琛, 李弼程. 面向网络评论的观点主题识别研究[J]. 情报学报, 2010, 29(5): 858-863.
doi: 10.3772/j.issn.1000-0135.2010.05.014
[15] (Zhou Jie, Lin Chen, Li Bicheng.Research on the Identification of Opinion Topic Expressed in Web Comments[J]. Journal of the China Society for Scientific and Technical Information, 2010, 29(5): 858-863.)
doi: 10.3772/j.issn.1000-0135.2010.05.014
[16] 丁晟春, 王颖, 李霄. 基于SVM的中文微博情绪分析研究[J]. 情报资料工作, 2016(3): 28-33.
[16] (Ding Shengchun, Wang Ying, Li Xiao.SVM-based Chinese Microblog Sentiment Analysis[J]. Information and Documentation Services, 2016(3): 28-33.)
[17] 陈晓美, 高铖, 关心惠. 网络舆情观点提取的LDA主题模型方法[J]. 图书情报工作, 2015, 59(21): 21-26.
[17] (Chen Xiaomei, Gao Cheng, Guan Xinhui.LDA Theme Model Method for the Extraction of Network Public Opinion[J]. New Technology of Library and Information Service, 2015, 59(21): 21-26.)
[18] 姚兆旭, 马静. 面向微博话题的“主题+观点”词条抽取算法研究[J]. 现代图书情报技术, 2016(7-8): 78-86.
[18] (Yao Zhaoxu, Ma Jing.Research on Topic Extraction Algorithm Based on “Topic + Opinion” for Microblog[J]. New Technology of Library and Information Service, 2016(7-8): 78-86.)
[1] Wang Xiwei,Jia Ruonan,Wei Yanan,Zhang Liu. Clustering User Groups of Public Opinion Events from Multi-dimensional Social Network[J]. 数据分析与知识发现, 2021, 5(6): 25-35.
[2] Ma Yingxue,Zhao Jichang. Patterns and Evolution of Public Opinion on Weibo During Natural Disasters: Case Study of Typhoons and Rainstorms[J]. 数据分析与知识发现, 2021, 5(6): 66-79.
[3] Gao Yilin,Min Chao. Comparing Technology Diffusion Structure of China and the U.S. to Countries Along the Belt and Road[J]. 数据分析与知识发现, 2021, 5(6): 80-92.
[4] Li Yueyan,Wang Hao,Deng Sanhong,Wang Wei. Research Trends of Information Retrieval——Case Study of SIGIR Conference Papers[J]. 数据分析与知识发现, 2021, 5(4): 13-24.
[5] Xu Yabin, Sun Qiutian. Identifying Leaders and Dissemination Paths of Public Opinion[J]. 数据分析与知识发现, 2021, 5(2): 32-42.
[6] Cai Yongming,Liu Lu,Wang Kewei. Identifying Key Users and Topics from Online Learning Community[J]. 数据分析与知识发现, 2020, 4(6): 69-79.
[7] Liu Yuwen,Wang Kai. Finding Geographic Locations of Popular Online Topics[J]. 数据分析与知识发现, 2020, 4(2/3): 173-181.
[8] Ye Guanghui,Xu Tong,Bi Chongwu,Li Xinyue. Analyzing Evolution of City Tourism Portraits with Multi-Dimensional Features and LDA Model[J]. 数据分析与知识发现, 2020, 4(11): 121-130.
[9] Peng Guan,Yuefen Wang. Advances in Patent Network[J]. 数据分析与知识发现, 2020, 4(1): 26-39.
[10] Yunfei Shao,Dongsu Liu. Classifying Short-texts with Class Feature Extension[J]. 数据分析与知识发现, 2019, 3(9): 60-67.
[11] Yan Wen,Lijian Ma,Qingtian Zeng,Wenyan Guo. POI Recommendation Based on Geographic and Social Relationship Preferences[J]. 数据分析与知识发现, 2019, 3(8): 30-39.
[12] Liqing Qiu,Wei Jia,Xin Fan. Influence Maximization Algorithm Based on Overlapping Community[J]. 数据分析与知识发现, 2019, 3(7): 94-102.
[13] Xiaolan Wu,Chengzhi Zhang. Analysis of Knowledge Flow Based on Academic Social Networks:
A Case Study of ScienceNet.cn
[J]. 数据分析与知识发现, 2019, 3(4): 107-116.
[14] Xinrui Wang,Yue He. Predicting Stock Market Fluctuations with Social Media Behaviors: Case Study of Sina Finance Blog[J]. 数据分析与知识发现, 2019, 3(11): 108-119.
[15] Xu Yanhua,Miao Yujie,Miao Lin,Lv Xueqiang. Generating HSK Writing Essays with LDA Model[J]. 数据分析与知识发现, 2018, 2(9): 80-87.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn