Detecting Social Media Fake News with Semantic Consistency Between Multi-model Contents

doi:10.11925/infotech.2096-3467.2020.0884

Data Analysis and Knowledge Discovery

2021, Vol. 5

Issue (5): 21-29 DOI: 10.11925/infotech.2096-3467.2020.0884

Current Issue | Archive | Adv Search

Detecting Social Media Fake News with Semantic Consistency Between Multi-model Contents

Zhang Guobiao^1,²,Li Jie³(

)

¹School of Information Management, Wuhan University, Wuhan 430072, China
²Institute for Information Retrieval and Knowledge Mining, Wuhan University, Wuhan 430072, China
³School of Sociology, Soochow University, Suzhou 215000, China

Download: PDF (2867 KB) HTML ( 49 )
Export: BibTeX | EndNote (RIS)

Abstract

[Objective] This study aims to detect fake news on social media earlier and curb the dissemination of mis/dis-information. [Methods] Based on the features of news images and texts, we mapped the images to semantic tags and calculated the semantic consistency between images and texts. Then, we constructed a model to detect fake news. Finally, we examined our new model with the FakeNewsNet dataset. [Results] The F1 value of our model was up to 0.775 on PolitiFact data and 0.879 on GossipCop data. [Limitations] Due to the limits of existing annotation methods for image semantics, we could not accurately describe image contents, and calculate semantic consistency. [Conclusions] The constructed model could effectively detect fake news from social media.

Key words： Fake News Detection Social Media Multi-modal Feature Fusion Semantic Consistency Deep Learning

Received: 08 September 2020 Published: 24 November 2020

ZTFLH:

TP393

Fund:The work is supported by Soochow University 2020 Humanities and Social Sciences Excellent Academic Team Project(NH33711520)

Corresponding Authors: Li Jie E-mail: allison_lijie@163.com

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	Guobiao Zhang
	Jie Li

Cite this article:

Zhang Guobiao,Li Jie. Detecting Social Media Fake News with Semantic Consistency Between Multi-model Contents. Data Analysis and Knowledge Discovery, 2021, 5(5): 21-29.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2020.0884 OR https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2021/V5/I5/21

An Example of Image and Text Semantic Inconsistency of Fake News

Image Label Mapping Process

Social Media Fake News Detection Model Based on Multi-modal Feature Fusion

FakeNewsNet Experimental Data

Experimental Parameter Settings

Fake News Detection Results

Average Value of Semantic Consistency of Each CNN Model

Examples of News Text and Image Semantic Consistency

[1]	Aldwairi M, Alwahedi A. Detecting Fake News in Social Media Networks[J]. Procedia Computer Science, 2018,141:215-222. doi: 10.1016/j.procs.2018.10.171
[2]	Kim A, Moravec P L, Dennis A R. Combating Fake News on Social Media with Source Ratings: The Effects of User and Expert Reputation Ratings[J]. Journal of Management Information Systems, 2019,36(3):931-968. doi: 10.1080/07421222.2019.1628921
[3]	Shu K, Mahudeswaran D, Wang S, et al. Hierarchical Propagation Networks for Fake News Detection: Investigation and Exploitation[C]// Proceedings of the 14th International AAAI Conference on Web and Social Media. 2020.
[4]	Qi P, Cao J, Yang T, et al. Exploiting Multi-domain Visual Information for Fake News Detection[C]// Proceedings of the 19th IEEE International Conference on Data Mining (ICDM), Beijing, China. USA: IEEE, 2019.
[5]	Castillo C, Mendoza M, Poblete B. Information Credibility on Twitter[C]// Proceedings of the 20th International Conference on World Wide Web, Hyderabad, India. New York, USA: ACM, 2011.
[6]	Rashkin H, Choi E, Jang J Y, et al. Truth of Varying Shades: Analyzing Language in Fake News and Political Fact-checking[C]// Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark. USA: ACL, 2017.
[7]	Ma J, Gao W, Mitra P, et al. Detecting Rumors from Microblogs with Recurrent Neural Networks[C]// Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI), New York, USA. New York, USA: ACM, 2016.
[8]	Popat K, Mukherjee S, Yates A, et al. DeClarE: Debunking Fake News and False Claims Using Evidence-Aware Deep Learning[C]// Proceeding of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), Brussels, Belgium. USA: ACL, 2018: 22-32.
[9]	Jin Z, Cao J, Guo H, et al. Multimodal Fusion with Recurrent Neural Networks for Rumor Detection on Microblogs[C]// Proceedings of the 25th ACM International Conference on Multimedia, Mountain View, USA. New York, USA: ACM, 2017: 795-816.
[10]	Wang Y, Ma F, Jin Z, et al. EANN: Event Adversarial Neural Networks for Multi-modal Fake News Detection[C]// Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK. New York, USA: ACM, 2018.
[11]	Khattar D, Goud J S, Gupta M, et al. MVAE: Multimodal Variational Autoencoder for Fake News Detection[C]// Proceedings of the 2019 World Wide Web Conference. ACM, 2019.
[12]	Sing V K, Ghosh I, Sonagara D. Detecting Fake News Stories via Multimodal Analysis[J]. Journal of the Association for Information Science and Technology, 2021,72(1):3-17. doi: 10.1002/asi.v72.1
[13]	鲍远福. 新媒体文本表意论:从“语图关系”到“语图间性”[J]. 南京邮电大学学报(社会科学版), 2016,18(1):11-22.
[13]	( Bao Yuanfu. Ideographic Text of New Media: From “Language-icon Relationship” to “Language-Icon Intertextuality”[J]. Journal of Nanjing University of Posts and Telecommunications (Social Science), 2016,18(1):11-22.)
[14]	Gombrich E H. The Image and the Eye: Further Studies in the Psychology of Pictorial Representation[M]. Oxford: Phaidon Press, 1982: 150.
[15]	Deng J, Dong W, Socher R, et al. ImageNet: A Large-scale Hierarchical Image Database[C]// Proceedings of the 2009 IEEE Conference on Computer Vision & Pattern Recognition, Miami, USA. USA: IEEE, 2009.
[16]	Krizhevsky A, Sutskever I, Hinton G E. Imagenet Classification with Deep Convolutional Neural Networks[C]// Proceedings of the 25th International Conference on Neural Information Processing Systems (NIPS), Lake Tahoe, USA. 2012: 1097-1105.
[17]	Mikolov T, Sutskever I, Chen K, et al. Distributed Representations of Words and Phrases and Their Compositionality[C]// Proceedings of the 26th International Conference on Neural Information Processing Systems. 2013: 3111-3119.
[18]	Maas A L, Daly R E, Pham P T, et al. Learning Word Vectors for Sentiment Analysis[C]// Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, USA. New York, USA: ACM, 2011: 142-150.
[19]	Gentzkow M, Shapiro J M, Stone D F. Media Bias in the Marketplace: Theory[R]. National Bureau of Economic Research, Inc., 2014: 623-645.
[20]	Hochreiter S, Schmidhuber J. Long Short-Term Memory[J]. Neural Computation, 1997,9(8):1735-1780. pmid: 9377276
[21]	Jibril T A, Abdullah M H. Relevance of Emoticons in Computer-Mediated Communication Contexts: An Overview[J]. Asian Social Ence, 2013,9(4):201-207.
[22]	Yoon J, Chung E. Image Use in Social Network Communication: A Case Study of Tweets on the Boston Marathon Bombing[J]. Information Research, 2016,21(1):106-116.
[23]	He K, Zhang X, Ren S, et al. Deep Residual Learning for Image Recognition[C]// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, USA. USA: IEEE, 2016.
[24]	Mikolov T, Chen K, Corrado G, et al. Efficient Estimation of Word Representations in Vector Space[OL]. arXiv Preprint, arXiv: 1301. 3781.
[25]	Simonyan K, Zisserman A. Very Deep Convolutional Networks for Large-scale Image Recognition[OL]. arXiv Preprint, arXiv: 1409. 1556.
[26]	Szegedy C, Liu W, Jia Y, et al. Going Deeper with Convolutions[C]// Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, USA. USA: IEEE, 2015.
[27]	Chollet F. Xception: Deep Learning with Depthwise Separable Convolutions[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, USA. USA: IEEE, 2017.
[28]	Shu K, Mahudeswaran D, Wang S, et al. Fakenewsnet: A Data Repository with News Content, Social Context and Dynamic Information for Studying Fake News on Social Media[OL]. arXiv Preprint ,arXiv: 1809. 01286.
[29]	Autonomio Talos[EB/OL]. [ 2020- 11- 07]. http://github.com/autonomio/talos .

[1]	Zhou Zeyu,Wang Hao,Zhao Zibo,Li Yueyan,Zhang Xiaoqin. Construction and Application of GCN Model for Text Classification with Associated Information[J]. 数据分析与知识发现, 2021, 5(9): 31-41.
[2]	Xu Yuemei, Wang Zihou, Wu Zixin. Predicting Stock Trends with CNN-BiLSTM Based Multi-Feature Integration Model[J]. 数据分析与知识发现, 2021, 5(7): 126-138.
[3]	Zhao Danning,Mu Dongmei,Bai Sen. Automatically Extracting Structural Elements of Sci-Tech Literature Abstracts Based on Deep Learning[J]. 数据分析与知识发现, 2021, 5(7): 70-80.
[4]	Huang Mingxuan,Jiang Caoqing,Lu Shoudong. Expanding Queries Based on Word Embedding and Expansion Terms[J]. 数据分析与知识发现, 2021, 5(6): 115-125.
[5]	Ma Yingxue,Zhao Jichang. Patterns and Evolution of Public Opinion on Weibo During Natural Disasters： Case Study of Typhoons and Rainstorms[J]. 数据分析与知识发现, 2021, 5(6): 66-79.
[6]	Xie Hao,Mao Jin,Li Gang. Sentiment Classification of Image-Text Information with Multi-Layer Semantic Fusion[J]. 数据分析与知识发现, 2021, 5(6): 103-114.
[7]	Zhong Jiawa,Liu Wei,Wang Sili,Yang Heng. Review of Methods and Applications of Text Sentiment Analysis[J]. 数据分析与知识发现, 2021, 5(6): 1-13.
[8]	Hu Haotian,Ji Jinfeng,Wang Dongbo,Deng Sanhong. An Integrated Platform for Food Safety Incident Entities Based on Deep Learning[J]. 数据分析与知识发现, 2021, 5(3): 12-24.
[9]	Zhang Qi,Jiang Chuan,Ji Youshu,Feng Minxuan,Li Bin,Xu Chao,Liu Liu. Unified Model for Word Segmentation and POS Tagging of Multi-Domain Pre-Qin Literature[J]. 数据分析与知识发现, 2021, 5(3): 2-11.
[10]	Lv Xueqiang,Luo Yixiong,Li Jiaquan,You Xindong. Review of Studies on Detecting Chinese Patent Infringements[J]. 数据分析与知识发现, 2021, 5(3): 60-68.
[11]	Cheng Bin,Shi Shuicai,Du Yuncheng,Xiao Shibin. Keyword Extraction for Journals Based on Part-of-Speech and BiLSTM-CRF Combined Model[J]. 数据分析与知识发现, 2021, 5(3): 101-108.
[12]	Chang Chengyang,Wang Xiaodong,Zhang Shenglei. Polarity Analysis of Dynamic Political Sentiments from Tweets with Deep Learning Method[J]. 数据分析与知识发现, 2021, 5(3): 121-131.
[13]	Feng Yong,Liu Yang,Xu Hongyan,Wang Rongbing,Zhang Yonggang. Recommendation Model Incorporating Neighbor Reviews for GRU Products[J]. 数据分析与知识发现, 2021, 5(3): 78-87.
[14]	Li Danyang, Gan Mingxin. Music Recommendation Method Based on Multi-Source Information Fusion[J]. 数据分析与知识发现, 2021, 5(2): 94-105.
[15]	Yu Chuanming, Zhang Zhengang, Kong Lingge. Comparing Knowledge Graph Representation Models for Link Prediction[J]. 数据分析与知识发现, 2021, 5(11): 29-44.

Viewed

Full text

Abstract

Cited

Shared

Discussed