Please wait a minute...
Advanced Search
数据分析与知识发现  2023, Vol. 7 Issue (11): 68-78     https://doi.org/10.11925/infotech.2096-3467.2022.0905
  研究论文 本期目录 | 过刊浏览 | 高级检索 |
基于多模态异质图的社交媒体谣言检测模型*
强子珊,顾益军()
中国人民公安大学信息网络安全学院 北京 100038
Detecting Social Media Rumors Based on Multimodal Heterogeneous Graph
Qiang Zishan,Gu Yijun()
College of Information and Cyber Security, People’s Public Security University of China, Beijing 100038, China
全文: PDF (942 KB)   HTML ( 10
输出: BibTeX | EndNote (RIS)      
摘要 

目的】 验证谣言不同模态之间存在关联性,以提高谣言检测准确率,进而提出一种基于多模态异质图的社交媒体谣言检测模型。【方法】 以社交平台上多模态的帖子为研究对象,首先通过预处理提取文本、图片两种模态信息及用户属性信息的特征表示,按照文本、图片、用户三者间的关联关系构建异质图,然后按照指定的元路径提取文本类型节点的嵌入表示,最后将其输入分类器中,判断其是否是谣言。【结果】 在公开的数据集上进行实验,结果表明,所提模型在两个数据集上的准确率分别达到91.3%和93.8%,其他评价指标也高于基线模型。【局限】 由于共享多模态谣言的三类节点会使所构建的异质图存在较大的稀疏性,所提模型更适用于小型的话题社区。【结论】 谣言的不同模态之间存在关联性,所提模型利用该特征在多模态谣言检测中表现出良好的效果。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
强子珊
顾益军
关键词 谣言检测节点嵌入多模态异质图注意力机制    
Abstract

[Objective] This paper proposes a social media rumor detection model based on the multimodal heterogeneous graph, aiming to verify the correlation between different rumor modalities and improve the accuracy of rumor detection. [Methods] First, we retrieved multimodal posts from social platforms. Then, we extracted feature representations of texts, pictures, and user attributes through preprocessing. Third, we constructed a heterogeneous graph based on the correlation between texts, pictures, and users. Fourth, we extracted the embeddings of text-type nodes according to their specified meta path. Finally, we input the embedding into the classifier to determine whether or not it is a rumor. [Results] We examined the proposed model with two open data sets. The accuracy of our model reached 91.3% and 93.8%, which were also higher than the baseline models. [Limitations] The three types of nodes from the sharing multimodal rumors will make the heterogeneous graph sparse. The proposed model is more suitable for small topic communities. [Conclusions] There is a correlation between different modalities of rumors, which helps the proposed model effectively detect multimodal rumors.

Key wordsRumor Detection    Node Embedding    Multimodal    Heterogeneous Graph    Attention Mechanism
收稿日期: 2022-08-28      出版日期: 2023-03-22
ZTFLH:  TP391 G350  
基金资助:*公安部科技强警基础工作专项(2020GABJC02);中央高校基本科研业务费专项的研究成果之一(2022JKF02039)
通讯作者: 顾益军,E-mail:guyijun@ppsuc.edu.cn。   
引用本文:   
强子珊, 顾益军. 基于多模态异质图的社交媒体谣言检测模型*[J]. 数据分析与知识发现, 2023, 7(11): 68-78.
Qiang Zishan, Gu Yijun. Detecting Social Media Rumors Based on Multimodal Heterogeneous Graph. Data Analysis and Knowledge Discovery, 2023, 7(11): 68-78.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2022.0905      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2023/V7/I11/68
Fig.1  谣言多模态异质图示例
Fig.2  Het_MMRD模型
Fig.3  多模态实例编码
数据集 总数 谣言 非谣言 文本
数量
用户
数量
图片
数量
节点
总数
MediaEval2015 1 144 385 759 1 144 1 100 40 2 284
MediaEval2016 1 206 400 806 1 206 1 157 96 2 459
Table 1  数据统计
模态 模型 Acc P R F1
单模态模型 Text 0.555 0.368 0.405 0.386
Picture 0.454 0.269 0.439 0.333
多模态模型 FCC 0.655 0.500 0.506 0.503
DCC 0.620 0.431 0.317 0.365
Add_Att 0.838 0.728 0.848 0.784
Dot_Att 0.681 0.568 0.317 0.407
MicroBlog-HAN 0.887 0.835 0.835 0.835
Het_MMRD 0.913 0.839 0.924 0.880
Table 2  MediaEval2015数据集实验结果
模态 模型 Acc P R F1
单模态模型 Text 0.558 0.354 0.430 0.389
Picture 0.482 0.279 0.537 0.367
多模态模型 FCC 0.632 0.432 0.405 0.418
DCC 0.591 0.384 0.418 0.400
Add_Att 0.711 0.571 0.456 0.507
Dot_Att 0.703 0.557 0.430 0.485
MicroBlog-HAN 0.909 0.880 0.835 0.857
Het_MMRD 0.938 0.985 0.823 0.897
Table 3  MediaEval2016数据集实验结果
数据集 模型 Acc P R F1
MediaEval2015 Het_MMRDwithout T-U-T 0.873 0.798 0.848 0.822
Het_MMRDwithout T-P-T 0.900 0.833 0.886 0.859
Het_MMRDwithout T-U-P-U-T 0.887 0.791 0.911 0.847
Het_MMRDwithout T-P-U-P-T 0.908 0.837 0.911 0.873
Het_MMRD 0.913 0.839 0.924 0.880
MediaEval2016 Het_MMRDwithout T-U-T 0.930 0.984 0.798 0.881
Het_MMRDwithout T-P-T 0.913 0.953 0.772 0.853
Het_MMRDwithout T-U-P-U-T 0.901 0.983 0.709 0.824
Het_MMRDwithout T-P-U-P-T 0.926 0.984 0.785 0.873
Het_MMRD 0.938 0.985 0.823 0.897
Table 4  消融实验结果对比
[1] Castillo C, Mendoza M, Poblete B. Information Credibility on Twitter[C]// Proceedings of the 20th International Conference on World Wide Web. ACM, 2011: 675-684.
[2] Wu K, Yang S, Zhu K Q. False Rumors Detection on Sina Weibo by Propagation Structures[C]// Proceedings of 2015 IEEE 31st International Conference on Data Engineering. 2015: 651-662.
[3] Qazvinian V, Rosengren E, Radev D R, et al. Rumor Has It: Identifying Misinformation in Microblogs[C]// Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. 2011: 1589-1599.
[4] 祖坤琳, 赵铭伟, 郭凯, 等. 新浪微博谣言检测研究[J]. 中文信息学报, 2017, 31(3): 198-204.
[4] (Zu Kunlin, Zhao Mingwei, Guo Kai, et al. Research on the Detection of Rumor on Sina Weibo[J]. Journal of Chinese Information Processing, 2017, 31(3): 198-204.)
[5] 李文政, 顾益军, 闫红丽. 基于网络贝叶斯信息准则算法的社区数量预测研究[J]. 数据分析与知识发现, 2020, 4(4): 72-82.
[5] (Li Wenzheng, Gu Yijun, Yan Hongli. Predicting Community Numbers with Network Bayesian Information Criterion[J]. Data Analysis and Knowledge Discovery, 2020, 4(4): 72-82.)
[6] 王本钰, 顾益军, 彭舒凡. 基于粒子竞争机制的半监督社区发现算法[J]. 计算机科学与探索, 2023, 17(3): 608-619.
doi: 10.3778/j.issn.1673-9418.2106040
[6] (Wang Benyu, Gu Yijun, Peng Shufan. Semi-supervised Community Detection Algorithm Based on Particle Competition[J]. Journal of Frontiers of Computer Science and Technology, 2023, 17(3): 608-619.)
doi: 10.3778/j.issn.1673-9418.2106040
[7] Ma J, Gao W, Mitra P, et al. Detecting Rumors from Microblogs with Recurrent Neural Networks[C]// Proceedings of the 25th International Joint Conference on Artificial Intelligence. 2016: 3818-3824.
[8] 刘政, 卫志华, 张韧弦. 基于卷积神经网络的谣言检测[J]. 计算机应用, 2017, 37(11): 3053-3056, 3100.
doi: 10.11772/j.issn.1001-9081.2017.11.3053
[8] (Liu Zheng, Wei Zhihua, Zhang Renxian. Rumor Detection Based on Convolutional Neural Network[J]. Journal of Computer Applications, 2017, 37(11): 3053-3056, 3100.)
doi: 10.11772/j.issn.1001-9081.2017.11.3053
[9] Ma J, Gao W, Wong K F. Rumor Detection on Twitter with Tree-Structured Recursive Neural Networks[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers). 2018: 1980-1989.
[10] Bian T, Xiao X, Xu T Y, et al. Rumor Detection on Social Media with Bi-directional Graph Convolutional Networks[C]// Proceedings of the AAAI Conference on Artificial Intelligence. 2020: 549-556.
[11] 王雷全. 基于图模型的多模态社会媒体分析[D]. 北京: 北京邮电大学, 2016.
[11] (Wang Leiquan. Analysis of Multi-modal Social Media Based on Graph Model[D]. Beijing: Beijing University of Posts and Telecommunications, 2016.)
[12] 吴友政, 李浩然, 姚霆, 等. 多模态信息处理前沿综述:应用、融合和预训练[J]. 中文信息学报, 2022, 36(5): 1-20.
[12] (Wu Youzheng, Li Haoran, Yao Ting, et al. A Survey of Multimodal Information Processing Frontiers: Application, Fusion and Pre-training[J]. Journal of Chinese Information Processing, 2022, 36(5): 1-20.)
[13] Ngiam J, Khosla A, Kim M, et al. Multimodal Deep Learning[C]// Proceedings of the 28th International Conference on International Conference on Machine Learning. 2011: 689-696.
[14] Lu J S, Yang J W, Batra D, et al. Hierarchical Question-Image Co-attention for Visual Question Answering[C]// Proceedings of the 30th International Conference on Neural Information Processing Systems. 2016: 289-297.
[15] Nam H, Ha J W, Kim J. Dual Attention Networks for Multimodal Reasoning and Matching[C]// Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. 2017: 2156-2164.
[16] Wöllmer M, Metallinou A, Eyben F, et al. Context-Sensitive Multimodal Emotion Recognition from Speech and Facial Expression Using Bidirectional LSTM Modeling[C]// Proceedings of the 11th Annual Conference of the International Speech Communication Association. 2010: 2362-2365.
[17] Hu J W, Liu Y C, Zhao J M, et al. MMGCN: Multimodal Fusion via Deep Graph Convolution Network for Emotion Recognition in Conversation[C]// Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1:Long Papers). 2021: 5666-5675.
[18] Jin Z W, Cao J, Guo H, et al. Multimodal Fusion with Recurrent Neural Networks for Rumor Detection on Microblogs[C]// Proceedings of the 25th ACM International Conference on Multimedia. 2017: 795-816.
[19] 刘金硕, 冯阔, Jeff Z. Pan, 等. MSRD: 多模态网络谣言检测方法[J]. 计算机研究与发展, 2020, 57(11): 2328-2336.
[19] (Liu Jinshuo, Feng Kuo, Jeff Z. Pan, et al. MSRD: Multi-modal Web Rumor Detection Method[J]. Journal of Computer Research and Development, 2020, 57(11): 2328-2336.)
[20] 陈志毅, 隋杰. 基于DeepFM和卷积神经网络的集成式多模态谣言检测方法[J]. 计算机科学, 2022, 49(1): 101-107.
doi: 10.11896/jsjkx.201200007
[20] (Chen Zhiyi, Sui Jie. DeepFM and Convolutional Neural Networks Ensembles for Multimodal Rumor Detection[J]. Computer Science, 2022, 49(1): 101-107.)
doi: 10.11896/jsjkx.201200007
[21] 孟佳娜, 王晓培, 李婷, 等. 基于对抗神经网络的跨模态谣言检测[J]. 数据分析与知识发现, 2022, 6(12): 32-42.
[21] (Meng Jiana, Wang Xiaopei, Li Ting, et al. Cross-Modal Rumor Detection Based on Adversarial Neural Networks[J]. Data Analysis and Knowledge Discovery, 2022, 6(12): 32-42.)
[22] 戚力鑫, 万书振, 唐斌, 等. 基于注意力机制的多模态融合谣言检测方法[J]. 计算机工程与应用, 2022, 58(19): 209-217.
doi: 10.3778/j.issn.1002-8331.2102-0229
[22] (Qi Lixin, Wan Shuzhen, Tang Bin, et al. Multimodal Fusion Rumor Detection Method Based on Attention Mechanism[J]. Computer Engineering and Applications, 2022, 58(19): 209-217.)
doi: 10.3778/j.issn.1002-8331.2102-0229
[23] 张少钦, 杜圣东, 张晓博, 等. 融合多模态信息的社交网络谣言检测方法[J]. 计算机科学, 2021, 48(5): 117-123.
doi: 10.11896/jsjkx.200400057
[23] (Zhang Shaoqin, Du Shengdong, Zhang Xiaobo, et al. Social Rumor Detection Method Based on Multimodal Fusion[J]. Computer Science, 2021, 48(5): 117-123.)
doi: 10.11896/jsjkx.200400057
[24] 唐樾, 马静. 基于增强对抗网络和多模态融合的谣言检测方法[J]. 情报科学, 2022, 40(6): 108-114.
[24] (Tang Yue, Ma Jing. A Rumor Detection Method Based on Enhance Adversarial Network and Multimodal Fusion[J]. Information Science, 2022, 40(6): 108-114.)
[25] 陶霄, 朱焱, 李春平. 基于注意力与多模态混合融合的谣言检测方法[J]. 计算机工程, 2021, 47(12): 71-77.
doi: 10.19678/j.issn.1000-3428.0059683
[25] (Tao Xiao, Zhu Yan, Li Chunping. Rumor Detection Method Based on Attention and Multi-modal Hybrid Fusion[J]. Computer Engineering, 2021, 47(12): 71-77.)
doi: 10.19678/j.issn.1000-3428.0059683
[26] 何俊, 张彩庆, 李小珍, 等. 面向深度学习的多模态融合技术研究综述[J]. 计算机工程, 2020, 46(5): 1-11.
doi: 10.19678/j.issn.1000-3428.0057370
[26] (He Jun, Zhang Caiqing, Li Xiaozhen, et al. Survey of Research on Multimodal Fusion Technology for Deep Learning[J]. Computer Engineering, 2020, 46(5): 1-11.)
doi: 10.19678/j.issn.1000-3428.0057370
[27] Wang X, Ji H Y, Shi C, et al. Heterogeneous Graph Attention Network[C]// Proceedings of WWW’19. 2019: 2022-2032.
[28] Zhang C X, Song D J, Huang C, et al. Heterogeneous Graph Neural Network[C]// Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019: 793-803.
[29] Fu X Y, Zhang J N, Meng Z Q, et al. MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding[C]// Proceedings of the WWW 2020. 2020: 2331-2341.
[30] 李昊杰. 异质图神经网络研究及其在推荐系统中的应用[D]. 成都: 电子科技大学, 2022.
[30] (Li Haojie. Research on Heterogeneous Graph Neural Network and Its Application in Recommendation System[D]. Chengdu: University of Electronic Science and Technology of China, 2022.)
[31] 李良训. 基于异质图嵌入的Android恶意软件检测的研究与实现[D]. 北京: 北京邮电大学, 2021.
[31] (Li Liangxun. Research and Implementation of Heterogeneous Graph Embedding for Android Malware Detection[D]. Beijing: Beijing University of Posts and Telecommunications, 2021.)
[32] Huang Q, Yu J S, Wu J, et al. Heterogeneous Graph Attention Networks for Early Detection of Rumors on Twitter[C]// Proceedings of 2020 International Joint Conference on Neural Networks. 2020: 1-8.
[33] 毕蓓, 潘慧瑶, 陈峰, 等. 基于异构图注意力网络的微博谣言监测模型[J]. 计算机应用, 2021, 41(12): 3546-3550.
doi: 10.11772/j.issn.1001-9081.2021060981
[33] (Bi Bei, Pan Huiyao, Chen Feng, et al. Microblog Rumor Detection Model Based on Heterogeneous Graph Attention Network[J]. Journal of Computer Applications, 2021, 41(12): 3546-3550.)
doi: 10.11772/j.issn.1001-9081.2021060981
[34] Sun Z, Deng Z H, Nie J Y, et al. RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space[OL]. arXiv Preprint, arXiv:1902.10197.
[35] Boididou C, Andreadou K, Papadopoulos S, et al. Verifying Multimedia Use at MediaEval 2015[C]// Proceedings of MediaEval 2015 Workshop. 2015.
[36] Maigrot C, Claveau V, Kijak E, et al. MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task[C]// Proceedings of MediaEval 2016 Workshop. 2016.
[37] 王壮, 隋杰. 基于多级融合的多模态谣言检测模型[J]. 计算机工程与设计, 2022, 43(6): 1756-1761.
[37] (Wang Zhuang, Sui Jie. Multimodal Rumor Detection Model Based on Multilevel Fusion[J]. Computer Engineering and Design, 2022, 43(6): 1756-1761.)
[38] Godbole S, Sarawagi S. Discriminative Methods for Multi-labeled Classification[C]// Proceedings of Pacific-Asia Conference on Knowledge Discovery and Data Mining. 2004: 22-30.
[1] 何丽, 杨美华, 刘璐瑶. 融合SPO语义和句法信息的事件检测方法*[J]. 数据分析与知识发现, 2023, 7(9): 114-124.
[2] 韩普, 顾亮, 叶东宇, 陈文祺. 基于多任务和迁移学习的中文医学文献实体识别研究*[J]. 数据分析与知识发现, 2023, 7(9): 136-145.
[3] 刘洋, 丁星辰, 马莉莉, 王淳洋, 朱立芳. 基于多维度图卷积网络的旅游评论有用性识别*[J]. 数据分析与知识发现, 2023, 7(8): 95-104.
[4] 赵萌, 王昊, 李晓敏. 中国民歌多情感识别及情感变化规律分析研究*[J]. 数据分析与知识发现, 2023, 7(7): 111-124.
[5] 刘洋, 张雯, 胡毅, 毛进, 黄菲. 基于多模态深度学习的酒店股票预测*[J]. 数据分析与知识发现, 2023, 7(5): 21-32.
[6] 黄学坚, 马廷淮, 王根生. 基于分层语义特征学习模型的微博谣言事件检测*[J]. 数据分析与知识发现, 2023, 7(5): 81-91.
[7] 徐康, 余胜男, 陈蕾, 王传栋. 基于语言学知识增强的自监督式图卷积网络的事件关系抽取方法*[J]. 数据分析与知识发现, 2023, 7(5): 92-104.
[8] 张昱, 张海军, 刘雅情, 梁科晋, 王月阳. 基于双向掩码注意力机制的多模态情感分析*[J]. 数据分析与知识发现, 2023, 7(4): 46-55.
[9] 潘华莉, 谢珺, 高婧, 续欣莹, 王长征. 融合多模态特征的深度强化学习推荐模型*[J]. 数据分析与知识发现, 2023, 7(4): 114-128.
[10] 韩普, 仲雨乐, 陆豪杰, 马诗雯. 基于对抗性迁移学习的药品不良反应实体识别研究*[J]. 数据分析与知识发现, 2023, 7(3): 131-141.
[11] 赵朝阳, 朱贵波, 王金桥. ChatGPT给语言大模型带来的启示和多模态大模型新的发展思路*[J]. 数据分析与知识发现, 2023, 7(3): 26-35.
[12] 周宁, 钟娜, 靳高雅, 刘斌. 基于混合词嵌入的双通道注意力网络中文文本情感分析*[J]. 数据分析与知识发现, 2023, 7(3): 58-68.
[13] 苏明星, 吴厚月, 李健, 黄菊, 张顺香. 基于多层交互注意力机制的商品属性抽取*[J]. 数据分析与知识发现, 2023, 7(2): 108-118.
[14] 王昊, 龚丽娟, 周泽聿, 范涛, 王永生. 融合语义增强的社交媒体虚假信息检测方法研究*[J]. 数据分析与知识发现, 2023, 7(2): 48-60.
[15] 林哲, 陈平华. 基于块注意力机制和Involution的文本情感分析模型*[J]. 数据分析与知识发现, 2023, 7(11): 37-45.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn