Please wait a minute...
Advanced Search
数据分析与知识发现  2024, Vol. 8 Issue (4): 125-136     https://doi.org/10.11925/infotech.2096-3467.2023.0237
  研究论文 本期目录 | 过刊浏览 | 高级检索 |
基于图卷积网络和注意力机制的谣言检测方法*
凤丽洲1,刘馥榕1,王友卫2()
1天津财经大学统计学院 天津 300222
2中央财经大学信息学院 北京 100081
Detecting Rumor Based on Graph Convolution Network and Attention Mechanism
Feng Lizhou1,Liu Furong1,Wang Youwei2()
1School of Statistics, Tianjin University of Finance and Economics, Tianjin 300222, China
2School of Information, Central University of Finance and Economics, Beijing 100081, China
全文: PDF (1265 KB)   HTML ( 10
输出: BibTeX | EndNote (RIS)      
摘要 

【目的】 针对目前的谣言检测方法未能充分考虑评论间的转发关系特征和文本语义特征,提出一种基于图卷积网络和注意力机制的谣言检测方法。【方法】 首先,对评论间转发和回复关系特征进行分析,构建评论关系特征图,充分挖掘评论间的关联特性。然后,根据评论间的文本语义相似性,使用BERT模型生成句子的向量化表示并通过计算余弦相似度构建评论的语义特征图,充分提取评论的语义相关性。最后,基于图卷积网络完成不同节点之间的信息传递,并在各节点信息传输过程中使用注意力机制区分源评论和其他评论对谣言检测的影响,进而得到评论节点的准确表示。【结果】 在公开数据集上进行实验,结果显示所提方法在Twitter15和Twitter16数据集上的准确率分别达到0.860和0.870,F1均值分别为0.858和0.866。与BiGCN方法相比,准确率分别提升了5.1%和1.5%,F1均值分别提升了5.0%和1.9%。【局限】 仅使用文本数据进行谣言检测,未结合图片、用户属性及时间属性等特征。【结论】 在公开数据集上进行应用,验证了所提方法可以有效地提升谣言检测性能,为谣言识别与检测任务提供有价值的参考。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
凤丽洲
刘馥榕
王友卫
关键词 图卷积网络注意力机制谣言检测BERT模型    
Abstract

[Objective] This paper proposes a rumor detection method based on a graph convolutional network and attention mechanism, which utilizes comment forwarding and text semantic features. [Methods] Firstly, we analyzed the forwarding and replying relationship among comments and constructed a comment relationship feature map to explore the comments’ correlations. Then, we used the BERT model to generate the vector representation of sentences based on their text semantic similarity. We also built the semantic feature map of comments by calculating the cosine similarity and fully extracting their semantic relevance. Third, we completed information dissemination among nodes based on a Graph Convolutional Network (GCN). We also used the attention mechanism to distinguish the impact of original and other comments on rumor detection. Finally, we obtained an accurate representation of the comment nodes. [Results] Our model’s accuracy on the Twitter15 and Twitter16 public datasets reached 0.860 and 0.870, with F1 mean values of 0.858 and 0.866. Compared with the BiGCN method, our model’s accuracy improved by 5.1% and 1.5% on the Twitter15 and Twitter16 datasets, and the F1 mean improved by 5.0% and 1.9%, respectively. [Limitation] We only used texts for rumor detection. Future research will combine images, user attributes, and time attributes to improve the model’s accuracy. [Conclusion] The proposed method can effectively improve the performance of rumor detection, providing valuable references for related tasks.

Key wordsGraph Convolutional Networks    Attention Mechanism    Rumor Detection    BERT Model
收稿日期: 2023-03-23      出版日期: 2024-01-08
ZTFLH:  TP393  
  G250  
基金资助:* 国家社会科学基金项目(18CTJ008);国家自然科学基金项目(61906220);教育部人文社科项目(19YJCZH178)
通讯作者: 王友卫,ORCID: 0000-0002-3925-3422,E-mail: ywwang15@126.com。   
引用本文:   
凤丽洲, 刘馥榕, 王友卫. 基于图卷积网络和注意力机制的谣言检测方法*[J]. 数据分析与知识发现, 2024, 8(4): 125-136.
Feng Lizhou, Liu Furong, Wang Youwei. Detecting Rumor Based on Graph Convolution Network and Attention Mechanism. Data Analysis and Knowledge Discovery, 2024, 8(4): 125-136.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2023.0237      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2024/V8/I4/125
Fig.1  模型算法流程
Fig.2  树结构和图结构的节点传递示意图
统计项目 Twitter15 Twitter16
所有事件转发评论总数 331 612 204 820
用户总数 276 663 173 487
事件总数 1 490 818
真实谣言总数 374 205
虚假谣言总数 370 205
未经证实的谣言总数 374 203
非谣言总数 372 205
平均转发评论数 223 251
最大转发评论数 1 768 2 765
最小转发评论数 55 81
Table 1  数据集的详细统计信息
方法 参数设置
DTC[25] 特征包括消息内容特征、用户特征、主题特征、传播特征,特征选择方法为向前搜索
SVM-TS[26] 核函数使用RBF,统计特征包括用户特征、内容特征和传播特征
RvNN[13] 隐层状态与Embedding大小为100,词典大小为5 000
GRU-2[6] 词典大小为5 000,使用两层GRU
PPC_RNN+CNN[27] GRU输出维度为32,CNN filters为32,CNN filter height为3,训练Epoch的数量为200,Dropout 设置为0.5
BiGCN[18] 隐藏节点特征向量的维数为64,Dropout为0.5,训练模型的Epoch为200,早停次数为10
Table 2  不同对比方法的参数设置
方法 Acc NR-F1 FR-F1 TR-F1 UR-F1
DTC 0.473 0.255 0.080 0.190 0.482
SVM-TS 0.574 0.756 0.421 0.571 0.526
RvNN 0.737 0.662 0.744 0.835 0.708
GRU-2 0.646 0.792 0.574 0.608 0.592
PPC_RNN+CNN 0.564 0.591 0.543 0.394 0.674
BiGCN 0.818 0.763 0.827 0.883 0.795
TG-GCN 0.860 0.846 0.869 0.902 0.815
Table 3  Twitter15数据集上的谣言检测结果
方法 Acc NR-F1 FR-F1 TR-F1 UR-F1
DTC 0.452 0.425 0.365 0.732 0.327
SVM-TS 0.564 0.786 0.462 0.414 0.493
RvNN 0.713 0.681 0.748 0.831 0.674
GRU-2 0.643 0.772 0.489 0.686 0.593
PPC_RNN+CNN 0.487 0.358 0.527 0.301 0.641
BiGCN 0.857 0.765 0.828 0.943 0.864
TG-GCN 0.870 0.800 0.845 0.937 0.882
Table 4  Twitter16数据集上的谣言检测结果
Fig.3  Twitter15数据集消融实验结果
Fig.4  Twitter16数据集消融实验结果
Fig.5  不同隐藏层维度的准确率
Fig.6  不同隐藏层维度的F1
Fig.7  Twitter15数据集早期实验结果(准确率)
Fig.8  Twitter15数据集早期实验结果(F1值)
Fig.9  Twitter16数据集早期实验结果(准确率)
Fig.10  Twitter16数据集早期实验结果(F1值)
[1] 陈燕方, 李志宇, 梁循, 等. 在线社会网络谣言检测综述[J]. 计算机学报, 2018, 41(7): 1648-1677.
[1] (Chen Yanfang, Li Zhiyu, Liang Xun, et al. Review on Rumor Detection of Online Social Networks[J]. Chinese Journal of Computers, 2018, 41(7): 1648-1677.)
[2] 高玉君, 梁刚, 蒋方婷, 等. 社会网络谣言检测综述[J]. 电子学报, 2020, 48(7): 1421-1435.
doi: 10.3969/j.issn.0372-2112.2020.07.023
[2] (Gao Yujun, Liang Gang, Jiang Fangting, et al. Social Network Rumor Detection: A Survey[J]. Acta Electronica Sinica, 2020, 48(7): 1421-1435.)
doi: 10.3969/j.issn.0372-2112.2020.07.023
[3] Castillo C, Mendoza M, Poblete B. Information Credibility on Twitter[C]// Proceedings of the 20th International Conference on World Wide Web. 2011: 675-684.
[4] 马鸣, 刘云, 刘地军, 等. 基于主题和预防模型的微博谣言检测[J]. 北京理工大学学报, 2020, 40(3): 310-315.
[4] (Ma Ming, Liu Yun, Liu Dijun, et al. Rumor Detection in Microblogs Based on Topic and Prevention Model[J]. Transactions of Beijing Institute of Technology, 2020, 40(3): 310-315.)
[5] Kwon S, Cha M, Jung K, et al. Prominent Features of Rumor Propagation in Online Social Media[C]// Proceedings of 2013 IEEE 13th International Conference on Data Mining. 2013: 1103-1108.
[6] Ma J, Gao W, Mitra P, et al. Detecting Rumors from Microblogs with Recurrent Neural Networks[C]// Proceedings of the 25th International Joint Conference on Artificial Intelligence. 2016: 3818-3824.
[7] 刘金硕, 冯阔, Jeff Z.Pan, 等. MSRD: 多模态网络谣言检测方法[J]. 计算机研究与发展, 2020, 57(11): 2328-2336.
[7] (Liu Jinshuo, Feng Kuo, Jeff Z.Pan, et al. MSRD: Multi-Modal Web Rumor Detection Method[J]. Journal of Computer Research and Development, 2020, 57(11): 2328-2336.)
[8] Luo Z R, Li Q Q, Zheng J. Deep Feature Fusion for Rumor Detection on Twitter[J]. IEEE Access, 2021, 9: 126065-126074.
[9] Shelhamer E, Long J, Darrell T. Fully Convolutional Networks for Semantic Segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4): 640-651.
doi: 10.1109/TPAMI.2016.2572683 pmid: 27244717
[10] Kim Y. Convolutional Neural Networks for Sentence Classification[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014: 1746-1751.
[11] Zhang X, Zhao J B, LeCun Y. Character-Level Convolutional Networks for Text Classification[C]// Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 1. 2015: 649-657.
[12] Peng Y, Wang J M. Rumor Detection Based on Attention CNN and Time Series of Context Information[J]. Future Internet, 2021, 13(11): 267.
[13] Ma J, Gao W, Wong K F. Rumor Detection on Twitter with Tree-Structured Recursive Neural Networks[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1:Long Papers). 2018: 1980-1989.
[14] 尹鹏博, 潘伟民, 彭成, 等. 基于用户特征分析的微博谣言早期检测研究[J]. 情报杂志, 2020, 39(7): 81-86.
[14] (Yin Pengbo, Pan Weimin, Peng Cheng, et al. Research on Early Detection of Weibo Rumors Based on User Characteristics Analysis[J]. Journal of Intelligence, 2020, 39(7): 81-86.)
[15] Umer M, Imtiaz Z, Ullah S, et al. Fake News Stance Detection Using Deep Learning Architecture(CNN-LSTM)[J]. IEEE Access, 2020, 8: 156695-156706.
[16] Kipf T N, Welling M. Semi-supervised Classification with Graph Convolutional Networks[OL]. arXiv Preprint, arXiv: 1609.02907.
[17] 王友卫, 童爽, 凤丽洲, 等. 基于图卷积网络的归纳式微博谣言检测新方法[J]. 浙江大学学报(工学版), 2022, 56(5): 956-966.
[17] (Wang Youwei, Tong Shuang, Feng Lizhou, et al. New Inductive Microblog Rumor Detection Method Based on Graph Convolutional Network[J]. Journal of Zhejiang University (Engineering Science), 2022, 56(5): 956-966.)
[18] Bian T, Xiao X, Xu T Y, et al. Rumor Detection on Social Media with Bidirectional Graph Convolutional Networks[C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. 2020.
[19] 王昕岩, 宋玉蓉, 宋波. 一种加权图卷积神经网络的新浪微博谣言检测方法[J]. 小型微型计算机系统, 2021, 42(8): 1780-1786.
[19] (Wang Xinyan, Song Yurong, Song Bo. Sina Microblog Rumor Detection Method Based on Weighted-Graph Convolutional Network[J]. Journal of Chinese Computer Systems, 2021, 42(8): 1780-1786.)
[20] Zhang Y F, Yu X L, Cui Z Y, et al. Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks[C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020: 334-339.
[21] 叶子诚, 闫桂英. 基于图模型的关键词提取算法研究[J]. 系统科学与数学, 2021, 41(4): 967-975.
[21] (Ye Zicheng, Yan Guiying. Study on Keyword Extraction Algorithm Based on Graphical Model[J]. Journal of Systems Science and Mathematical Sciences, 2021, 41(4): 967-975.)
[22] Devlin J, Chang M W, Lee K, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding[C]// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies, Volume 1 (Long and Short Papers). 2019: 4171-4186.
[23] Battaglia P W, Hamrick J B, Bapst V, et al. Relational Inductive Biases, Deep Learning, and Graph Networks[OL]. arXiv Preprint, arXiv: 1806.01261.
[24] Ma J, Gao W, Wong K F. Detect Rumors in Microblog Posts Using Propagation Structure Via Kernel Learning[C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1:Long Papers). 2017: 708-717.
[25] Zhang H L, Alim M A, Li X, et al. Misinformation in Online Social Networks[J]. ACM Transactions on Information Systems, 2016, 34(3): 18.
[26] Ma J, Gao W, Wei Z Y, et al. Detect Rumors Using Time Series of Social Context Information on Microblogging Websites[C]// Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 2015: 1751-1754.
[27] Liu Y, Wu Y F. Early Detection of Fake News on Social Media Through Propagation Path Classification with Recurrent and Convolutional Networks[C]// Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018.
[28] Chen Y H, Yin C Y, Zuo W L. Multi-task Learning for Stance and Early Rumor Detection[J]. Optical Memory and Neural Networks, 2021, 30(2): 131-139.
[29] Chen X N, Wang C Y, Li D, et al. A New Early Rumor Detection Model Based on BiGRU Neural Network[J]. Discrete Dynamics in Nature and Society, 2021, 2021: 2296605.
[1] 贺国秀, 任佳渝, 李宗耀, 林晨曦, 蔚海燕. 以可解释工具重探基于深度学习的谣言检测*[J]. 数据分析与知识发现, 2024, 8(4): 1-13.
[2] 刘成山, 李普国, 汪圳. 一种以科研团队为服务对象的科研人员推荐模型*[J]. 数据分析与知识发现, 2024, 8(3): 132-142.
[3] 全安坤, 李红莲, 张乐, 吕学强. 融合内容和图片特征的中文摘要生成方法研究*[J]. 数据分析与知识发现, 2024, 8(3): 110-119.
[4] 李慧, 胡耀华, 徐存真. 考虑评论情感表达力及其重要性的个性化推荐算法*[J]. 数据分析与知识发现, 2024, 8(1): 69-79.
[5] 何玉, 张晓冬, 郑鑫. 基于SpERT-Aggcn模型的专利知识图谱构建研究*[J]. 数据分析与知识发现, 2024, 8(1): 146-156.
[6] 何丽, 杨美华, 刘璐瑶. 融合SPO语义和句法信息的事件检测方法*[J]. 数据分析与知识发现, 2023, 7(9): 114-124.
[7] 韩普, 顾亮, 叶东宇, 陈文祺. 基于多任务和迁移学习的中文医学文献实体识别研究*[J]. 数据分析与知识发现, 2023, 7(9): 136-145.
[8] 刘洋, 丁星辰, 马莉莉, 王淳洋, 朱立芳. 基于多维度图卷积网络的旅游评论有用性识别*[J]. 数据分析与知识发现, 2023, 7(8): 95-104.
[9] 胥桂仙, 张子欣, 于绍娜, 董玉双, 田媛. 基于图卷积网络的藏文新闻文本分类*[J]. 数据分析与知识发现, 2023, 7(6): 73-85.
[10] 徐康, 余胜男, 陈蕾, 王传栋. 基于语言学知识增强的自监督式图卷积网络的事件关系抽取方法*[J]. 数据分析与知识发现, 2023, 7(5): 92-104.
[11] 黄学坚, 马廷淮, 王根生. 基于分层语义特征学习模型的微博谣言事件检测*[J]. 数据分析与知识发现, 2023, 7(5): 81-91.
[12] 韩普, 仲雨乐, 陆豪杰, 马诗雯. 基于对抗性迁移学习的药品不良反应实体识别研究*[J]. 数据分析与知识发现, 2023, 7(3): 131-141.
[13] 周宁, 钟娜, 靳高雅, 刘斌. 基于混合词嵌入的双通道注意力网络中文文本情感分析*[J]. 数据分析与知识发现, 2023, 7(3): 58-68.
[14] 苏明星, 吴厚月, 李健, 黄菊, 张顺香. 基于多层交互注意力机制的商品属性抽取*[J]. 数据分析与知识发现, 2023, 7(2): 108-118.
[15] 张贞港, 余传明. 基于实体与关系融合的知识图谱补全模型研究*[J]. 数据分析与知识发现, 2023, 7(2): 15-25.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn