Research on Multimodal Sarcasm Detection Based on the SC-attention Mechanism

doi:10.11925/infotech.2096-3467.2021-1362

Data Analysis and Knowledge Discovery



Research on Multimodal Sarcasm Detection Based on SC-attention Mechanism

Chen Yuanyuan, Ma Jing

(College of Economics and Management, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China)





Abstract:

[Objective] Existing multimodal sarcasm detection models suffer from low prediction accuracy and difficulty in fusing multimodal features. To address these problems, this paper designs an SC-attention fusion mechanism.

[Methods] The CLIP and RoBERTa models are used to extract features from three modalities: image, image attributes, and text. The SC-attention mechanism, which combines the attention mechanism of SENet with a Co-attention mechanism, fuses these multimodal features. Guided by the original modal features, it allocates feature weights appropriately. Finally, the fused features are fed into a fully connected layer for sarcasm detection.
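
The fusion pipeline described in the Methods can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify layer shapes or how the SENet-style gating and Co-attention are wired together, so the structure below (SE-style gating across the three modality vectors, followed by scaled dot-product attention guided by each original feature) is an assumption. The feature vectors are random stand-ins for CLIP and RoBERTa outputs, the weights are untrained, and all function names are hypothetical.

```python
import math
import random


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def softmax(scores):
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vec):
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


def se_gate(features, w_reduce, w_expand):
    """SE-style gating (assumed form): squeeze each modality vector to its
    mean, pass the three means through two small linear maps, and rescale
    each modality by the resulting sigmoid gate."""
    squeezed = [sum(f) / len(f) for f in features]
    hidden = [max(0.0, h) for h in matvec(w_reduce, squeezed)]  # ReLU
    gates = [sigmoid(g) for g in matvec(w_expand, hidden)]
    return [[g * x for x in f] for g, f in zip(gates, features)]


def guided_attention(guide, features):
    """Scaled dot-product attention over modality vectors, with one
    original (ungated) feature vector acting as the query."""
    dim = len(guide)
    scores = [sum(g * x for g, x in zip(guide, f)) / math.sqrt(dim)
              for f in features]
    weights = softmax(scores)
    fused = [sum(w * f[i] for w, f in zip(weights, features))
             for i in range(dim)]
    return fused, weights


def sc_attention_sketch(image, attribute, text, w_reduce, w_expand):
    originals = [image, attribute, text]
    gated = se_gate(originals, w_reduce, w_expand)
    fused = []
    for guide in originals:  # each original modality guides one attention pass
        part, _ = guided_attention(guide, gated)
        fused.extend(part)
    return fused  # concatenation of the three guided summaries


def classify(fused, fc_weights, fc_bias):
    """Single fully connected unit with a sigmoid: P(sarcastic)."""
    return sigmoid(sum(w * x for w, x in zip(fc_weights, fused)) + fc_bias)


# Toy run with random stand-in features and untrained random weights.
rng = random.Random(0)
DIM = 4
image, attribute, text = [[rng.uniform(-1, 1) for _ in range(DIM)]
                          for _ in range(3)]
w_reduce = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(2)]
w_expand = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(3)]
fused = sc_attention_sketch(image, attribute, text, w_reduce, w_expand)
fc_weights = [rng.uniform(-1, 1) for _ in range(len(fused))]
probability = classify(fused, fc_weights, 0.0)
print(len(fused), probability)
```

In a real system the three input vectors would come from pretrained encoders and the gating and classifier weights would be learned end to end; the sketch only shows how gating, guided attention, and the final fully connected layer could fit together.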

[Results] Experiments show that multimodal sarcasm detection based on the SC-attention mechanism achieves an accuracy of 93.71% and an F1 score of 91.89%. Compared with models evaluated on the same dataset, accuracy improves by 10.27% and F1 by 11.5%.
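
For reference, accuracy and F1 for a binary task such as sarcasm detection follow standard definitions. The sketch below computes both from true and predicted labels. The label lists are toy values chosen for illustration and are not data from the paper, and the function names are hypothetical.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true positives, false positives, and false negatives."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return tp, fp, fn


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    tp, fp, fn = confusion_counts(y_true, y_pred, positive)
    if tp == 0:
        return 0.0  # avoids division by zero when nothing is correctly flagged
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels (1 = sarcastic, 0 = not sarcastic); illustrative only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
print(accuracy(y_true, y_pred), f1_score(y_true, y_pred))
```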

[Limitations] The model's generalization ability still needs to be verified on more datasets.

[Conclusions] The SC-attention mechanism proposed in this paper reduces information redundancy and feature loss, and effectively improves the accuracy of multimodal sarcasm detection.

Key words: multimodal, sarcasm detection, SC-attention mechanism, CLIP model

Published: 2022-07-01

CLC number (ZTFLH): TP393, G250

Cite this article:

Chen Yuanyuan, Ma Jing. Research on Multimodal Sarcasm Detection Based on SC-attention Mechanism [J]. Data Analysis and Knowledge Discovery, doi:10.11925/infotech.2096-3467.2021-1362.

Link to this article:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2021-1362 or https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y0/V/I/1

