Please wait a minute...
Advanced Search
数据分析与知识发现
  本期目录 | 过刊浏览 | 高级检索 |
基于持续学习的多语言情感分析模型
赵佳艺;徐月梅;顾涵文
(北京外国语大学信息科学技术学院 北京  100089)
A Multilingual Sentiment Analysis Model based on Continual Learning
Zhao Jiayi;Xu Yuemei;Gu Hanwen
(School of Information Science and Technology, Beijing Foreign Studies University, Beijing 100089, China)
全文:
输出: BibTeX | EndNote (RIS)      
摘要 

[目的]本研究旨在解决多语言模型在处理新语种任务时由于灾难性遗忘导致的性能下降问题。[方法]提出一种基于持续学习的多语言情感分析模型mLMs-EWC,将持续学习思想融入多语言模型中,使模型能够在学习新语种特征的同时,保留已学习到的旧语种语言特征。[结果]在三种语言的持续情感分析实验中发现,mLMs-EWC模型的性能在法语和英语任务中相比Multi-BERT模型高出约5.2%和4.5%。此外,实验还在轻量化的蒸馏模型上评估了mLMs-EWC模型,结果显示在英语任务上的提升率高达24.7%。[局限]本研究聚焦于三种广泛使用的语言,对其他语言的泛化能力还需进一步验证。[结论]该模型能够在多语言情感分析任务中减轻灾难性遗忘,并在多语种数据集上实现持续学习。代码发布在https://github.com/flutter85/mLMs-EWC/tree/master上。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
关键词 多语言情感分析持续学习灾难性遗忘     
Abstract

[Objective] This study aims to address the performance degradation of multilingual models in handling new language tasks due to catastrophic forgetting. [Methods] A continual learning-based multilingual sentiment analysis model, mLMs-EWC, was proposed. By integrating the continual learning idea into the multilingual models, these models can acquire new language features while retaining the linguistic characteristics of previously learned languages. [Results] The mLMs-EWC model outperforms the Multi-BERT model by approximately 5.2% and 4.5% on French and English tasks, respectively. In addition, we also evaluate our approach on a lightweight distillation model, which showed a remarkable improvement rate of 24.7% on the English task. [Limitations] This study focuses on three widely used languages, and further validation is needed for the generalization ability of other languages. [Conclusions] The proposed model can alleviate catastrophic forgetting in multilingual sentiment analysis tasks and achieve continual learning on multilingual datasets. The code can be visited through https://github.com/flutter85/mLMs-EWC/tree/master.

Key words Multilingual sentiment analysis    Continual learning    Catastrophic forgetting
     出版日期: 2024-03-15
ZTFLH:  TP393,G250  
引用本文:   
赵佳艺, 徐月梅, 顾涵文. 基于持续学习的多语言情感分析模型 [J]. 数据分析与知识发现, 10.11925/infotech.2096-3467.2023.0714.
Zhao Jiayi, Xu Yuemei, Gu Hanwen. A Multilingual Sentiment Analysis Model based on Continual Learning . Data Analysis and Knowledge Discovery, 0, (): 1-.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2023.0714      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y0/V/I/1
[1] 江雅仁, 乐小虬. 一对多实体关系少样本持续学习方法研究[J]. 数据分析与知识发现, 2021, 5(8): 45-53.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn