一种基于模板提示学习的事件抽取方法

doi:10.11925/infotech.2096-3467.2022-0495

数据分析与知识发现

本期目录 | 过刊浏览 | 高级检索

一种基于模板提示学习的事件抽取方法

陈诺,李旭晖

(武汉大学信息管理学院武汉 430072) (武汉大学大数据研究院武汉 430072)

An Event Extraction Method Based on Template Prompt Learning

Chen Nuo,Li Xuhui

(School of Information Management, Wuhan University, Wuhan 430072, China) (Big Data Institute, Wuhan University, Wuhan 430072, China)

摘要
相关文章
Metrics

全文:
输出: BibTeX | EndNote (RIS)

摘要

[目的]针对现有基于标注和基于文本生成的事件抽取模型存在的不足，提出一种使用自动构造模板引出预训练语言模型知识的事件联合抽取模型。

[方法]本文基于事件提示符设计了模板自动构造策略来生成统一的提示模板，在编码层为事件提示符引入事件提示编码层，而后接入预训练的BART模型捕捉句子的语义信息，并生成对应的预测序列，从预测序列中提取对应事件类型的触发词和论元，实现事件触发词和论元的联合抽取。

[结果]在包含复杂事件信息文本的事件数据集中，事件触发词抽取的达到77.67%，事件论元抽取的达到65.06%，相较于最优的基准方法分别提升了2.43%和1.62%。

[局限]模型仅局限于句子级文本，且仅在编码层对提示符进行调优。

[结论]本文模型基于提示符调优能够在减少模板构建成本的同时保持相同甚至更优的性能，并且模型能够识别具有复杂事件信息的文本，有效提升了事件元素多标签分类的效果。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章

关键词 ：中文事件抽取, 预训练语言模型, 提示学习, 联合学习

Abstract：

[Objective] In view of the the shortcomings of the existing event extraction models based on sequence labeling and text generation, this paper proposed a joint event extraction model using an automatically constructed template to elicit the knowledge of pre-trained language model.

[Methods] This paper designed an automatic template construction strategy based on the Event Prompt to generate unified prompt templates. The Event Prompt Embedding Layer was introduced for the Event Prompt in the encoding layer, and then connected to the pre-trained BART model to capture the semantic information of the sentence, generated the corresponding prediction sequence, and extracted the event trigger words and arguments of the corresponding event type contained in the original text from the prediction sequence, realized the joint extraction of event trigger words and arguments.

[Results] In the event dataset containing complex event information text, the F1 of event trigger word extraction reached 77.67%, and the F1 of event argument extraction reached 65.06%, 2.43% and 1.62% higher than the optimal baseline method respectively.

[Limitations] This paper only considered sentence level text, and only optimized the prompt in the encoding layer.

[Conclusions] The prompt based optimization can reduce the cost of template construction while maintaining the same or even better performance. And this model can recognize the complex event information contained in the text, which effectively improved the effect of multi-label classification of event elements.

Key words： Chinese event extraction Pre-trained language model Prompt learning Joint learning

出版日期: 2022-11-11

ZTFLH:

TP183

引用本文:

陈诺, 李旭晖. 一种基于模板提示学习的事件抽取方法 [J]. 数据分析与知识发现, 10.11925/infotech.2096-3467.2022-0495.
Chen Nuo, Li Xuhui. An Event Extraction Method Based on Template Prompt Learning . Data Analysis and Knowledge Discovery, 0, (): 1-.

链接本文:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2022-0495 或 https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y0/V/I/1

[1]	黄泰峰, 马静. 基于提示学习增强的文本情感分类模型*[J]. 数据分析与知识发现, 2024, 8(3): 77-84.
[2]	唐雪梅, 苏祺, 王军. 融合实体信息的古汉语关系分类研究^*[J]. 数据分析与知识发现, 2024, 8(1): 114-124.
[3]	鲍彤, 章成志. ChatGPT中文信息抽取能力测评——以三种典型的抽取任务为例^*[J]. 数据分析与知识发现, 2023, 7(9): 1-11.
[4]	邓宇扬, 吴丹. 面向藏族传统节日的汉藏双语命名实体识别研究^*[J]. 数据分析与知识发现, 2023, 7(7): 125-135.
[5]	陈诺, 李旭晖. 一种基于模板提示学习的事件抽取方法^*[J]. 数据分析与知识发现, 2023, 7(6): 86-98.
[6]	李岱峰, 林凯欣, 李栩婷. 基于提示学习与T5 PEGASUS的图书宣传自动摘要生成器^*[J]. 数据分析与知识发现, 2023, 7(3): 121-130.
[7]	张逸勤, 邓三鸿, 胡昊天, 王东波. 预训练模型视角下的跨语言典籍风格计算研究^*[J]. 数据分析与知识发现, 2023, 7(10): 50-62.
[8]	叶瀚,孙海春,李欣,焦凯楠. 融合注意力机制与句向量压缩的长文本分类模型[J]. 数据分析与知识发现, 2022, 6(6): 84-94.
[9]	景慎旗, 赵又霖. 基于医学领域知识和远程监督的医学实体关系抽取研究^*[J]. 数据分析与知识发现, 2022, 6(6): 105-114.
[10]	王义真,欧石燕,陈金菊. 民事裁判文书两阶段式自动摘要研究^*[J]. 数据分析与知识发现, 2021, 5(5): 104-114.
[11]	沈卓,李艳. 基于PreLM-FT细粒度情感分析的餐饮业用户评论挖掘[J]. 数据分析与知识发现, 2020, 4(4): 63-71.

Viewed

Full text

Abstract

Cited

Shared

Discussed