基于ChatGPT的多视角学术论文实体识别：性能测评与可用性研究<sup>*</sup>

doi:10.11925/infotech.2096-3467.2023.0474

数据分析与知识发现

2023, Vol. 7

Issue (9): 12-24 https://doi.org/10.11925/infotech.2096-3467.2023.0474

研究论文

本期目录 | 过刊浏览 | 高级检索

基于ChatGPT的多视角学术论文实体识别：性能测评与可用性研究^*

张颖怡¹,章成志²(

),周毅¹,陈必坤¹

¹苏州大学社会学院苏州 215123
²南京理工大学经济管理学院南京 210094

ChatGPT-Based Scientific Paper Entity Recognition: Performance Measurement and Availability Research

Zhang Yingyi¹,Zhang Chengzhi²(

),Zhou Yi¹,Chen Bikun¹

¹School of Sociology, Soochow University, Suzhou 215123, China
²School of Economics & Management, Nanjing University of Science and Technology, Nanjing 210094, China

摘要
图/表
参考文献
相关文章
Metrics

全文: PDF (1911 KB) HTML ( 35 )
输出: BibTeX | EndNote (RIS)

摘要

【目的】分析基于大规模语言模型的提示学习方法在学术论文实体识别任务上的可用性。【方法】以ChatGPT这一大规模语言模型为例，将ChatGPT视为实体识别工具、伪标签生成工具以及训练数据生成工具，从性能、价格和时间等维度出发分析以上三个视角下ChatGPT的可用性。【结果】三个视角下基于ChatGPT的方法的F1值高于少量样本训练得到的神经网络基线模型，比如实体识别工具视角的F1宏平均值超过10个学术论文人工标注摘要训练得到的模型21.4个百分点。基于ChatGPT的方法在不同学科领域的学术论文数据集上性能较稳定。【局限】仅在英文学术论文摘要数据集上展开实验，但中文与英文学术论文、学术论文摘要与全文存在逻辑结构和表述上的差异。【结论】当缺少人工标注数据时，将ChatGPT视为实体识别工具可从学术论文摘要中识别出部分实体，但识别结果需进一步过滤以应用到下游任务中。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	张颖怡
	章成志
	周毅
	陈必坤

关键词 ： ChatGPT, AIGC, 学术文本信息抽取, 学术实体抽取

Abstract：

[Objective] This paper aims to use a large language model for entity recognition tasks of academic papers. [Methods] We utilized ChatGPT, a large language model, as an entity recognition tool, a pseudo-label generation tool, and a training set generation tool. Then, we analyzed ChatGPT’s performance, price, and time for the tasks. [Results] The F1 of the ChatGPT-based method in all three perspectives is higher than that of the neural network baseline model trained with a small dataset. For example, the F1 from the perspective of entity recognition was 21.4% higher than the model trained by manually annotating 10 abstracts. The ChatGPT-based methods had stable performance on academic paper datasets in different disciplines. [Limitations] We only examined the new method with English academic paper abstract datasets. More research is needed to examine it with the Chinese datasets. [Conclusions] ChatGPT can identify entities from academic paper abstracts with little manually annotated data. The recognition results need to be further filtered to be applied to downstream tasks.

Key words： ChatGPT AIGC Scientific Paper Information Extraction Scientific Entity Extraction

收稿日期: 2023-05-19 出版日期: 2023-09-12

ZTFLH:	G350
	TP391

基金资助:*国家自然科学基金项目(72074113);苏州大学人文社会科学青年交叉研究团队项目的研究成果之一。

通讯作者: 章成志，ORCID：0000-0001-9522-2914，E-mail： zhangcz@njust.edu.cn。

引用本文:

张颖怡, 章成志, 周毅, 陈必坤. 基于ChatGPT的多视角学术论文实体识别：性能测评与可用性研究^*[J]. 数据分析与知识发现, 2023, 7(9): 12-24.
Zhang Yingyi, Zhang Chengzhi, Zhou Yi, Chen Bikun. ChatGPT-Based Scientific Paper Entity Recognition: Performance Measurement and Availability Research. Data Analysis and Knowledge Discovery, 2023, 7(9): 12-24.

链接本文:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2023.0474 或 https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2023/V7/I9/12

Fig.1 研究框架

Table 1 数据集统计信息

Fig.2 基于ChatGPT的单阶段实体识别模型
（注：蓝色“DEFINITION”代表实体定义信息；绿色“FORM”代表模板信息；紫色“SENTENCE”代表句子信息，下同。）

Fig.3 基于ChatGPT的两阶段实体识别模型
（注：橙色“CATEGORY”代表实体类型。）

Fig.4 基于ChatGPT的伪标签生成与实体识别流程

Fig.5 基于ChatGPT的训练数据生成与实体识别模型
（注：绿色“NUM”代表需生成的实体数量，紫色“ENTITIES”代表实体。）

Table 2 SCIERC实体识别结果

Fig.6 STM实体识别结果
（注：“Data”表示数据实体。“实体识别-单阶段”和“实体识别-两阶段”是实体识别视角下的基于ChatGPT的方法。）

Fig.7 SCIERC数据集上机器识别结果与人工标注结果词语重合度
（注：数值越大，表示实体重合度越高。下同。）

Fig.8 STM农学领域数据中机器识别结果与人工标注结果词语重合度

Fig.9 STM化学领域数据中机器识别结果与人工标注结果词语重合度

Table 3 SCIERC和STM数据集中词语类型错误数量百分比

Table 4 SCIERC和STM数据集中其他错误数量百分比

Table 5 基于ChatGPT的实体识别方法所需的价格与时间

[1]	Heffernan K, Teufel S. Identifying Problems and Solutions in Scientific Text[J]. Scientometrics, 2018, 116(2): 1367-1382. doi: 10.1007/s11192-018-2718-6 pmid: 30147202
[2]	Kovačević A, Konjović Z, Milosavljević B, et al. Mining Methodologies from NLP Publications: A Case Study in Automatic Terminology Recognition[J]. Computer Speech & Language, 2012, 26(2): 105-126.
[3]	Luo Z R, Lu W, He J G, et al. Combination of Research Questions and Methods: A New Measurement of Scientific Novelty[J]. Journal of Informetrics, 2022, 16(2): Article No. 101282.
[4]	马费成, 张帅. 我国图书情报领域新兴交叉学科发展探析[J]. 中国图书馆学报, 2023, 49(2): 4-14.
[4]	(Ma Feicheng, Zhang Shuai. The Development of Emerging Interdisciplines in Library and Information Science in China[J]. Journal of Library Science in China, 2023, 49(2): 4-14.)
[5]	Wadden D, Wennberg U, Luan Y, et al. Entity, Relation, and Event Extraction with Contextualized Span Representations[C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. ACL, 2019: 5784-5789.
[6]	Zhong Z X, Chen D Q. A Frustratingly Easy Approach for Entity and Relation Extraction[C]// Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. 2021: 50-61.
[7]	Zhang H H, Ren F L. BERTatDE at SemEval-2020 Task 6: Extracting Term-Definition Pairs in Free Text Using Pre-Trained Model[C]// Proceedings of the 14th Workshop on Semantic Evaluation. 2020: 690-696.
[8]	Ding N, Chen Y L, Han X, et al. Prompt-Learning for Fine-Grained Entity Typing[OL]. arXiv Preprint, arXiv: 2108.10604.
[9]	Kan Z G, Feng L H, Yin Z Y, et al. A Unified Generative Framework Based on Prompt Learning for Various Information Extraction Tasks[OL]. arXiv Preprint, arXiv: 2209.11570.
[10]	李鸿鹏, 马博, 杨雅婷, 等. 基于槽位语义增强提示学习的篇章级事件抽取方法[J]. 计算机工程, 2023, 49(9):23-31. doi: 10.19678/j.issn.1000-3428.0066170
[10]	(Li Hongpeng, Ma Bo, Yang Yating, et al. Document-Level Event Extraction Based on Slot Semantic Enhanced Prompt Learning[J]. Computer Engineering, 2023, 49(9):23-31.) doi: 10.19678/j.issn.1000-3428.0066170
[11]	张华平, 李林翰, 李春锦. ChatGPT中文性能测评与风险应对[J]. 数据分析与知识发现, 2023, 7(3): 16-25.
[11]	(Zhang Huaping, Li Linhan, Li Chunjin. ChatGPT Performance Evaluation on Chinese Language and Risk Measures[J]. Data Analysis and Knowledge Discovery, 2023, 7(3): 16-25.)
[12]	Ma Y B, Cao Y X, Hong Y, et al. Large Language Model is Not a Good Few-Shot Information Extractor, But a Good Reranker for Hard Samples![OL]. arXiv Preprint, arXiv: 2303.08559.
[13]	Polak M P, Morgan D. Extracting Accurate Materials Data from Research Papers with Conversational Language Models and Prompt Engineering[OL]. arXiv Preprint, arXiv: 2303.05352.
[14]	Das A, Du X Y, Wang B, et al. Automatic Error Analysis for Document-Level Information Extraction[C]// Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. ACL, 2022: 3960-3975.
[15]	Ding B S, Qin C W, Liu L L, et al. Is GPT-3 a Good Data Annotator?[OL]. arXiv Preprint, arXiv: 2212.10450.
[16]	Wei X, Cui X Y, Cheng N, et al. Zero-Shot Information Extraction via Chatting with ChatGPT[OL]. arXiv Preprint, arXiv: 2302.10205.
[17]	Agrawal M, Hegselmann S, Lang H, et al. Large Language Models are Few-Shot Clinical Information Extractors[C]// Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022: 1998-2022.
[18]	Gutiérrez B J, McNeal N, Washington C, et al. Thinking About GPT-3 In-Context Learning for Biomedical IE? Think Again[OL]. arXiv Preprint, arXiv: 2203.08410.
[19]	Luan Y, He L H, Ostendorf M, et al. Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction[C]// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. ACL, 2018: 3219-3232.
[20]	Brack A, D’Souza J, Hoppe A, et al. Domain-Independent Extraction of Scientific Concepts from Research Articles[C]// Proceedings of the 42nd European Conference on Information Retrieval Research. Springer, 2020: 251-266.

[1]	鲍彤, 章成志. ChatGPT中文信息抽取能力测评——以三种典型的抽取任务为例^*[J]. 数据分析与知识发现, 2023, 7(9): 1-11.
[2]	张华平, 李林翰, 李春锦. ChatGPT中文性能测评与风险应对^*[J]. 数据分析与知识发现, 2023, 7(3): 16-25.
[3]	赵朝阳, 朱贵波, 王金桥. ChatGPT给语言大模型带来的启示和多模态大模型新的发展思路^*[J]. 数据分析与知识发现, 2023, 7(3): 26-35.
[4]	张智雄, 于改红, 刘熠, 林歆, 张梦婷, 钱力. ChatGPT对文献情报工作的影响^*[J]. 数据分析与知识发现, 2023, 7(3): 36-42.
[5]	钱力, 刘熠, 张智雄, 李雪思, 谢靖, 许钦亚, 黎洋, 管铮懿, 李西雨, 文森. ChatGPT的技术基础分析^*[J]. 数据分析与知识发现, 2023, 7(3): 6-15.

Viewed

Full text

Abstract

Cited

Shared

Discussed