Please wait a minute...
Data Analysis and Knowledge Discovery  2023, Vol. 7 Issue (9): 12-24    DOI: 10.11925/infotech.2096-3467.2023.0474
Current Issue | Archive | Adv Search |
ChatGPT-Based Scientific Paper Entity Recognition: Performance Measurement and Availability Research
Zhang Yingyi1,Zhang Chengzhi2(),Zhou Yi1,Chen Bikun1
1School of Sociology, Soochow University, Suzhou 215123, China
2School of Economics & Management, Nanjing University of Science and Technology, Nanjing 210094, China
Download: PDF (1911 KB)   HTML ( 35
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper aims to use a large language model for entity recognition tasks of academic papers. [Methods] We utilized ChatGPT, a large language model, as an entity recognition tool, a pseudo-label generation tool, and a training set generation tool. Then, we analyzed ChatGPT’s performance, price, and time for the tasks. [Results] The F1 of the ChatGPT-based method in all three perspectives is higher than that of the neural network baseline model trained with a small dataset. For example, the F1 from the perspective of entity recognition was 21.4% higher than the model trained by manually annotating 10 abstracts. The ChatGPT-based methods had stable performance on academic paper datasets in different disciplines. [Limitations] We only examined the new method with English academic paper abstract datasets. More research is needed to examine it with the Chinese datasets. [Conclusions] ChatGPT can identify entities from academic paper abstracts with little manually annotated data. The recognition results need to be further filtered to be applied to downstream tasks.

Key wordsChatGPT      AIGC      Scientific Paper Information Extraction      Scientific Entity Extraction     
Received: 19 May 2023      Published: 12 September 2023
ZTFLH:  G350  
  TP391  
Fund:The National Natural Science Foundation of China(72074113);The Youth Cross Research Team Project of Social Sciences of Soochow University
Corresponding Authors: Zhang Chengzhi,ORCID:0000-0001-9522-2914,E-mail: zhangcz@njust.edu.cn。   

Cite this article:

Zhang Yingyi, Zhang Chengzhi, Zhou Yi, Chen Bikun. ChatGPT-Based Scientific Paper Entity Recognition: Performance Measurement and Availability Research. Data Analysis and Knowledge Discovery, 2023, 7(9): 12-24.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2023.0474     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2023/V7/I9/12

Research Framework
统计类型 SCIERC STM
句子数 2 761 1 163
句子数/文章数 5.52 10.57
句子数/领域数 / 116.3
实体数 4 482 3 962
实体数/文章数 8.96 36.02
实体数/领域数 / 396.2
任务实体数 1 280 /
方法实体数 2 092 260
评价指标实体数 340 /
材料实体数 770 2 096
数据实体数 / 1 606
Statistical Information of Datasets
ChatGPT Based Single Stage Entity Recognition Model
ChatGPT Based Two-Stage Entity Recognition Model
ChatGPT Based Pseudo Label Generation and Entity Recognition Process
ChatGPT Based Training Data Generation and Entity Recognition Model
指标
模型
Macro
P(%)
Macro
R(%)
Macro
F1(%)
Task
F1(%)
Meth
F1(%)
Mate
F1(%)
Metr
F1(%)
基线模型 5 0 0 0 0 0 0 0
10 11.2 9.5 9.3 14.7 22.6 0 0
20 20.1 20.8 20.1 36.2 44.2 0 0
50 41.4 51.3 45.7 48.1 59.1 49.8 25.9
100 52.9 64.0 57.9 53.9 67.0 62.8 53.1
200 63.4 63.9 63.6 57.0 71.5 58.6 67.3
ALL 65.3 69.5 67.3 59.7 74.6 66.7 68.1
ChatGPT 实体识别 43.7 24.9 30.7 25.1 29.1 34.7 33.8
伪标签生成 32.7 28.8 29.8 25.8 30.9 31.2 31.0
训练数据生成 23.1 6.5 9.6 17.0 11.8 8.4 1.4
Entity Recognition Results on the SCIERC Dataset
Entity Recognition Results on the STM Dataset
The Span Overlap of Machine Recognition Results and Manual Annotation Results on SCIERC Dataset
The Span Overlap of Machine Recognition Results and Manual Annotation Results in the Agricultural Field on STM Dataset
The Span Overlap of Machine Recognition Results and Manual Annotation Results in the Chemistry Field on STM Dataset
数据模型 SCIERC 化学 农学 天文学 地球
科学
材料
科学
基线模型 22.2 9.5 17.5 8.1 22.4 17.5
实体识别 26.6 19.4 30.9 16 38.8 24.4
伪标签生成 27.3 10.8 26.4 12.3 14.9 13.6
训练数据生成 25.3 2.4 20.3 0.0 25.0 20.3
Percentage of Type Errors on SCIERC and STM Datasets
数据模型 SCIERC 化学 农学 天文学 地球
科学
材料
科学
基线模型 40.1 38.1 31.6 64.9 40.3 27.5
实体识别 49.0 45.1 50.9 58.0 36.7 36.6
伪标签生成 63.0 41.2 25.3 49.1 17.0 39.0
训练数据生成 66.8 17.1 34.4 50.0 17.3 20.4
Percentage of Other Errors on SCIERC and STM Datasets
数据方法 SCIERC STM
每分钟处理数量 价格/单篇摘要 每分钟处理数量 价格/单个句子
单阶段实体识别 5篇摘要 0.002美元 24个句子 0.000 4美元
两阶段实体识别 / / 14个句子 0.001美元
伪标签生成 5篇摘要 0.002美元 24个句子 0.000 4美元
训练数据生成 50个实体/ 3篇摘要 0.002美元 50个实体/15个句子 0.000 4美元
Price and Time Required for ChatGPT Based Entity Recognition Methods
[1] Heffernan K, Teufel S. Identifying Problems and Solutions in Scientific Text[J]. Scientometrics, 2018, 116(2): 1367-1382.
doi: 10.1007/s11192-018-2718-6 pmid: 30147202
[2] Kovačević A, Konjović Z, Milosavljević B, et al. Mining Methodologies from NLP Publications: A Case Study in Automatic Terminology Recognition[J]. Computer Speech & Language, 2012, 26(2): 105-126.
[3] Luo Z R, Lu W, He J G, et al. Combination of Research Questions and Methods: A New Measurement of Scientific Novelty[J]. Journal of Informetrics, 2022, 16(2): Article No. 101282.
[4] 马费成, 张帅. 我国图书情报领域新兴交叉学科发展探析[J]. 中国图书馆学报, 2023, 49(2): 4-14.
[4] (Ma Feicheng, Zhang Shuai. The Development of Emerging Interdisciplines in Library and Information Science in China[J]. Journal of Library Science in China, 2023, 49(2): 4-14.)
[5] Wadden D, Wennberg U, Luan Y, et al. Entity, Relation, and Event Extraction with Contextualized Span Representations[C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. ACL, 2019: 5784-5789.
[6] Zhong Z X, Chen D Q. A Frustratingly Easy Approach for Entity and Relation Extraction[C]// Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. 2021: 50-61.
[7] Zhang H H, Ren F L. BERTatDE at SemEval-2020 Task 6: Extracting Term-Definition Pairs in Free Text Using Pre-Trained Model[C]// Proceedings of the 14th Workshop on Semantic Evaluation. 2020: 690-696.
[8] Ding N, Chen Y L, Han X, et al. Prompt-Learning for Fine-Grained Entity Typing[OL]. arXiv Preprint, arXiv: 2108.10604.
[9] Kan Z G, Feng L H, Yin Z Y, et al. A Unified Generative Framework Based on Prompt Learning for Various Information Extraction Tasks[OL]. arXiv Preprint, arXiv: 2209.11570.
[10] 李鸿鹏, 马博, 杨雅婷, 等. 基于槽位语义增强提示学习的篇章级事件抽取方法[J]. 计算机工程, 2023, 49(9):23-31.
doi: 10.19678/j.issn.1000-3428.0066170
[10] (Li Hongpeng, Ma Bo, Yang Yating, et al. Document-Level Event Extraction Based on Slot Semantic Enhanced Prompt Learning[J]. Computer Engineering, 2023, 49(9):23-31.)
doi: 10.19678/j.issn.1000-3428.0066170
[11] 张华平, 李林翰, 李春锦. ChatGPT中文性能测评与风险应对[J]. 数据分析与知识发现, 2023, 7(3): 16-25.
[11] (Zhang Huaping, Li Linhan, Li Chunjin. ChatGPT Performance Evaluation on Chinese Language and Risk Measures[J]. Data Analysis and Knowledge Discovery, 2023, 7(3): 16-25.)
[12] Ma Y B, Cao Y X, Hong Y, et al. Large Language Model is Not a Good Few-Shot Information Extractor, But a Good Reranker for Hard Samples![OL]. arXiv Preprint, arXiv: 2303.08559.
[13] Polak M P, Morgan D. Extracting Accurate Materials Data from Research Papers with Conversational Language Models and Prompt Engineering[OL]. arXiv Preprint, arXiv: 2303.05352.
[14] Das A, Du X Y, Wang B, et al. Automatic Error Analysis for Document-Level Information Extraction[C]// Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. ACL, 2022: 3960-3975.
[15] Ding B S, Qin C W, Liu L L, et al. Is GPT-3 a Good Data Annotator?[OL]. arXiv Preprint, arXiv: 2212.10450.
[16] Wei X, Cui X Y, Cheng N, et al. Zero-Shot Information Extraction via Chatting with ChatGPT[OL]. arXiv Preprint, arXiv: 2302.10205.
[17] Agrawal M, Hegselmann S, Lang H, et al. Large Language Models are Few-Shot Clinical Information Extractors[C]// Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022: 1998-2022.
[18] Gutiérrez B J, McNeal N, Washington C, et al. Thinking About GPT-3 In-Context Learning for Biomedical IE? Think Again[OL]. arXiv Preprint, arXiv: 2203.08410.
[19] Luan Y, He L H, Ostendorf M, et al. Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction[C]// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. ACL, 2018: 3219-3232.
[20] Brack A, D’Souza J, Hoppe A, et al. Domain-Independent Extraction of Scientific Concepts from Research Articles[C]// Proceedings of the 42nd European Conference on Information Retrieval Research. Springer, 2020: 251-266.
[1] Bao Tong, Zhang Chengzhi. Extracting Chinese Information with ChatGPT:An Empirical Study by Three Typical Tasks[J]. 数据分析与知识发现, 2023, 7(9): 1-11.
[2] Zhang Huaping, Li Linhan, Li Chunjin. ChatGPT Performance Evaluation on Chinese Language and Risk Measures[J]. 数据分析与知识发现, 2023, 7(3): 16-25.
[3] Zhao Chaoyang, Zhu Guibo, Wang Jinqiao. The Inspiration Brought by ChatGPT to LLM and the New Development Ideas of Multi-modal Large Model[J]. 数据分析与知识发现, 2023, 7(3): 26-35.
[4] Zhang Zhixiong, Yu Gaihong, Liu Yi, Lin Xin, Zhang Menting, Qian Li. The Influence of ChatGPT on Library & Information Services[J]. 数据分析与知识发现, 2023, 7(3): 36-42.
[5] Qian Li, Liu Yi, Zhang Zhixiong, Li Xuesi, Xie Jing, Xu Qinya, Li Yang, Guan Zhengyi, Li Xiyu, Wen Sen. An Analysis on the Basic Technologies of ChatGPT[J]. 数据分析与知识发现, 2023, 7(3): 6-15.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn