Please wait a minute...
Data Analysis and Knowledge Discovery
Current Issue | Archive | Adv Search |
A books promotion abstractive summarization method based on prompt learning and T5 PEGASUS
Li Daifeng,Lin Kaixin,Li Xuting
(School of Information Management,Sun Yat-Sen University,Guangzhou 510006,China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] To generate accurate propaganda words of books from book information quickly, and reduce the manpower and material resources consumed by purely artificial means.

[Application Background] Currently, there are few researches on the generation of automatic publicity summary of books. The library and online book market mostly use manual method to write publicity words for books, which increases the work burden.

[Methods] Based on the idea of prompt learning, the data set is constructed by crawling the book information, data enhancement and keyword extraction are used to increase the information, and finally T5 PEGASUS is input to get the basic propaganda. Summaries of book reviews are added when the number of book reviews reaches the threshold.

[Results] Compared with the optimal baseline model, the results of  Rouge-1、Rouge-2、Rouge-L were improved by 28.9%, 37.6% and 31.9%, respectively.

[Conclusions] According to the characteristics of the book corpus, the propaganda generated by the experiment process has practical application value.


Key words Text summarization, Prompt learning, Data enhancement      Textrank,T5 PEGASUS      
Published: 29 July 2022
ZTFLH:  TP393,G250  

Cite this article:

Li Daifeng, Lin Kaixin, Li Xuting. A books promotion abstractive summarization method based on prompt learning and T5 PEGASUS . Data Analysis and Knowledge Discovery, 0, (): 1-.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2022-0350     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y0/V/I/1

[1] Huang Taifeng, Ma Jing. Text Sentiment Classification Algorithm Based on Prompt Learning Enhancement[J]. 数据分析与知识发现, 2024, 8(3): 77-84.
[2] Bao Ritong, Sun Haichun. An Overview of Research on Multi-Document Summarization[J]. 数据分析与知识发现, 2024, 8(2): 17-32.
[3] Liu Jiangfeng, Feng Yutong, Liu Liu, Shen Si, Wang Dongbo. Structural Recognition of Abstracts of Academic Text Enhanced by Domain Bilingual Data[J]. 数据分析与知识发现, 2023, 7(8): 105-118.
[4] Chen Nuo, Li Xuhui. An Event Extraction Method Based on Template Prompt Learning[J]. 数据分析与知识发现, 2023, 7(6): 86-98.
[5] Li Daifeng, Lin Kaixin, Li Xuting. Identifying Named Entities of Adverse Drug Reaction with Adversarial Transfer Learning[J]. 数据分析与知识发现, 2023, 7(3): 121-130.
[6] Zhao Yiming, Pan Pei, Mao Jin. Recognizing Intensity of Medical Query Intentions Based on Task Knowledge Fusion and Text Data Enhancement[J]. 数据分析与知识发现, 2023, 7(2): 38-47.
[7] Zeng Ziming, Zhang Yu. Rumor Detection of Public Health Emergencies Based on Data Augmentation and Multi-Task Learning[J]. 数据分析与知识发现, 2023, 7(11): 56-67.
[8] Liu Xingli, Fan Junjie, Ma Haiqun. Improvement of Data Augment Algorithm for Named Entity Recognition with Small Samples[J]. 数据分析与知识发现, 2022, 6(10): 128-141.
[9] Yu Yan, Zhu Shengchen. Extracting Patent Keywords by Integrating Restriction Relationship[J]. 数据分析与知识发现, 2022, 6(10): 57-67.
[10] Liu Tong,Liu Chen,Ni Weijian. A Semi-Supervised Sentiment Analysis Method for Chinese Based on Multi-Level Data Augmentation[J]. 数据分析与知识发现, 2021, 5(5): 51-58.
[11] Yan Qiang,Zhang Xiaoyan,Zhou Simin. Extracting Keywords Based on Sememe Similarity[J]. 数据分析与知识发现, 2021, 5(4): 80-89.
[12] Xia Tian. Extracting Key-phrases from Chinese Scholarly Papers[J]. 数据分析与知识发现, 2020, 4(7): 76-86.
[13] Mingzhu Sun,Jing Ma,Lingfei Qian. Extracting Keywords Based on Topic Structure and Word Diagram Iteration[J]. 数据分析与知识发现, 2019, 3(8): 68-76.
[14] An Wang,Yijun Gu,Kunming Li,Wenzheng Li. Extracting Keywords Based on Removed Network Word Nodes[J]. 数据分析与知识发现, 2019, 3(11): 35-44.
[15] Liu Zhuchen,Chen Hao,Yu Yanhua,Li Jie. Extracting Keywords with TextRank and Weighted Word Positions[J]. 数据分析与知识发现, 2018, 2(9): 74-79.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn