New Technology of Library and Information Service  2016, Vol. 32 Issue (5): 1-8    DOI: 10.11925/infotech.1003-3513.2016.05.01
Original Article
Review of Citation-based Automatic Summarization Studies
Liu Tianyi, Bu Yi, Zhao Danqun, Huang Wenbin
Department of Information Management, Peking University, Beijing 100871, China

[Objective] This paper reviews in depth the research methodologies most widely adopted in Citation-Based Summarization (CBS) studies. [Coverage] We retrieved scholarly papers on CBS published since 2007, as well as earlier research on automatic summarization and citation analysis. [Methods] We discuss the basic concepts of CBS and the natural language processing techniques used in the field. [Results] Citances (citation sentences) play a more important role in automatic summarization applications than sentences randomly selected from scientific works. [Limitations] We did not compare current achievements with the results that might be obtained under ideal circumstances. [Conclusions] CBS expands the scope of traditional informetrics and automatic summarization studies, and it offers suggestions for improving existing methods of evaluating automatic summarization services. CBS calls for wider citation windows and new experimental corpora. We address these issues and explore new perspectives for CBS research.

Key words: Automatic summarization; Citation-based summarization; Citance; Natural Language Processing
Received: 21 October 2015      Published: 24 June 2016

Cite this article:

Liu Tianyi, Bu Yi, Zhao Danqun, Huang Wenbin. Review of Citation-based Automatic Summarization Studies. New Technology of Library and Information Service, 2016, 32(5): 1-8.

