Please wait a minute...
Data Analysis and Knowledge Discovery  2020, Vol. 4 Issue (6): 43-50    DOI: 10.11925/infotech.2096-3467.2019.1320
Current Issue | Archive | Adv Search |
Generating Sentences of Contrast Relationship
Jiao Qihang,Le Xiaoqiu()
National Science Library, Chinese Academy of Sciences, Beijing 100190, China
Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China
Download: PDF (770 KB)   HTML ( 17
Export: BibTeX | EndNote (RIS)      

[Objective] This paper tries to generate contrastive sentences from two related paragraphs, aiming to establish a new model for creating contrastive paragraphs. [Methods] We generated contrastive sentences automatically from contrastive text sequences. We designed a deep learning model based on Seq2seq, which incorporated contrast features with character vectors to represent texts. Both the Encoder and Decoder layers of our model used BiLSTM structure, which also included attention mechanism. [Results] We examined the proposed model with manually annotated search lists and scientific papers. Then, we adopted BLEU as evaluation index for the results. The final evaluation score was 12.1, which was 6.5 higher than those of the benchmark model using BiLSTM + Attention. [Limitations] Due to the complexity of manually labeling, the data size in our experiments was small. [Conclusions] The proposed model could be used to build new model for generating contrastive paragraphs.

Key wordsContrast Relationship      Text Generation      Text Representation      Deep Learning     
Received: 10 December 2019      Published: 07 July 2020
ZTFLH:  TP391  
Corresponding Authors: Le Xiaoqiu     E-mail:

Cite this article:

Jiao Qihang,Le Xiaoqiu. Generating Sentences of Contrast Relationship. Data Analysis and Knowledge Discovery, 2020, 4(6): 43-50.

URL:     OR

科技论文中对比关系文本示例 查新单中对比关系文本示例
对于不同段落间篇章级并列关系的识别研究目前还较少。Zhao等在新闻推荐研究中采用序列标注方法,考虑句子出现在新闻文本中的位置信息,对新闻文本有并列关系但并不相似的语句进行识别,但所识别的句群分布在两篇论文中,尚未发现针对一篇文章内句群间并列关系的文本识别相关研究。 从检出文献看,在国内已有关于转运呼吸机的报道。常久利报道了一种新生儿专用急救综合治疗车,涉及呼吸机、暖箱的应用,呼吸机、暖箱采用蓄电池供电,与该查新项目采用车载电源并进行逆变匹配和响应略有不同,也未提及电源逆变的具体技术;南通市第一人民医院报道了…
Examples of Contrast Relationship Text
Generation Model Framework
参数 取值
Batch Size 16
字向量维度 64
学习率 10-3
隐藏层单元个数 1 024
输入文本截断 600
输出文本截断 200
Model Parameters
项目 配置
GPU TeslaP100
操作系统 Ubuntu18.04
内存 12GB
显存 16GB
Python版本 Python3.6.9
TensorFlow版本 Tensorflow1.15.0
Environment Configuration
LSTM 2.6
BiLSTM 2.9
BiLSTM+Attention 5.6
本文方法(BiLSTM+Attention+对比特征) 12.1
Model Experiment Results
查新文本+相关文本 基准模型(BiLSTM+Attention)生成文本 本文方法生成文本 人工生成文本
上述研究了用于呼吸机呼吸机的危护治装的危术,未涉及新生儿转运物的电变配和的,响应的技术。 上述文献报了了一新生儿专急救综综治疗车,涉及呼吸机、暖箱的合用,呼吸机、研究蓄电池供电,未提提电源逆变进行技术。 上述研究报道了一种新生儿专用急救综合治疗车,涉及呼吸机、暖箱的应用,呼吸机、暖箱采用蓄电池供电,与该查新项目采用车载电源并进行逆变匹配和响应略有不同,也未提及电源逆变的具体技术。
Senentce Generation Example of Contrast Relationship in Search List
[1] 万小军, 冯岩松, 孙薇薇. 文本自动生成研究进展与趋势[R]. 北京:北京大学, 2016: 1-2.
[1] ( Wan Xiaojun, Feng Yansong, Sun Weiwei. Research Progress and Trend of Automatic Text Generation[R]. Beijing: Peking University, 2016: 1-2.)
[2] Mihalcea R, Tarau P. TextRank: Bringing Order into Text [C]//Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing. 2004: 404-411.
[3] 林汝昌, 李曼珏. 语义的对比关系和对立关系[J]. 外语教学与研究, 1987(2):15-21.
[3] ( Lin Ruchang, Li Manjue. On Semantic Opposites and Contrasts[[J]. Foreign Language Teaching and Research, 1987(2):15-21.)
[4] 车竞. 现代汉语比较句论略[J]. 湖北师范学院学报:哲学社会科学版, 2005,25(3):60-63.
[4] ( Che Jing. A Brief Analysis of Comparative Sentences in Modern Chinese[J]. Journal of Hubei Normal University:Philosophy and Social Sciences, 2005,25(3):60-63.)
[5] 魏阳阳. 现代汉语三种平比句型的语义认知机制研究[J]. 理论月刊, 2017(12):75-80.
[5] ( Wei Yangyang. A Study on the Semantic Cognitive Mechanism of Three Parable Sentence Patterns in Modern Chinese[[J]. Theory Monthly, 2017(12):75-80.)
[6] Jindal N, Liu B. Identifying Comparative Sentences in Text Documents [C]//Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 2006: 244-251.
[7] 黄小江, 万小军, 杨建武, 等. 汉语比较句识别研究[J]. 中文信息学报, 2008,22(5):30-38.
[7] ( Huang Xiaojiang, Wan Xiaojun, Yang Jianwu, et al. Learning to Identify Chinese Comparative Sentences[J]. Journal of Chinese Information Processing, 2008,22(5):30-38.)
[8] 白林楠, 胡韧奋, 刘智颖. 基于句法语义规则系统的比较句自动识别[J]. 北京大学学报(自然科学版), 2015,51(2):275-281.
[8] ( Bai Linnan, Hu Renfen, Liu Zhiying. Recognition of Comparative Sentences Based on Syntactic and Semantic Rules-System[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2015,51(2):275-281.)
[9] 吴晨, 韦向峰. 用户评价中比较句的识别和倾向性分析[J]. 计算机科学, 2016,43(S1):435-439.
[9] ( Wu Chen, Wei Xiangfeng. Opinion Analysis and Recognition of Comparative Sentences in User Views[J]. Computer Science, 2016,43(S1):435-439.)
[10] 朱茂然, 王奕磊, 高松, 等. 中文比较关系的识别: 基于注意力机制的深度学习模型[J]. 情报学报, 2019,38(6):612-621.
[10] ( Zhu Maoran, Wang Yilei, Gao Song, el at. A Deep-Learning Model Based on Attention Mechanism for Chinese Comparative Relation Detection[J]. Journal of the China Society for Scientific and Technical Information, 2019,38(6):612-621.)
[11] Baxendale P B. Machine-made Index for Technical Literature—An Experiment[J]. IBM Journal of Research and Development, 1958,2(4):354-361.
doi: 10.1147/rd.24.0354
[12] Edmundson H P. New Methods in Automatic Extracting[J]. Journal of the ACM, 1969,16(2):264-285.
doi: 10.1145/321510.321519
[13] Gkatzia D, Lemon O, Rieser V. Natural Language Generation Enhances Human Decision-making with Uncertain Information[OL]. arXiv Preprint, arXiv: 1606. 03254.
[14] Lopez A. Statistical Machine Translation[J]. ACM Computing Surveys, 2008,40(3). DOI: 10.1145/1380584.1380586.
[15] Sutskever I, Vinyals O, Le Q V. Sequence to Sequence Learning with Neural Networks[OL]. arXiv Preprint, arXiv: 1409. 3215.
[16] Shi T, Keneshloo Y, Ramakrishnan N, et al. Neural Abstractive Text Summarization with Sequence-to-Sequence Models : A Survey [OL]. arXiv Preprint, arXiv: 1812. 02303.
[17] Jain P, Agrawal P, Mishra A, et al. Story Generation from Sequence of Independent Short Descriptions[OL]. arXiv Preprint, arXiv: 1707. 05501.
[18] Liu T, Wang K, Sha L, et al. Table-to-Text Generation by Structure-aware Seq2Seq Learning [C]//Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018.
[19] Deng Y, Kim Y, Chiu J, et al. Latent Alignment and Variational Attention [C]//Advances in Neural Information Processing Systems. 2018: 9712-9724.
[20] Li J, Monroe W, Shi T, et al. Adversarial Learning for Neural Dialogue Generation[OL]. arXiv Preprint, arXiv: 1701. 06547.
[21] Al-Rfou R, Perozzi B, Skiena S. Polyglot: Distributed Word Representations for Multilingual NLP[OL]. arXiv Preprint, arXiv: 1307. 1662.
[22] Papineni K, Roukos S, Ward T, et al. BLEU: A Method for Automatic Evaluation of Machine Translation [C]//Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 2002: 311-318.
[1] Zhou Zeyu,Wang Hao,Zhao Zibo,Li Yueyan,Zhang Xiaoqin. Construction and Application of GCN Model for Text Classification with Associated Information[J]. 数据分析与知识发现, 2021, 5(9): 31-41.
[2] Jiang Yaren, Le Xiaoqiu. Continual Learning for One-to-many Entity Relationship Generation with Small Samples[J]. 数据分析与知识发现, 2021, 5(8): 45-53.
[3] Xu Yuemei, Wang Zihou, Wu Zixin. Predicting Stock Trends with CNN-BiLSTM Based Multi-Feature Integration Model[J]. 数据分析与知识发现, 2021, 5(7): 126-138.
[4] Zhang Le, Leng Jidong, Lv Xueqiang, Cui Zhuo, Wang Lei, You Xindong. RLCPAR: A Rewriting Model for Chinese Patent Abstracts Based on Reinforcement Learning[J]. 数据分析与知识发现, 2021, 5(7): 59-69.
[5] Zhao Danning,Mu Dongmei,Bai Sen. Automatically Extracting Structural Elements of Sci-Tech Literature Abstracts Based on Deep Learning[J]. 数据分析与知识发现, 2021, 5(7): 70-80.
[6] Zhong Jiawa,Liu Wei,Wang Sili,Yang Heng. Review of Methods and Applications of Text Sentiment Analysis[J]. 数据分析与知识发现, 2021, 5(6): 1-13.
[7] Huang Mingxuan,Jiang Caoqing,Lu Shoudong. Expanding Queries Based on Word Embedding and Expansion Terms[J]. 数据分析与知识发现, 2021, 5(6): 115-125.
[8] Song Ruoxuan,Qian Li,Du Yu. Identifying Academic Creative Concept Topics Based on Future Work of Scientific Papers[J]. 数据分析与知识发现, 2021, 5(5): 10-20.
[9] Zhang Guobiao,Li Jie. Detecting Social Media Fake News with Semantic Consistency Between Multi-model Contents[J]. 数据分析与知识发现, 2021, 5(5): 21-29.
[10] Chang Chengyang,Wang Xiaodong,Zhang Shenglei. Polarity Analysis of Dynamic Political Sentiments from Tweets with Deep Learning Method[J]. 数据分析与知识发现, 2021, 5(3): 121-131.
[11] Feng Yong,Liu Yang,Xu Hongyan,Wang Rongbing,Zhang Yonggang. Recommendation Model Incorporating Neighbor Reviews for GRU Products[J]. 数据分析与知识发现, 2021, 5(3): 78-87.
[12] Cheng Bin,Shi Shuicai,Du Yuncheng,Xiao Shibin. Keyword Extraction for Journals Based on Part-of-Speech and BiLSTM-CRF Combined Model[J]. 数据分析与知识发现, 2021, 5(3): 101-108.
[13] Hu Haotian,Ji Jinfeng,Wang Dongbo,Deng Sanhong. An Integrated Platform for Food Safety Incident Entities Based on Deep Learning[J]. 数据分析与知识发现, 2021, 5(3): 12-24.
[14] Zhang Qi,Jiang Chuan,Ji Youshu,Feng Minxuan,Li Bin,Xu Chao,Liu Liu. Unified Model for Word Segmentation and POS Tagging of Multi-Domain Pre-Qin Literature[J]. 数据分析与知识发现, 2021, 5(3): 2-11.
[15] Lv Xueqiang,Luo Yixiong,Li Jiaquan,You Xindong. Review of Studies on Detecting Chinese Patent Infringements[J]. 数据分析与知识发现, 2021, 5(3): 60-68.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938