Identifying Topic-Problem Instances Based on Syntactic Dependency Enhancement
Wang Lu1,2,Le Xiaoqiu1,2()
1National Science Library, Chinese Academy of Sciences, Beijing 100190, China 2Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China
[Objective] This paper aims to identify the defects, deficiencies, and difficulties of existing research on a given topic. [Methods] First, we transformed the topic-problem instance pair extraction to candidate phrase classification. Then, we extracted candidate phrases from the problem sentences, and constructed a syntactic dependency tree. Third, we built a syntactic dependency enhanced classification model based on BiGCN and Transformer interaction module, Fourth, we used this new model to identify the problem instances from the candidate phrases corresponding to a given topic. [Results] The proposed model effectively identified the problem instances and topic-problem instances. Its F1 value reached 83.7%, which is 2.8 percentage point higher than the baseline model. [Limitations] We did not examine the referential relationship between sentences, which may omit some problem instances and reduce the recall rates. [Conclusions] The proposed model could effectively identify the topic and problem instances.
王露, 乐小虬. 基于句法依赖增强的主题-问题实例识别方法研究[J]. 数据分析与知识发现, 2022, 6(12): 13-22.
Wang Lu, Le Xiaoqiu. Identifying Topic-Problem Instances Based on Syntactic Dependency Enhancement. Data Analysis and Knowledge Discovery, 2022, 6(12): 13-22.
(Deng Siyi. Identification Method of the Core “Problem-Method-Conclusion” Coreference Triple in a Single Scientific Paper[D]. Beijing: University of Chinese Academy of Sciences, 2020.)
[2]
Heffernan K, Teufel S. Identifying Problems and Solutions in Scientific Text[J]. Scientometrics, 2018, 116(2):1367-1382.
doi: 10.1007/s11192-018-2718-6
pmid: 30147202
[3]
Mishra R B, Jiang H B. Classification of Problem and Solution Strings in Scientific Texts: Evaluation of the Effectiveness of Machine Learning Classifiers and Deep Neural Networks[J]. Applied Sciences, 2021, 11(21):9997.
doi: 10.3390/app11219997
(Xu Zhenzhen, Zhang Junsheng, Liu Wenbin. Automatically Discovering Associations among Technologies in Scientific Literature[J]. Library and Information Service, 2021, 65(20): 113-122.)
doi: 10.13266/j.issn.0252-3116.2021.20.012
(Wang Yanyan, Zhang Junsheng, Qiao Xiaodong, et al. Evaluating Novelty of Scientific Literature Based on Question-Method Matrix[J]. Information Studies: Theory & Application, 2021, 44(2): 90-95.)
[6]
Sasaki H, Yamamoto S, Agchbayar A, et al. Extracting Problem Linkages to Improve Knowledge Exchange Between Science and Technology Domains Using an Attention-Based Language Model[J]. Engineering, Technology & Applied Science Research, 2020, 10(4): 5903-5913.
(Chen Guo, Peng Jiabin, Xiao Lu. Knowledge Evolution of Scientific Research Domains Based on Problem-Solution Knowledge Extraction: A Case Study of Artificial Intelligence[J]. Information Studies: Theory & Application, 2022, 45(6): 32-38.)
(Lu Wei, Li Pengcheng, Zhang Guobiao, et al. Recognition of Lexical Functions in Academic Texts: Automatic Classification of Keywords Based on BERT Vectorization[J]. Journal of the China Society for Scientific and Technical Information, 2020, 39(12): 1320-1329.)
(Qian Jiajia, Luo Zhuoran, Lu Wei. Novelty Measurement and Innovation Type Identification of Scientific Literature Based on Question-Method Combination[J]. Library and Information Service, 2021, 65(14): 82-89.)
doi: 10.13266/j.issn.0252-3116.2021.14.010
[10]
D’Souza J, Auer S. Pattern-Based Acquisition of Scientific Entities from Scholarly Article Titles[C]// Proceedings of the 23rd International Conference on Asia-Pacific Digital Libraries. 2021: 401-410.
[11]
Kipf T N, Welling M. Semi-Supervised Classification with Graph Convolutional Networks[OL]. arXiv Preprint, arXiv: 1609.02907.
[12]
Vaswani A, Shazeer N, Parmar N, et al. Attention is all You Need[C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017: 6000-6010.
(Li Xin, Cheng Qikai, Liu Xingbang. Design and Implementation of Scientific Literature Analysis System Based on Term Function Recognition[J]. Library and Information Service, 2017, 61(1): 109-116.)
doi: 10.13266/j.issn.0252-3116.2017.01.013
[14]
Asadi N, Badie K, Mahmoudi M T. Automatic Zone Identification in Scientific Papers via Fusion Techniques[J]. Scientometrics, 2019, 119(2): 845-862.
doi: 10.1007/s11192-019-03060-9
[15]
Howard J, Ruder S. Universal Language Model Fine-Tuning for Text Classification[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1:Long Papers). 2018: 328-339.
[16]
Merity S, Keskar N S, Socher R. Regularizing and Optimizing LSTM Language Models[OL]. arXiv Preprint, arXiv: 1708.02182.
Ge S Y, Huang J X, Meng Y, et al. Fine-Grained Opinion Summarization with Minimal Supervision[OL]. arXiv Preprint, arXiv: 2110.08845.
[19]
Tang H, Ji D H, Li C L, et al. Dependency Graph Enhanced Dual-transformer Structure for Aspect-Based Sentiment Classification[C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020: 6578-6588.
[20]
Xing B W, Tsang I. DigNet: Digging Clues from Local-Global Interactive Graph for Aspect-Level Sentiment Classification[OL]. arXiv Preprint, arXiv: 2201.00989.
[21]
Bahdanau D, Cho K, Bengio Y. Neural Machine Translation by Jointly Learning to Align and Translate[OL]. arXiv Preprint, arXiv: 1409.0473.
[22]
Wang Y Q, Huang M L, Zhu X Y, et al. Attention-Based LSTM for Aspect-Level Sentiment Classification[C]// Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. 2016: 606-615.
[23]
Liu J M, Zhang Y. Attention Modeling for Targeted Sentiment[C]// Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers. 2017: 572-577.
[24]
Li X, Bing L D, Lam W, et al. Transformation Networks for Target-Oriented Sentiment Classification[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers). 2018: 946-956.
[25]
Xue W, Li T. Aspect Based Sentiment Analysis with Gated Convolutional Networks[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1:Long Papers). 2018: 2514-2523.
[26]
Zhang C, Li Q C, Song D W. Aspect-Based Sentiment Classification with Aspect-Specific Graph Convolutional Networks[C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. 2019: 4568-4578.
[27]
Wang K, Shen W Z, Yang Y Y, et al. Relational Graph Attention Network for Aspect-Based Sentiment Analysis[C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020: 3229-3238.
[28]
Manning C D, Surdeanu M, Bauer J, et al. The Stanford CoreNLP Natural Language Processing Toolkit[C]// Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics:System Demonstrations. 2014: 55-60.
[29]
Pennington J, Socher R, Manning C. GloVe: Global Vectors for Word Representation[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014: 1532-1543.
[30]
Kingma D P, Ba J. Adam: A Method for Stochastic Optimization[OL]. arXiv Preprint, arXiv: 1412.6980.