Please wait a minute...
Data Analysis and Knowledge Discovery  2024, Vol. 8 Issue (3): 120-131    DOI: 10.11925/infotech.2096-3467.2023.0092
Current Issue | Archive | Adv Search |
Multi-Round Iterative Retrieval Algorithm for Parsing Question-Answering Process
Zhou Changshun1,2,Ying Wenhao2(),Zhong Shan2,Gong Shengrong1,2
1School of Computer Science and Technology, Soochow University, Suzhou 215008, China
2School of Computer Science and Engineering, Changshu Institute of Technology, Changshu 215500, China
Download: PDF (1051 KB)   HTML ( 4
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper designs a retrieval model to explore the interpretability of question-answering tasks. It examines the reasoning processes of these reading comprehension models and improves sentence relevance in traditional unsupervised retrieval algorithms. [Methods] We proposed a new unsupervised retrieval model ISR, which integrated modules of Pearson correlation coefficient, GloVe word embeddings, and IDF weighting. The ISR model conducted fine-grained retrieval of inference sentences through multi-round iterations. [Results] The proposed model’s P, R, and F1 metrics were 2.4%, 1.8%, and 2.1% higher than the MSSwQ model on the MultiRC dataset. Its P, R, and F1 metrics were 4.8%, 2.6%, and 3.7% higher than the MSSwQ on the HotPotQA dataset. [Limitations] There might be excessive matching issues due to the model’s retrieval matching mechanism. [Conclusions] The proposed model improves the accuracy of retrieval inference sentences, which can be effectively applied to the question-answering tasks.

Key wordsReading Comprehension Question-Answering      Unsupervised Framework      Multi-Round Iterations      Question-Answering Inference      Inference Sentence Retrieval     
Received: 12 February 2023      Published: 04 May 2023
ZTFLH:  TP393  
  G250  
Fund:National Natural Science Foundation of China(61972059);National Science Foundation for Post-Doctoral Scientists of China(2021M692368)
Corresponding Authors: Ying Wenhao,ORCID:0000-0001-5992-5444,E-mail:ywh@cslg.edu.cn。   

Cite this article:

Zhou Changshun, Ying Wenhao, Zhong Shan, Gong Shengrong. Multi-Round Iterative Retrieval Algorithm for Parsing Question-Answering Process. Data Analysis and Knowledge Discovery, 2024, 8(3): 120-131.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2023.0092     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2024/V8/I3/120

The Basic Process of ISR Model
数据集类型 跳数 数量
MultiRC数据集 2 075
HotPotQA数据集 2 11 943
3 4 354
4 1 219
5 179
Number of Questions in the Dataset
模型 MultiRC数据集 HotPotQA数据集
P/% R/% F1/% 2跳 3跳 4跳 5跳
P/% R/% F1/% P/% R/% F1/% P/% R/% F1/% P/% R/% F1/%
BM25 43.8 61.2 51.0 60.1 74.9 66.7 64.3 72.2 68.0 71.0 67.6 69.3 78.2 60.1 68.0
AutoROCC 48.2 68.2 56.4 53.2 83.2 64.9 68.9 78.9 73.6 78.0 71.2 74.4 80.9 60.4 69.1
MSS 55.7 58.6 57.1 66.1 75.6 70.5 74.3 72.5 73.4 82.3 68.7 74.9 77.2 50.8 61.3
MSSwQ 64.1 62.4 63.2 73.2 86.7 79.4 80.4 77.6 79.0 85.5 72.9 78.7 86.1 55.4 67.4
ISR(本文) 66.5 64.2 65.3 76.7 88.1 82.0 85.9 79.8 82.7 91.1 75.6 82.6 90.6 59.4 71.8
Results of Inference Sentence Retrieval Experiment
模块 P/% R/% F1/%
词嵌入 Word2Vec 65.0 62.4 63.5
GloVe.50维度 66.1 61.9 63.9
GloVe.100维度 65.9 62.9 64.4
GloVe.300维度 66.2 63.8 65.0
加权方法 -IDF 66.0 63.2 64.6
TF-IDF 63.2 57.6 60.3
文本匹配算法 余弦相似度 65.7 63.3 64.5
欧几里得距离 65.7 63.3 64.5
ISR(本文) 66.5 64.2 65.3
Results of Control Variables of Main Modules
Verifying the Validity of Inference Sentences Using MRPC Task
Using Cross Entropy Loss to Predict the Answer
参数 参数描述 参数值
learn_rate 学习率 1e-5/5e-5
batch_size 数据批处理量 16/32
hidden_units 隐藏单元数 1 024
dropout 神经单元丢弃率 0.1
train_epochs 训练迭代数 4/10
Setting of experimental parameters
模型 MultiRC数据集推理句
m a c r o F 1/% m i c r o F 1/%
Multee(GloVe) 71.3 68.3
Multee(ELMo) 73.0 69.6
RS 73.1 70.5
BERTBASE[M] 74.8 72.2
RoBERTa[M] 75.5 73.6
BERTLARGE[M] 77.5 74.1
Experimental Results of Inference Sentence Judgment in MultiRC Dataset
模型 HotPotQA数据集推理句
F1/% EM/%
RoBERTa[H] 69.8 57.4
BERTBASE[H] 70.1 58.9
XLNet 73.2 62.3
BERTLARGE[H] 75.4 65.7
Experimental Results of Inference Sentence Judgment in HotPotQA Dataset
Error Example in Hard Matching Mechanism
[1] 黄星宇. 基于ALBERT的医疗问答系统设计与实现[D]. 成都: 电子科技大学, 2022.
[1] (Huang Xingyu. Design and Implementation of Medical Question Answering System Based on ALBERT[D]. Chengdu: University of Electronic Science and Technology of China, 2022.)
[2] 范亦涵. 面向税务的智能问答系统的设计与实现[D]. 济南: 山东大学, 2019.
[2] (Fan Yihan. The Design and Implementation of Intelligent Question Answering System for Taxation[D]. Jinan: Shandong University, 2019.)
[3] 黄薇屹. 基于知识图谱的深度法律内容问答模型[D]. 深圳: 中国科学院深圳先进技术研究院, 2020.
[3] (Huang Weiyi. Deep Neural Networks for Legal Question Answering Based on Knowledge Graph[D]. Shenzhen: Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, 2020.)
[4] 胡月, 周光有. 基于Graph Transformer的知识库问题生成[J]. 中文信息学报, 2022, 36(2): 111-120.
[4] (Hu Yue, Zhou Guangyou. Question Generation from Knowledge Base with Graph Transformer[J]. Journal of Chinese Information Processing, 2022, 36(2): 111-120.)
[5] 顾迎捷, 桂小林, 李德福, 等. 基于神经网络的机器阅读理解综述[J]. 软件学报, 2020, 31(7): 2095-2126.
[5] (Gu Yingjie, Gui Xiaolin, Li Defu, et al. Survey of Machine Reading Comprehension Based on Neural Network[J]. Journal of Software, 2020, 31(7): 2095-2126.)
[6] 王保成, 刘利军, 黄青松. 面向中文医疗问答网站的相似问题检索研究[J]. 中文信息学报, 2022, 36(6): 135-145.
[6] (Wang Baocheng, Liu Lijun, Huang Qingsong. Retrieval of Similar Questions from Chinese Medical Question Answering Website[J]. Journal of Chinese Information Processing, 2022, 36(6): 135-145.)
[7] 赵芸, 刘德喜, 万常选, 等. 检索式自动问答研究综述[J]. 计算机学报, 2021, 44(6): 1214-1232.
[7] (Zhao Yun, Liu Dexi, Wan Changxuan, et al. Retrieval-Based Automatic Question Answer: A Literature Survey[J]. Chinese Journal of Computers, 2021, 44(6): 1214-1232.)
[8] 杨姗姗, 姜丽芬, 孙华志, 等. 基于时间卷积网络的多项选择机器阅读理解[J]. 计算机工程, 2020, 46(11): 97-103.
doi: 10.19678/j.issn.1000-3428.0055628
[8] (Yang Shanshan, Jiang Lifen, Sun Huazhi, et al. Multiple Choice Machine Reading Comprehension Based on Temporal Convolutional Network[J]. Computer Engineering, 2020, 46(11): 97-103.)
doi: 10.19678/j.issn.1000-3428.0055628
[9] Yang Z L, Qi P, Zhang S Z, et al. HotpotQA: A Dataset for Diverse, Explainable Multi-Hop Question Answering[C]// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2018: 2369-2380.
[10] Reddy S, Chen D Q, Manning C D. CoQA: A Conversational Question Answering Challenge[C]// Proceedings of the Transactions of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2019: 249-266.
[11] Inoue N, Stenetorp P, Inui K. R4C: A Benchmark for Evaluating RC Systems to Get the Right Answer for the Right Reason[C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 6740-6750.
[12] Ho X, Nguyen A K D, Sugawara S, et al. Constructing a Multi-Hop QA Dataset for Comprehensive Evaluation of Reasoning Steps[C]// Proceedings of the 28th International Conference on Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 6609-6625.
[13] Saxena A, Tripathi A, Talukdar P. Improving Multi-Hop Question Answering over Knowledge Graphs Using Knowledge Base Embeddings[C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 4498-4507.
[14] Lin B Y, Chen X Y, Chen J, et al. KagNet: Knowledge-Aware Graph Networks for Commonsense Reasoning[C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2019: 2829-2839.
[15] Yasunaga M, Ren H, Bosselut A, et al. QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering[C]// Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2021: 535-546.
[16] Das R, Dhuliawala S, Zaheer M, et al. Go for a Walk and Arrive at the Answer: Reasoning over Paths in Knowledge Bases Using Reinforcement Learning[C]// Proceedings of the 6th International Conference on Learning Representations. Washington: ICLR, 2018.
[17] Feng Y L, Chen X Y, Lin B Y, et al. Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question Answering[C]// Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2020: 1295-1309.
[18] Wang P F, Peng N Y, Ilievski F, et al. Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering[C]// Findings of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 4129-4140.
[19] Yadav V, Bethard S, Surdeanu M. Unsupervised Alignment-Based Iterative Evidence Retrieval for Multi-Hop Question Answering[C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 4514-4525.
[20] Sydorova A, Poerner N, Roth B. Interpretable Question Answering on Knowledge Bases and Text[C]// Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2019: 4943-4951.
[21] Chen D Q, Fisch A, Weston J, et al. Reading Wikipedia to Answer Open-Domain Questions[C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2017: 1870-1879.
[22] Pan L M, Chen W H, Xiong W H, et al. Unsupervised Multi-Hop Question Answering by Question Generation[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2021: 5866-5880.
[23] Pennington J, Socher R, Manning C D. GloVe: Global Vectors for Word Representation[C]// Proceedings of the Conference on Empirical Methods in Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2014: 1532-1543.
[24] 杨泉, 孙玉泉. 基于《同义词词林》深度的词义相似度计算研究[J]. 计算机工程与应用, 2020, 56(17): 48-54.
doi: 10.3778/j.issn.1002-8331.2001-0249
[24] (Yang Quan, Sun Yuquan. Research on Semantic Similarity Calculation Based on Depth of CiLin[J]. Computer Engineering and Applications, 2020, 56(17): 48-54.)
doi: 10.3778/j.issn.1002-8331.2001-0249
[25] 张仰森, 王胜, 魏文杰, 等. 融合语义信息与问题关键信息的多阶段注意力答案选取模型[J]. 计算机学报, 2021, 44(3): 491-507.
[25] (Zhang Yangsen, Wang Sheng, Wei Wenjie, et al. An Answer Selection Model Based on Multi-Stage Attention Mechanism with Combination of Semantic Information and Key Information of the Question[J]. Chinese Journal of Computers, 2021, 44(3): 491-507.)
[26] Vikas Y, Steven B, Mihai S. Quick and (not so) Dirty: Unsupervised Selection of Justification Sentences for Multi-Hop Question Answering[C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2019: 2578-2589.
[27] Cui Y M, Liu T, Che W X, et al. ExpMRC: Explainability Evaluation for Machine Reading Comprehension[J]. Heliyon, 2022, 8(4): e09290.
doi: 10.1016/j.heliyon.2022.e09290
[28] Wang A, Singh A, Michael J, et al. GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding[C]// Proceedings of the EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP. Stroudsburg: Association for Computational Linguistics, 2018: 353-355.
[29] Trivedi H, Kwon H, Khot T, et al. Repurposing Entailment for Multi-Hop Question Answering Tasks[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2019: 2948-2958.
[30] Sun K, Yu D, Yu D, et al. Improving Machine Reading Comprehension with General Reading Strategies[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2019: 2633-2643.
[31] Devlin J, Chang M W, Lee K, et al. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2019: 4171-4186.
[32] Liu Y H, Ott M, Goyal N, et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach[OL]. arXiv Preprint, arXiv:1907.11692.
[33] Yang Z L, Dai Z H, Yang Y M, et al. XLNet: Generalized Autoregressive Pretraining for Language Understanding[C]// Proceedings of the 32nd Conference on Neural Information Processing Systems. Cambridge: MIT Press, 2019: 5753-5763.
[1] Ni Liang, Wu Peng, Zhou Xueqing. Topic Detecting on Multimodal News Data Based on Deep Learning[J]. 数据分析与知识发现, 2024, 8(3): 85-97.
[2] Wang Xiwei, Wang Qiuyue, Cai Hongtian. Research on Recognition Model and Recognition Effect of Network Public Opinion Visual Emotion under Multi-Dimensional Attention Mechanism[J]. 数据分析与知识发现, 2024, 8(3): 156-167.
[3] Zhang Xiongtao, Zhu Na, Guo Yuhui. A Survey on Session-Based Recommendation Methods with Graph Neural Network[J]. 数据分析与知识发现, 2024, 8(2): 1-16.
[4] Chen Zhuo, Jiang Xixi, Zhang Xiaojuan. Influence of Network Structure Changes on Co-word Network Link Prediction[J]. 数据分析与知识发现, 2024, 8(2): 114-130.
[5] Xiang Shuxuan, Cao Yujie, Mao Jin. Computing Patent Similarity Based on Hierarchical Feature of Claims[J]. 数据分析与知识发现, 2024, 8(2): 33-43.
[6] Shi Shuiqian, Jin Jing, Shen Gengyu, Wang Baojia, Ren Ni. Chinese Named Entity Disambiguation Based on Multivariate Similarity Fusion[J]. 数据分析与知识发现, 2024, 8(2): 56-64.
[7] He Yu, Zhang Xiaodong, Zheng Xin. Constructing Patent Knowledge Graph with SpERT-Aggcn Model[J]. 数据分析与知识发现, 2024, 8(1): 146-156.
[8] Liu Tianchang, Wang Lei, Zhu Qinghua. Predicting User Churn of Smart Home-based Care Services Based on SHAP Interpretation[J]. 数据分析与知识发现, 2024, 8(1): 40-54.
[9] Zhao Yiming, Chen Zhan, Zhang Fan. Identifying User Satisfaction Levels and Evolution Patterns in Exploratory Search[J]. 数据分析与知识发现, 2024, 8(1): 90-103.
[10] Liu Rui, Chen Ye. Dynamic Movie Recommendation Considering Long-Term and Short-Term Interest and Its Evolution[J]. 数据分析与知识发现, 2024, 8(1): 80-89.
[11] Liu Tong, Ren Xinru, Yin Jinhui, Ni Weijian. Knowledge Distillation with Few Labeled Samples[J]. 数据分析与知识发现, 2024, 8(1): 104-113.
[12] Gao Guangliang, Li Yazhou, Yuan Ming, Wang Qun. Community Detection Algorithm Base on Node and Edge Analysis[J]. 数据分析与知识发现, 2023, 7(11): 114-124.
[13] Zeng Ziming, Zhang Yu. Rumor Detection of Public Health Emergencies Based on Data Augmentation and Multi-Task Learning[J]. 数据分析与知识发现, 2023, 7(11): 56-67.
[14] Liu Shuai, Fu Lifang. Identifying Fake News with External Knowledge and User Interaction Features[J]. 数据分析与知识发现, 2023, 7(11): 79-87.
[15] Lin Zhe, Chen Pinghua. Analyzing Text Sentiments Based on Patch Attention and Involution[J]. 数据分析与知识发现, 2023, 7(11): 37-45.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn