Please wait a minute...
Advanced Search
数据分析与知识发现  2024, Vol. 8 Issue (3): 120-131     https://doi.org/10.11925/infotech.2096-3467.2023.0092
  研究论文 本期目录 | 过刊浏览 | 高级检索 |
一种用于解析问答推理过程的多轮迭代检索算法研究*
周长顺1,2,应文豪2(),钟珊2,龚声蓉1,2
1苏州大学计算机科学与技术学院 苏州 215008
2常熟理工学院计算机科学与工程学院 常熟 215500
Multi-Round Iterative Retrieval Algorithm for Parsing Question-Answering Process
Zhou Changshun1,2,Ying Wenhao2(),Zhong Shan2,Gong Shengrong1,2
1School of Computer Science and Technology, Soochow University, Suzhou 215008, China
2School of Computer Science and Engineering, Changshu Institute of Technology, Changshu 215500, China
全文: PDF (1051 KB)   HTML ( 4
输出: BibTeX | EndNote (RIS)      
摘要 

【目的】针对当前阅读理解类问答推理过程中传统无监督检索方式句子关联性不足的问题,设计一种检索模型,研究问答任务的推理过程,探求问答任务的可解释性。【方法】提出一种新型无监督检索模型ISR,模型中融合皮尔逊相关系数、GloVe词嵌入、IDF加权等主要模块,ISR模型通过多轮迭代方式细粒度检索推理句。【结果】对比模型MSSwQ,ISR模型在MultiRC数据集上进行实验,PRF1指标平均高出2.4、1.8、2.1个百分点;在HotPotQA数据集上进行实验,PRF1指标平均高出4.8、2.6、3.7个百分点。【局限】检索采用硬匹配,可能存在过分匹配的情形。【结论】本文模型能够提升检索推理句的准确性,检索的推理句能够有效应用于问答任务的推理过程。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
周长顺
应文豪
钟珊
龚声蓉
关键词 阅读理解问答无监督框架多轮迭代问答推理推理句检索    
Abstract

[Objective] This paper designs a retrieval model to explore the interpretability of question-answering tasks. It examines the reasoning processes of these reading comprehension models and improves sentence relevance in traditional unsupervised retrieval algorithms. [Methods] We proposed a new unsupervised retrieval model ISR, which integrated modules of Pearson correlation coefficient, GloVe word embeddings, and IDF weighting. The ISR model conducted fine-grained retrieval of inference sentences through multi-round iterations. [Results] The proposed model’s P, R, and F1 metrics were 2.4%, 1.8%, and 2.1% higher than the MSSwQ model on the MultiRC dataset. Its P, R, and F1 metrics were 4.8%, 2.6%, and 3.7% higher than the MSSwQ on the HotPotQA dataset. [Limitations] There might be excessive matching issues due to the model’s retrieval matching mechanism. [Conclusions] The proposed model improves the accuracy of retrieval inference sentences, which can be effectively applied to the question-answering tasks.

Key wordsReading Comprehension Question-Answering    Unsupervised Framework    Multi-Round Iterations    Question-Answering Inference    Inference Sentence Retrieval
收稿日期: 2023-02-12      出版日期: 2023-05-04
ZTFLH:  TP393  
  G250  
基金资助:* 国家自然科学基金项目(61972059);中国博士后科学基金项目(2021M692368)
通讯作者: 应文豪,ORCID:0000-0001-5992-5444,E-mail:ywh@cslg.edu.cn。   
引用本文:   
周长顺, 应文豪, 钟珊, 龚声蓉. 一种用于解析问答推理过程的多轮迭代检索算法研究*[J]. 数据分析与知识发现, 2024, 8(3): 120-131.
Zhou Changshun, Ying Wenhao, Zhong Shan, Gong Shengrong. Multi-Round Iterative Retrieval Algorithm for Parsing Question-Answering Process. Data Analysis and Knowledge Discovery, 2024, 8(3): 120-131.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2023.0092      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2024/V8/I3/120
Fig.1  ISR模型基本流程
数据集类型 跳数 数量
MultiRC数据集 2 075
HotPotQA数据集 2 11 943
3 4 354
4 1 219
5 179
Table 1  数据集中问题的数量
模型 MultiRC数据集 HotPotQA数据集
P/% R/% F1/% 2跳 3跳 4跳 5跳
P/% R/% F1/% P/% R/% F1/% P/% R/% F1/% P/% R/% F1/%
BM25 43.8 61.2 51.0 60.1 74.9 66.7 64.3 72.2 68.0 71.0 67.6 69.3 78.2 60.1 68.0
AutoROCC 48.2 68.2 56.4 53.2 83.2 64.9 68.9 78.9 73.6 78.0 71.2 74.4 80.9 60.4 69.1
MSS 55.7 58.6 57.1 66.1 75.6 70.5 74.3 72.5 73.4 82.3 68.7 74.9 77.2 50.8 61.3
MSSwQ 64.1 62.4 63.2 73.2 86.7 79.4 80.4 77.6 79.0 85.5 72.9 78.7 86.1 55.4 67.4
ISR(本文) 66.5 64.2 65.3 76.7 88.1 82.0 85.9 79.8 82.7 91.1 75.6 82.6 90.6 59.4 71.8
Table 2  推理句检索实验结果
模块 P/% R/% F1/%
词嵌入 Word2Vec 65.0 62.4 63.5
GloVe.50维度 66.1 61.9 63.9
GloVe.100维度 65.9 62.9 64.4
GloVe.300维度 66.2 63.8 65.0
加权方法 -IDF 66.0 63.2 64.6
TF-IDF 63.2 57.6 60.3
文本匹配算法 余弦相似度 65.7 63.3 64.5
欧几里得距离 65.7 63.3 64.5
ISR(本文) 66.5 64.2 65.3
Table 3  主要模块控制变量结果
Fig.2  使用MRPC任务验证推理句有效性
Fig.3  使用交叉熵损失预测答案
参数 参数描述 参数值
learn_rate 学习率 1e-5/5e-5
batch_size 数据批处理量 16/32
hidden_units 隐藏单元数 1 024
dropout 神经单元丢弃率 0.1
train_epochs 训练迭代数 4/10
Table 4  实验参数
模型 MultiRC数据集推理句
m a c r o F 1/% m i c r o F 1/%
Multee(GloVe) 71.3 68.3
Multee(ELMo) 73.0 69.6
RS 73.1 70.5
BERTBASE[M] 74.8 72.2
RoBERTa[M] 75.5 73.6
BERTLARGE[M] 77.5 74.1
Table 5  MultiRC数据集推理句判断实验结果
模型 HotPotQA数据集推理句
F1/% EM/%
RoBERTa[H] 69.8 57.4
BERTBASE[H] 70.1 58.9
XLNet 73.2 62.3
BERTLARGE[H] 75.4 65.7
Table 6  HotPotQA数据集推理句判断实验结果
Fig.4  硬匹配模式下错误示例
[1] 黄星宇. 基于ALBERT的医疗问答系统设计与实现[D]. 成都: 电子科技大学, 2022.
[1] (Huang Xingyu. Design and Implementation of Medical Question Answering System Based on ALBERT[D]. Chengdu: University of Electronic Science and Technology of China, 2022.)
[2] 范亦涵. 面向税务的智能问答系统的设计与实现[D]. 济南: 山东大学, 2019.
[2] (Fan Yihan. The Design and Implementation of Intelligent Question Answering System for Taxation[D]. Jinan: Shandong University, 2019.)
[3] 黄薇屹. 基于知识图谱的深度法律内容问答模型[D]. 深圳: 中国科学院深圳先进技术研究院, 2020.
[3] (Huang Weiyi. Deep Neural Networks for Legal Question Answering Based on Knowledge Graph[D]. Shenzhen: Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, 2020.)
[4] 胡月, 周光有. 基于Graph Transformer的知识库问题生成[J]. 中文信息学报, 2022, 36(2): 111-120.
[4] (Hu Yue, Zhou Guangyou. Question Generation from Knowledge Base with Graph Transformer[J]. Journal of Chinese Information Processing, 2022, 36(2): 111-120.)
[5] 顾迎捷, 桂小林, 李德福, 等. 基于神经网络的机器阅读理解综述[J]. 软件学报, 2020, 31(7): 2095-2126.
[5] (Gu Yingjie, Gui Xiaolin, Li Defu, et al. Survey of Machine Reading Comprehension Based on Neural Network[J]. Journal of Software, 2020, 31(7): 2095-2126.)
[6] 王保成, 刘利军, 黄青松. 面向中文医疗问答网站的相似问题检索研究[J]. 中文信息学报, 2022, 36(6): 135-145.
[6] (Wang Baocheng, Liu Lijun, Huang Qingsong. Retrieval of Similar Questions from Chinese Medical Question Answering Website[J]. Journal of Chinese Information Processing, 2022, 36(6): 135-145.)
[7] 赵芸, 刘德喜, 万常选, 等. 检索式自动问答研究综述[J]. 计算机学报, 2021, 44(6): 1214-1232.
[7] (Zhao Yun, Liu Dexi, Wan Changxuan, et al. Retrieval-Based Automatic Question Answer: A Literature Survey[J]. Chinese Journal of Computers, 2021, 44(6): 1214-1232.)
[8] 杨姗姗, 姜丽芬, 孙华志, 等. 基于时间卷积网络的多项选择机器阅读理解[J]. 计算机工程, 2020, 46(11): 97-103.
doi: 10.19678/j.issn.1000-3428.0055628
[8] (Yang Shanshan, Jiang Lifen, Sun Huazhi, et al. Multiple Choice Machine Reading Comprehension Based on Temporal Convolutional Network[J]. Computer Engineering, 2020, 46(11): 97-103.)
doi: 10.19678/j.issn.1000-3428.0055628
[9] Yang Z L, Qi P, Zhang S Z, et al. HotpotQA: A Dataset for Diverse, Explainable Multi-Hop Question Answering[C]// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2018: 2369-2380.
[10] Reddy S, Chen D Q, Manning C D. CoQA: A Conversational Question Answering Challenge[C]// Proceedings of the Transactions of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2019: 249-266.
[11] Inoue N, Stenetorp P, Inui K. R4C: A Benchmark for Evaluating RC Systems to Get the Right Answer for the Right Reason[C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 6740-6750.
[12] Ho X, Nguyen A K D, Sugawara S, et al. Constructing a Multi-Hop QA Dataset for Comprehensive Evaluation of Reasoning Steps[C]// Proceedings of the 28th International Conference on Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 6609-6625.
[13] Saxena A, Tripathi A, Talukdar P. Improving Multi-Hop Question Answering over Knowledge Graphs Using Knowledge Base Embeddings[C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 4498-4507.
[14] Lin B Y, Chen X Y, Chen J, et al. KagNet: Knowledge-Aware Graph Networks for Commonsense Reasoning[C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2019: 2829-2839.
[15] Yasunaga M, Ren H, Bosselut A, et al. QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering[C]// Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2021: 535-546.
[16] Das R, Dhuliawala S, Zaheer M, et al. Go for a Walk and Arrive at the Answer: Reasoning over Paths in Knowledge Bases Using Reinforcement Learning[C]// Proceedings of the 6th International Conference on Learning Representations. Washington: ICLR, 2018.
[17] Feng Y L, Chen X Y, Lin B Y, et al. Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question Answering[C]// Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2020: 1295-1309.
[18] Wang P F, Peng N Y, Ilievski F, et al. Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering[C]// Findings of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 4129-4140.
[19] Yadav V, Bethard S, Surdeanu M. Unsupervised Alignment-Based Iterative Evidence Retrieval for Multi-Hop Question Answering[C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 4514-4525.
[20] Sydorova A, Poerner N, Roth B. Interpretable Question Answering on Knowledge Bases and Text[C]// Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2019: 4943-4951.
[21] Chen D Q, Fisch A, Weston J, et al. Reading Wikipedia to Answer Open-Domain Questions[C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2017: 1870-1879.
[22] Pan L M, Chen W H, Xiong W H, et al. Unsupervised Multi-Hop Question Answering by Question Generation[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2021: 5866-5880.
[23] Pennington J, Socher R, Manning C D. GloVe: Global Vectors for Word Representation[C]// Proceedings of the Conference on Empirical Methods in Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2014: 1532-1543.
[24] 杨泉, 孙玉泉. 基于《同义词词林》深度的词义相似度计算研究[J]. 计算机工程与应用, 2020, 56(17): 48-54.
doi: 10.3778/j.issn.1002-8331.2001-0249
[24] (Yang Quan, Sun Yuquan. Research on Semantic Similarity Calculation Based on Depth of CiLin[J]. Computer Engineering and Applications, 2020, 56(17): 48-54.)
doi: 10.3778/j.issn.1002-8331.2001-0249
[25] 张仰森, 王胜, 魏文杰, 等. 融合语义信息与问题关键信息的多阶段注意力答案选取模型[J]. 计算机学报, 2021, 44(3): 491-507.
[25] (Zhang Yangsen, Wang Sheng, Wei Wenjie, et al. An Answer Selection Model Based on Multi-Stage Attention Mechanism with Combination of Semantic Information and Key Information of the Question[J]. Chinese Journal of Computers, 2021, 44(3): 491-507.)
[26] Vikas Y, Steven B, Mihai S. Quick and (not so) Dirty: Unsupervised Selection of Justification Sentences for Multi-Hop Question Answering[C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2019: 2578-2589.
[27] Cui Y M, Liu T, Che W X, et al. ExpMRC: Explainability Evaluation for Machine Reading Comprehension[J]. Heliyon, 2022, 8(4): e09290.
doi: 10.1016/j.heliyon.2022.e09290
[28] Wang A, Singh A, Michael J, et al. GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding[C]// Proceedings of the EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP. Stroudsburg: Association for Computational Linguistics, 2018: 353-355.
[29] Trivedi H, Kwon H, Khot T, et al. Repurposing Entailment for Multi-Hop Question Answering Tasks[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2019: 2948-2958.
[30] Sun K, Yu D, Yu D, et al. Improving Machine Reading Comprehension with General Reading Strategies[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2019: 2633-2643.
[31] Devlin J, Chang M W, Lee K, et al. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2019: 4171-4186.
[32] Liu Y H, Ott M, Goyal N, et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach[OL]. arXiv Preprint, arXiv:1907.11692.
[33] Yang Z L, Dai Z H, Yang Y M, et al. XLNet: Generalized Autoregressive Pretraining for Language Understanding[C]// Proceedings of the 32nd Conference on Neural Information Processing Systems. Cambridge: MIT Press, 2019: 5753-5763.
[1] 倪亮, 吴鹏, 周雪晴. 基于深度学习的多模态新闻数据主题发现研究*[J]. 数据分析与知识发现, 2024, 8(3): 85-97.
[2] 王晰巍, 王秋月, 蔡宏天. 多维度注意力机制下网络舆情视觉情感识别模型及识别效果研究*[J]. 数据分析与知识发现, 2024, 8(3): 156-167.
[3] 张雄涛, 祝娜, 郭玉慧. 基于图神经网络的会话推荐方法综述*[J]. 数据分析与知识发现, 2024, 8(2): 1-16.
[4] 陈卓, 蒋茜茜, 张晓娟. 网络结构变动对共词网络链路预测效果的影响研究*[J]. 数据分析与知识发现, 2024, 8(2): 114-130.
[5] 向姝璇, 操玉杰, 毛进. 基于权利要求层级特征的专利相似度计算方法研究*[J]. 数据分析与知识发现, 2024, 8(2): 33-43.
[6] 石水倩, 金晶, 沈耕宇, 王宝佳, 任妮. 基于多元相似度融合的中文命名实体消歧方法*[J]. 数据分析与知识发现, 2024, 8(2): 56-64.
[7] 何玉, 张晓冬, 郑鑫. 基于SpERT-Aggcn模型的专利知识图谱构建研究*[J]. 数据分析与知识发现, 2024, 8(1): 146-156.
[8] 刘天畅, 王雷, 朱庆华. 基于SHAP解释方法的智慧居家养老服务平台用户流失预测研究*[J]. 数据分析与知识发现, 2024, 8(1): 40-54.
[9] 赵一鸣, 陈湛, 张帆. 探寻式搜索中用户满意度状态识别与演化规律研究*[J]. 数据分析与知识发现, 2024, 8(1): 90-103.
[10] 刘瑞, 陈烨. 考虑长短期兴趣及其演化的电影个性化动态推荐研究*[J]. 数据分析与知识发现, 2024, 8(1): 80-89.
[11] 刘彤, 任欣儒, 尹金辉, 倪维健. 一种全面的少标签样本情形下的知识蒸馏方法*[J]. 数据分析与知识发现, 2024, 8(1): 104-113.
[12] 高光亮, 李亚洲, 袁明, 王群. 融入节点和边缘重要性分析的社区发现算法*[J]. 数据分析与知识发现, 2023, 7(11): 114-124.
[13] 曾子明, 张瑜. 基于数据增强和多任务学习的突发公共卫生事件谣言识别研究*[J]. 数据分析与知识发现, 2023, 7(11): 56-67.
[14] 刘帅, 傅丽芳. 融合外部知识和用户交互特征的虚假新闻检测[J]. 数据分析与知识发现, 2023, 7(11): 79-87.
[15] 林哲, 陈平华. 基于块注意力机制和Involution的文本情感分析模型*[J]. 数据分析与知识发现, 2023, 7(11): 37-45.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn