|
|
A Multi-Perspective Co-Matching Model for Machine Reading Comprehension |
Duan Jianyong1,2( ),Wei Xiaopeng1,Wang Hao1,2 |
1School of Information, North China University of Technology, Beijing 100144, China 2CNONIX National Standard Application and Promotion Laboratory, North China University of Technology, Beijing 100144, China |
|
|
Abstract [Objective] This paper proposes a model for multiple-choice reading comprehension, and then explores the impacts of question types and answer length on machine reading comprehension. [Methods] First, we used the multi-perspective matching mechanism to obtain the correlation between the articles, questions and candidate answers. Then, we multiplied the correlation and articles to create the vector representation of questions and candidate answers. Third, we extracted sentence-level and document-level features, which were used to select the correct answers. Fourth, we categorized the data based on the question types and answer length. Finally, we analyzed their impacts on the machine’s choice of correct answers. [Results] The accuracy of our model on the RACE-M, RACE-H and RACE datasets reached 72.5%, 63.1% and 66.1% respectively. [Limitations] The multi-perspective matching mechanism has four matching strategies and multiple angles, which makes the model consume a lot of memory and spend longer processing time at the interactive layer. [Conclusions] The proposed model can effectively match articles with questions and answers. The accuracy of the model is more affected by the type of question, not by the length of answer.
|
Received: 21 July 2020
Published: 17 May 2021
|
|
Fund:National Natural Science Foundation of China(61672040);National Natural Science Foundation of China(61972003) |
Corresponding Authors:
Duan Jianyong
E-mail: duanjy@ncut.edu.cn
|
[1] |
Tang M, Cai J, Zhuo H. Multi-Matching Network for Multiple Choice Reading Comprehension[C]// Proceedings of the 33rd AAAI Conference on Artificial Intelligence. 2019,33:7088-7095.
|
[2] |
Zhu H, Wei F, Qin B, et al. Hierarchical Attention Flow for Multiple-Choice Reading Comprehension[C]// Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018.
|
[3] |
Lai G, Xie Q, Liu H, et al. Race: Large-Scale Reading Comprehension Dataset from Examinations[C]// Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2017: 795-804.
|
[4] |
Hermann K M, Kocisky T, Grefenstette E, et al. Teaching Machines to Read and Comprehend[C]// Proceedings of the 28th International Conference on Neural Information Processing Systems. 2015: 1693-1701.
|
[5] |
Nguyen T, Rosenberg M, Song X, et al. MS MARCO: A Human-Generated MAchine Reading Comprehension Dataset[OL]. arXiv Preprint, arXiv: 1611. 09268.
|
[6] |
Rajpurkar P, Zhang J, Lopyrev K, et al. SQuAD: 100,000+ Questions for Machine Comprehension of Text[C]// Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. 2016: 2383-2392.
|
[7] |
Kadlec R, Schmid M, Bajgar O, et al. Text Understanding with the Attention Sum Reader Network[C]// Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. 2016: 908-918.
|
[8] |
Dhingra B, Liu H, Yang Z, et al. Gated-Attention Readers for Text Comprehension[C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2017: 1832-1846.
|
[9] |
Seo M, Kembhavi A, Farhadi A, et al. Bidirectional Attention Flow for Machine Comprehension[OL]. arXiv Preprint, arXiv: 1611. 01603.
|
[10] |
Wang W, Yang N, Wei F, et al. Gated Self-Matching Networks for Reading Comprehension and Question Answering[C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2017: 189-198.
|
[11] |
Chaturvedi A, Pandit O, Garain U. CNN for Text-Based Multiple Choice Question Answering[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018: 272-277.
|
[12] |
Wang S, Yu M, Chang S, et al. A Co-Matching Model for Multi-Choice Reading Comprehension[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018: 746-751.
|
[13] |
Wang Z, Hamza W, Florian R. Bilateral Multi-Perspective Matching for Natural Language Sentences[C]// Proceedings of the 26th International Joint Conference on Artificial Intelligence. 2017: 4144-4150.
|
[14] |
Devlin J, Chang M, Lee K, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding[C]// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics. 2019: 4171-4186.
|
[15] |
Chen D, Bolton J, Manning C D. A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task[C]// Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany. 2016: 2358-2367.
|
[16] |
Parikh S, Sai A B, Nema P, et al. ElimiNet: A Model for Eliminating Options for Reading Comprehension with Multiple Choice Questions[C]// Proceedings of the 27th International Joint Conference on Artificial Intelligence. 2018: 4272-4278.
|
[17] |
Xu Y, Liu J, Gao J, et al. Towards Human-Level Machine Reading Comprehension: Reasoning and Inference with Multiple Strategies[OL]. arXiv Preprint, arXiv: 1711. 04964.
|
[18] |
Tay Y, Tuan L A, Hui S C. Multi-range Reasoning for Machine Comprehension[OL]. arXiv Preprint, arXiv: 180.09074.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|