Improving RNN-based Answer Selection for Morphologically Rich Languages

Investor logo

Warning

This publication doesn't include Faculty of Sports Studies. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

MEDVEĎ Marek HORÁK Aleš SABOL Radoslav

Year of publication 2020
Type Article in Proceedings
Conference Proceedings of the 12th International Conference on Agents and Artificial Intelligence
MU Faculty or unit

Faculty of Informatics

Citation
Doi http://dx.doi.org/10.5220/0008979206440651
Keywords Question Answering; Question Classification; Answer Classification; Czech; Simple Question Answering Database; SQAD
Description Question answering systems have improved greatly during the last five years by employing architectures of deep neural networks such as attentive recurrent networks or transformer-based networks with pretrained con- textual information. In this paper, we present the results and detailed analysis of experiments with the largest question answering benchmark dataset for the Czech language. The best results evaluated in the text reach the accuracy of 72 %, which is a 4 % improvement to the previous best result. We also introduce the newest version of the Czech Question Answering benchmark dataset SQAD 3.0, which was substantially extended to more than 13,000 question-answer pairs, and we report the first answer selection results on this dataset which indicate that the size of the training data is important for the task.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.

More info