Konferensartikel

Grammatical Error Generation Based on Translated Fragments

Eetu Sjöblom

Mathias Creutz

Teemu Vahtola

Ladda ner artikel

Ingår i: Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), May 31-June 2, 2021.

Linköping Electronic Conference Proceedings 178:44, s. 398-403

Visa mer +

Publicerad: 2021-05-21

ISBN: 978-91-7929-614-8

ISSN: 1650-3686 (tryckt), 1650-3740 (online)

Abstract

We perform neural machine translation of sentence fragments in order to create large amounts of training data for English grammatical error correction. Our method aims at simulating mistakes made by second language learners, and produces a wider range of non-native style language in comparison to a state-of-the-art baseline model. We carry out quantitative and qualitative evaluation. Our method is shown to outperform the baseline on data with a high proportion of errors.

Nyckelord

grammatical error correction, translation of chunks, synthetic training data, neural machine translation, learner corpora

Referenser

Inga referenser tillgängliga

Citeringar i Crossref