Kwabena Amponsah-Kaakyire
Daria Pylypenko
Cristina España-Bonet
Josef van Genabith
Published in: Proceedings of the First Workshop on Modelling Translation - Translatology in the Digital Age
Linköping Electronic Conference Proceedings 179:1, p. 1-7
NEALT Proceedings Series 46:1, p. 1-7
Published: 2021-05-24
ISBN: 978-91-7929-613-1
ISSN: 1650-3686 (print), 1650-3740 (online)
Translationese data is a scarce and valuable resource. Traditionally, the proceedings of the European Parliament have been used for studying translationese phenomena, since their metadata makes it possible to distinguish between original and translated texts. However, translations are not always direct, and we hypothesise that a pivot (also called "relay") language might alter the conclusions on translationese effects. In this work, we (i) isolate translations in the Europarl proceedings that were made without an intermediate language from those that might have used a pivot language, and (ii) build comparable and parallel corpora with data aligned across multiple languages, which can therefore be used for both machine translation and translation studies.