Automatic conversion of colloquial Finnish to standard Finnish

Inari Listenmaa
Chalmers University of Technology, Sweden

Francis M. Tyers
HSL-fakultehta, UiT Norgga árktalaš universitehta, Norway

Published in: Proceedings of the 20th Nordic Conference of Computational Linguistics, NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania

Linköping Electronic Conference Proceedings 109:27, pp. 219-223

NEALT Proceedings Series 23:27, pp. 219-223

Published: 2015-05-06

ISBN: 978-91-7519-098-3

ISSN: 1650-3686 (print), 1650-3740 (online)


This paper presents an unsupervised method for converting between colloquial Finnish and standard Finnish. The method relies on a small number of orthographical rules combined with a large language model of standard Finnish, which ranks the possible conversions. Aside from this contribution, the paper also presents an evaluation corpus consisting of aligned sentences in colloquial Finnish, orthographically standardised colloquial Finnish and standard Finnish. The method we present outperforms the baseline of simply treating colloquial Finnish as standard Finnish and offers promise for the adaptation of language-technology tools created for standard Finnish to colloquial Finnish. To this end, the paper also presents preliminary results that show promise for using normalisation in the machine translation task.
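The general idea of generating candidate conversions with rules and ranking them with a language model of the standard variety can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the token-level rewrite table `RULES`, the three-sentence toy corpus `STANDARD_CORPUS`, and the add-one smoothed bigram model are all hypothetical stand-ins for the paper's orthographical rules and its large language model.

```python
import itertools
import math
from collections import Counter

# Hypothetical rewrite table (not the paper's rule set): each colloquial
# token maps to possible standard forms. The original token is always
# kept as a candidate too.
RULES = {
    "mä": ["minä"],
    "sä": ["sinä"],
    "oon": ["olen"],
}

# Toy stand-in for a large corpus of standard Finnish.
STANDARD_CORPUS = [
    ["minä", "olen", "täällä"],
    ["sinä", "olet", "siellä"],
    ["minä", "olen", "kotona"],
]


def train(corpus):
    """Collect unigram and bigram counts, with a sentence-start marker."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in corpus:
        padded = ["<s>"] + sentence
        unigrams.update(padded)
        bigrams.update(zip(padded, padded[1:]))
    return unigrams, bigrams


def log_prob(tokens, unigrams, bigrams):
    """Add-one smoothed bigram log-probability of a token sequence."""
    vocab_size = len(unigrams)
    padded = ["<s>"] + tokens
    return sum(
        math.log((bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size))
        for prev, cur in zip(padded, padded[1:])
    )


def normalise(sentence, unigrams, bigrams):
    """Generate all rule-based candidates and return the best-scoring one."""
    options = [[tok] + RULES.get(tok, []) for tok in sentence.split()]
    candidates = (list(c) for c in itertools.product(*options))
    best = max(candidates, key=lambda c: log_prob(c, unigrams, bigrams))
    return " ".join(best)


UNIGRAMS, BIGRAMS = train(STANDARD_CORPUS)
print(normalise("mä oon kotona", UNIGRAMS, BIGRAMS))
```

Because every input token is retained as a candidate, a sentence that is already in standard Finnish can pass through unchanged. The number of candidates grows multiplicatively with the number of rewritable tokens, so a realistic system would likely need a beam or lattice search rather than this exhaustive enumeration.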




