Conference article

Grapheme-Based Cross-Language Forced Alignment: Results with Uralic Languages

Juho Leinonen

Sami Virpioja

Mikko Kurimo

Published in: Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), May 31-June 2, 2021.

Linköping Electronic Conference Proceedings 178:36, p. 345-350

NEALT Proceedings Series 45:36, p. 345-350

Published: 2021-05-21

ISBN: 978-91-7929-614-8

ISSN: 1650-3686 (print), 1650-3740 (online)

Abstract

Forced alignment is an effective way to speed up linguistic research. However, most forced aligners are language-dependent, and under-resourced languages rarely have enough resources to train an acoustic model for an aligner. We present a new Finnish grapheme-based forced aligner and demonstrate its performance by aligning multiple Uralic languages, as well as English as an unrelated language. We show that even a simple grapheme-to-phoneme mapping created by a non-expert can yield useful word alignments.

Keywords

forced alignment, cross-language, speech recognition
