Konferensartikel

Talrómur: A large Icelandic TTS corpus

Atli Sigurgeirsson

Þorsteinn Gunnarsson

Gunnar Örnólfsson

Eydís Magnúsdóttir

Ragnheiður Þórhallsdóttir

Stefán Jónsson

Jón Guðnason

Ladda ner artikel

Ingår i: Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), May 31-June 2, 2021.

Linköping Electronic Conference Proceedings 178:50, s. 440-444

Visa mer +

Publicerad: 2021-05-21

ISBN: 978-91-7929-614-8

ISSN: 1650-3686 (tryckt), 1650-3740 (online)

Abstract

We present Talrómur, a large high-quality Text-To-Speech (TTS) corpus for the Icelandic language. This multi-speaker corpus contains recordings from 4 male speakers and 4 female speakers of a wide range in age and speaking style. The corpus consists of 122,417 single utterance recordings equating to approximately 213 hours of voice data. All speakers read from the same script which has a high coverage of possible Icelandic diphones. Manual analysis of 15,956 utterances indicates that the corpus has a reading mistake rate no higher than 0.25%. We additionally present results from subjective evaluations of the different voices with regards to intelligibility, likeability and trustworthiness.

Nyckelord

corpus, TTS, Icelandic

Referenser

Inga referenser tillgängliga

Citeringar i Crossref