Stefanie Dipper
Department of Linguistics, Ruhr University Bochum, Germany
Simone Schultz-Balluff
German Department, Ruhr University Bochum, Germany
Download articlePublished in: Proceedings of the workshop on computational historical linguistics at NODALIDA 2013; May 22-24; 2013; Oslo; Norway. NEALT Proceedings Series 18
Linköping Electronic Conference Proceedings 87:3, p. 27-42
NEALT Proceedings Series 18:3, p. 27-42
Published: 2013-05-17
ISBN: 978-91-7519-587-2
ISSN: 1650-3686 (print), 1650-3740 (online)
This paper presents ongoing work in the Anselm project at Ruhr-University Bochum; which deals with a parallel corpus of historical language data. We first present our corpus; which consists of about 50 versions of the medieval text Interrogatio Sancti Anselmi de Passione Domini (‘Questions by Saint Anselm about the Lord’s Passion’); written in different dialects from Early New High German; Middle Low German; and Middle Dutch. The versions were transcribed in a diplomatic way; and are currently being normalized and annotated with lemma and part of speech. In addition; the versions are being aligned at different levels of granularity (paragraph; sentence; phrase; word). We describe two use cases that profit from the annotations: one use case from historical lexical semantics; the other from historical syntax. We finally sketch further application scenarios from the historico-cultural domain of Digital Humanities.