Parliamentary Corpora in the CLARIN infrastructure

Darja Fišer
Department of Translation, Faculty of Arts, University of Ljubljana; Department of Knowledge Technologies, Jožef Stefan Institute, Slovenia

Jakob Lenardic
Department of Translation, Faculty of Arts, University of Ljubljana, Slovenia

Part of: Selected papers from the CLARIN Annual Conference 2017, Budapest, 18–20 September 2017

Linköping Electronic Conference Proceedings 147:7, pp. 75–85

Published: 2018-05-16

ISBN: 978-91-7685-273-6

ISSN: 1650-3686 (print), 1650-3740 (online)


This paper gives an overview of the parliamentary records and corpora from CLARIN countries, focusing on their availability through the CLARIN infrastructure. Based on the results of the survey, we provide a comprehensive overview of the corpora and draw up a list of recommendations for optimizing the depositing and cataloguing of the corpora in the CLARIN repositories, so that they are readily accessible to researchers from different disciplines. We also analyse the recall and precision of simple and faceted search for parliamentary corpora in the Virtual Language Observatory.


parliamentary records, parliamentary corpora, resource accessibility


