CLARIN Concept Registry: The New Semantic Registry

Ineke Schuurman
KU Leuven, Belgium / Utrecht University, The Netherlands

Menzo Windhouwer
Meertens Institute, Amsterdam, The Netherlands

Oddrun Ohren
National Library of Norway

Daniel Zeman
Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic

Published in: Selected Papers from the CLARIN Annual Conference 2015, October 14–16, 2015, Wroclaw, Poland

Linköping Electronic Conference Proceedings 123:5, pp. 62-70

NEALT Proceedings Series 28:5, pp. 62-70

Published: 2016-04-11

ISBN: 978-91-7685-765-6

ISSN: 1650-3686 (print), 1650-3740 (online)


The CLARIN Concept Registry (CCR, clarin.eu/conceptregistry) is the place in the CLARIN Infrastructure where the common, shared semantics of concepts, linguistic and otherwise, are defined. This is important for achieving semantic interoperability and for overcoming, to a degree, the diversity of data structures, in both metadata and linguistic resources, encountered within the infrastructure. In the past, CLARIN used the ISOcat registry for these purposes; the new registry has replaced it because ISOcat turned out to have serious drawbacks for use in the CLARIN community. The main difference between the two semantic registries is that the CCR is a concept registry, whereas ISOcat is a data category registry. In this paper we describe why the decision to switch to a concept registry was made. We also describe the other main characteristics of the new (Open)SKOS-based registry, as well as the management procedures used to prevent a renewed proliferation of entries, as occurred with ISOcat.



