A Bridge from EUDAT’s B2DROP cloud service to CLARIN’s Language Resource Switchboard

Claus Zinn
Seminar für Sprachwissenschaft, Universität Tübingen, Germany

Part of: Selected papers from the CLARIN Annual Conference 2017, Budapest, 18–20 September 2017

Linköping Electronic Conference Proceedings 147:4, pp. 36-45

Published: 2018-05-16

ISBN: 978-91-7685-273-6

ISSN: 1650-3686 (print), 1650-3740 (online)


The Language Resource Switchboard is becoming a central pillar of the CLARIN infrastructure, as it helps researchers connect resources with tools that can process them. Language resources can be found in different places, and ideally the switchboard is available wherever they reside. Resources located on users’ desktop computers can be uploaded to the switchboard, and resources found in CLARIN’s Virtual Language Observatory can be sent to the switchboard with a single click. Until now, however, the switchboard was only indirectly accessible for resources stored in the cloud: users had to download a resource from their cloud storage to their desktop device and then upload it again to the switchboard to find applicable tools, which is tedious. In this paper, we describe how we linked EUDAT’s B2DROP cloud service to the switchboard, giving users the capability to launch the switchboard directly with a resource from their B2DROP account. We also describe the use of B2DROP as intermediate file storage for the switchboard’s back-end. The reported work establishes a link to another infrastructure and hence facilitates and promotes the provision of complementary services to CLARIN members. We believe the cooperation between CLARIN and EUDAT to be of mutual benefit. On the one hand, our bridge makes EUDAT’s generic cloud storage service more attractive to CLARIN members, encouraging them to use B2DROP rather than another cloud provider. On the other hand, it encourages EUDAT users to try out and profit from the CLARIN tool space, which in turn will challenge tool providers to cope with increased demand and, potentially, new user requirements.


Cloud-based access to the Language Resource Switchboard; CLARIN and EUDAT; CLARIN infrastructure


