Conference article

ORTOLANG: a French Infrastructure for Open Resources and TOols for LANGuage

Jean-Marie Pierrel
University of Lorraine, CNRS, UMR ATILF, France

Christophe Parisse
INSERM, University of Paris-Ouest, Nanterre, UMR MoDyCo, France

Jérôme Blanchard
CNRS, University of Lorraine, UMR ATILF, France

Etienne Petitjean
CNRS, University of Lorraine, UMR ATILF, France

Frédéric Pierre
CNRS, University of Lorraine, UMR ATILF, France

Download article

Published in: Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26–28 October 2016, CLARIN Common Language Resources and Technology Infrastructure

Linköping Electronic Conference Proceedings 136:8, p. 102-112

Show more +

Published: 2017-05-23

ISBN: 978-91-7685-499-0

ISSN: 1650-3686 (print), 1650-3740 (online)

Abstract

ORTOLANG (Open Resources and Tools for Language: www.ortolang.fr) is a French infrastructure which aims to ensure the management, pooling and sharing, dissemination and long-term preservation of language resources such as corpora, lexicons, terminologies and language processing tools, with particular focus on the languages of France. It will be used as a technical language platform for written and oral language forms. The ORTOLANG software platform is based on a new Digital Object Repository service. By combining a Service Oriented Architecture for high level services and a Software Component Architecture for its Repository Service, the platform seeks to build a robust and reliable Digital Object Repository that provides rich functionalities and a modern interface delivering excellent performances and the best optimization strategies. Thanks to its hardware and software architecture choices, the ORTOLANG platform ensures very flexible evolution possibilities to guarantee long-time support for the hosted resources.

Keywords

Platform, Infrastructure, Resources, Corpus, Lexicons, language processing toolsng tools

References

[ATILF 2011] ATILF 2011. Corpus journalistique issu de l’Est Républicain [Corpus]. ORTOLANG (Open Resources and TOols for LANGuage) - www.ortolang.fr,
https://hdl.handle.net/11403/est_republicain/v1, https://hdl.handle.net/11403/est_republicain/v1

[ATILF2017] ATILF 2017. TCOF: Traitement de Corpus Oraux en Français [Corpus]. ORTOLANG (Open Resources and TOols for LANGuage) - www.ortolang.fr, https://hdl.handle.net/11403/tcof/v1.

[Bel and Blache 2006] Bernard Bel and Philippe Blache. 2006. Le Centre de Ressources pour la Description de l’Oral (CRDO). Travaux interdisciplinaires du Laboratoire parole et langage d’Aix-en-Provence (TIPA), vol. 25. 2006, p. 13-18.

[Durand and al. 2009] Jacques Durand, Bernard Laks & Chantal Lyche. 2009. Le projet PFC: une source de données primaires structurées. In J. Durand, B. Laks et C. Lyche (eds)(2009) Phonologie, variation et accents du français. Paris: Hermès. pp. 19-61, https://hdl.handle.net/11403/pfc/v1.

[Eshkol-Taravella et al. 2012] Eshkol-Taravella I., Baude O., Maurel D., Hriba L., Dugua C., Tellier I. 2012, Un grand corpus oral « disponible » : le corpus d’Orléans 1968-2012., in Ressources linguistiques libres, Traitement Automatique des Langues. Volume 52 – n° 3/2011, 17-46, https://hdl.handle.net/11403/eslo/v1

[Haerder and Reuter 1983] Theo Haerder, Andreas Reuter, A. (1983). "Principles of transaction-oriented database recovery". ACM Computing Surveys. 15 (4): 287

[Morgenstern and Parisse 2012] Aliyah Morgenstern and Christophe Parisse 2012. The Paris Corpus, Journal of French Language Studies, 22/01, 7–12, https://hdl.handle.net/11403/colaje/v1.1.

[Pierrel and Petitjean 2007] Jean-Marie Pierrel et Etienne Petitjean. 2007. Valorisation et exploitation scientifiques de documents numériques pour la recherche en linguistique : l’exemple du CNRTL, Actes de CIDE 2007 Congrès International sur le Document Numérique, Nancy, 2-4 juillet 2007, p13-24, Europia 2007, ISBN 978-2-909285-38-2

[Richardson and Ruby 2007] Leonard Richardson and Sam Ruby. 2007. RESTful Web Services, O’Reilly Media, 454 p.

[Schmid 1994] Helmut Schmid. 1994. Probabilistic Part-of-Speech Tagging Using Decision Trees. Proceedings of International Conference on New Methods in Language Processing, Manchester, UK.

Citations in Crossref