Konferensartikel

Udapi: Universal API for Universal Dependencies

Martin Popel
Charles University, Faculty of Mathematics and Physics, Prague, Czech Republic

Zdenek Žabokrtský
Charles University, Faculty of Mathematics and Physics, Prague, Czech Republic

Martin Vojtek
Charles University, Faculty of Mathematics and Physics, Prague, Czech Republic

Ladda ner artikel

Ingår i: Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies, 22 May, Gothenburg Sweden

Linköping Electronic Conference Proceedings 135:12, s. 96-101

NEALT Proceedings Series 31:12, p. 96-101

Visa mer +

Publicerad: 2017-05-29

ISBN: 978-91-7685-501-0

ISSN: 1650-3686 (tryckt), 1650-3740 (online)

Abstract

Udapi is an open-source framework providing an application programming interface (API) for processing Universal Dependencies data. Udapi is available in Python, Perl and Java. It is suitable both for full-fledged applications and fast prototyping: visualization of dependency trees, format conversions, querying, editing and transformations, validity tests, dependency parsing, evaluation etc.

Nyckelord

Inga nyckelord är tillgängliga

Referenser

[Bejcek et al.2013] Eduard Bejcek, Eva Hajicová, Jan Hajic, Pavlína Jínová, Václava Kettnerová, Veronika Kolárová, Marie Mikulová, Jirí Mírovský, Anna Nedoluzhko, Jarmila Panevová, Lucie Poláková, Magda Ševcíková, Jan Štepánek, and Šárka Zikánová. 2013. Prague Dependency Treebank 3.0. LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics, Charles University in Prague.

[Bird et al.2009] Steven Bird, Ewan Klein, and Edward Loper. 2009. Natural Language Processing with Python. O’Reilly Media, Inc., 1st edition.

[Bojar et al.2016] Ond?rej Bojar, Ondrej Dušek, Tom Kocmi, Jindrich Libovický, Michal Novák, Martin Popel, Roman Sudarikov, and Dušan Variš. 2016. CzEng 1.6: Enlarged Czech-English Parallel Corpus with Processing Tools Dockered. In Petr Sojka, Aleš Horák, Ivan Kopecek, and Karel Pala, editors, Text, Speech, and Dialogue: 19th International Conference, TSD 2016, number 9924 in Lecture Notes in Computer Science, pages 231–238, Cham / Heidelberg / New York / Dordrecht / London. Masaryk University, Springer International Publishing.

[Cunningham et al.2011] Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan, Niraj Aswani, Ian Roberts, Genevieve Gorrell, Adam Funk, Angus Roberts, Danica Damljanovic, Thomas Heitz, Mark A. Greenwood, Horacio Saggion, Johann Petrak, Yaoyong Li, and Wim Peters. 2011. Text Processing with GATE (Version 6).

[Nivre and Nilsson2005] Joakim Nivre and Jens Nilsson. 2005. Pseudo-projective dependency parsing. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL ’05, pages 99–106, Stroudsburg, PA, USA. Association for Computational Linguistics.

[Popel and Žabokrtský2010] Martin Popel and Zdenek Žabokrtský. 2010. TectoMT: modular NLP framework. Advances in Natural Language Processing, pages 293–304.

[Stenetorp et al.2012] Pontus Stenetorp, Sampo Pyysalo, Goran Topic, Tomoko Ohta, Sophia Ananiadou, and Jun’ichi Tsujii. 2012. brat: a web-based tool for nlp-assisted text annotation. In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 102–107. Association for Computational Linguistics.

[Straka et al.2016] Milan Straka, Jan Hajic, and Jana Straková. 2016. UDPipe: trainable pipeline for processing CoNLL-U files performing tokenization, morphological analysis, pos tagging and parsing. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16), Paris, France, May. European Language Resources Association (ELRA).

Citeringar i Crossref