
UDLex: Towards Cross-language Subcategorization Lexicons

Giulia Rambelli
Computational Linguistics Laboratory, Department of Philology, Literature, and Linguistics, University of Pisa, Pisa, Italy

Alessandro Lenci
Computational Linguistics Laboratory, Department of Philology, Literature, and Linguistics, University of Pisa, Pisa, Italy

Thierry Poibeau
LATTICE, CNRS, École normale supérieure and Université Sorbonne nouvelle, PSL Research University and USPC, Paris, France

In: Proceedings of the Fourth International Conference on Dependency Linguistics (Depling 2017), September 18-20, 2017, Università di Pisa, Italy

Linköping Electronic Conference Proceedings 139:24, pp. 207-217

Published: 2017-09-13

ISBN: 978-91-7685-467-9

ISSN: 1650-3686 (print), 1650-3740 (online)


This paper introduces UDLex, a computational framework for the automatic extraction of argument structures for several languages. By exploiting the versatility of the Universal Dependencies annotation scheme, our system acquires subcategorization frames directly from a dependency-parsed corpus, regardless of the input language. To do so, it applies a single set of language-independent rules to detect the dependents of each verb in a sentence. We describe how the system was developed by adapting the LexIt framework (Lenci et al., 2012), originally designed to describe the argument structures of Italian predicates. We also discuss practical issues that arose when building argument structure representations for typologically different languages.
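To make the idea of language-independent extraction concrete, the following is a minimal sketch of how subcategorization frames could be read off a corpus in CoNLL-U format. It is not the UDLex implementation: the set of relations in `ARGUMENT_RELS`, the frame notation (relation labels joined with `#`), and the helper names `parse_conllu`, `extract_frames`, and `row` are assumptions made for illustration only. The same rule is applied to an English and an Italian sentence without any language-specific code.

```python
from collections import Counter

# Assumed set of UD relations counted as argument slots. The actual
# UDLex rule set is not given in the abstract; this is illustrative.
ARGUMENT_RELS = {"nsubj", "obj", "iobj", "obl", "ccomp", "xcomp"}


def parse_conllu(text):
    """Yield sentences as lists of token dicts from CoNLL-U text."""
    sentence = []
    for line in text.splitlines():
        line = line.strip()
        if not line:  # a blank line closes the current sentence
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):  # sentence-level comment
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        sentence.append({
            "id": int(cols[0]),
            "lemma": cols[2],
            "upos": cols[3],
            "head": int(cols[6]),
            "deprel": cols[7],
        })
    if sentence:
        yield sentence


def extract_frames(sentences):
    """Count (verb lemma, frame) pairs; a frame is the sorted list of
    argument relations attached to the verb, joined with '#'."""
    frames = Counter()
    for sent in sentences:
        for tok in sent:
            if tok["upos"] != "VERB":
                continue
            rels = sorted(
                dep["deprel"].split(":")[0]  # drop subtypes such as nsubj:pass
                for dep in sent
                if dep["head"] == tok["id"]
                and dep["deprel"].split(":")[0] in ARGUMENT_RELS
            )
            frames[(tok["lemma"], "#".join(rels))] += 1
    return frames


def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line, leaving unused columns as '_'."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


# Toy input: one English and one Italian sentence, hand-annotated.
SAMPLE = "\n".join([
    row(1, "She", "she", "PRON", 2, "nsubj"),
    row(2, "gives", "give", "VERB", 0, "root"),
    row(3, "him", "he", "PRON", 2, "iobj"),
    row(4, "a", "a", "DET", 5, "det"),
    row(5, "book", "book", "NOUN", 2, "obj"),
    "",
    row(1, "Maria", "Maria", "PROPN", 2, "nsubj"),
    row(2, "legge", "leggere", "VERB", 0, "root"),
    row(3, "un", "uno", "DET", 4, "det"),
    row(4, "libro", "libro", "NOUN", 2, "obj"),
])

frames = extract_frames(parse_conllu(SAMPLE))
for (lemma, frame), count in sorted(frames.items()):
    print(lemma, frame, count)
```

On this toy input the sketch yields the frame `iobj#nsubj#obj` for "give" and `nsubj#obj` for "leggere". Because the rule refers only to UD part-of-speech tags and relation labels, nothing in it depends on the language of the corpus, which is the property the abstract attributes to the Universal Dependencies scheme.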


