Different Issues in the Design and Implementation of a Rule Based Grammar for the Surface Syntactic Disambiguation of Basque

Jose Mari Arriola Egurrola
UPV/EHU University of the Basque Country


Published in: Proceedings of the Workshop on “Constraint Grammar - methods, tools and applications” at NODALIDA 2015, May 11-13, 2015, Institute of the Lithuanian Language, Vilnius, Lithuania

Linköping Electronic Conference Proceedings 113:1, pp. 1-9

NEALT Proceedings Series 24:1, pp. 1-9


Published: 2015-06-17

ISBN: 978-91-7519-037-2

ISSN: 1650-3686 (print), 1650-3740 (online)


This paper presents the results of a set of preliminary experiments that reuse some of the existing modules for the surface syntactic processing of Basque in order to improve surface syntactic disambiguation. The general idea is to reuse these modules at different stages of processing and to find the best order in which to apply them. The paper aims to introduce a strategy for the surface syntactic disambiguation of Basque based on rule-based grammars, and it describes the results of evaluating this strategy on a sample corpus.




