Liesbeth Augustinus
Centre for Computational Linguistics, University of Leuven, Belgium
Vincent Vandeghinste
Centre for Computational Linguistics, University of Leuven, Belgium
Ineke Schuurman
Centre for Computational Linguistics, University of Leuven, Belgium
Frank Van Eynde
Centre for Computational Linguistics, University of Leuven, Belgium
Download articlePublished in: Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013); May 22-24; 2013; Oslo University; Norway. NEALT Proceedings Series 16
Linköping Electronic Conference Proceedings 85:38, p. 423-428
NEALT Proceedings Series 16:38, p. 423-428
Published: 2013-05-17
ISBN: 978-91-7519-589-6
ISSN: 1650-3686 (print), 1650-3740 (online)
Although several syntactically annotated corpora (or treebanks) exist for Dutch; they are seldomly used for descriptive linguistic research because there are no easy-to-use exploitation tools available. This demonstration paper describes GrETEL; a linguistic search engine (
Dutch; treebank; querying; example-based
Augustinus; L.; Vandeghinste; V.; and Van Eynde; F. (2012). Example-Based Treebank Querying. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012); Istanbul.
Hellmann; S.; Unbehauen; J.; Chiarcos; C.; and Ngomo; A.-C. N. (2010). The TIGER Corpus Navigator. In Proceedings of TLT-9; pages 91–102; Tartu; Estonia.
Hoekstra; H.; Moortgat; M.; Renmans; B.; Schouppe; M.; Schuurman; I.; and van der Wouden; T. (2003). CGN Syntactische Annotatie. http://nederbooms.ccl.kuleuven.be/documentation/saman_ cgn.pdf.
Martens; S. (2011). Quantifying Linguistic Regularity. PhD thesis; KU Leuven; Leuven; Belgium.
Martens; S. and Vandeghinste; V. (2010). An Efficient; Generic Approach to Extracting Multi- Word Expressions from Dependency Trees. In Proceedings of the CoLing Workshop: Multiword Expressions: From Theory to Applications (MWE 2010); Beijing; China.
Oostdijk; N.; Goedertier; W.; Van Eynde; F.; Boves; L.; Martens; J.-P.; Moortgat; M.; and Baayen; H. (2002). Experiences from the Spoken Dutch Corpus Project. In Proceedings of the 3rd International Conference on Language Resources and Evaluation (LREC’02); pages 340–347; Las Palmas; Spain.
Resnik; P. and Elkiss; A. (2005). The Linguist’s Search Engine: An Overview. In Proceedings of the ACL Interactive Poster and Demonstration Sessions; pages 33–36; Ann Arbor.
Van Eynde; F. (2004). Part of Speech Tagging en Lemmatisering van het Corpus Gesproken Nederlands. www.ccl.kuleuven.be/Papers/POSmanual_febr2004.pdf.
van Noord; G. (2006). At Last Parsing Is Now Operational. In TALN 2006; pages 20–42.
van Noord; G.; Bouma; G.; Van Eynde; F.; de Kok; D.; van der Linde; J.; Schuurman; I.; Tjong Kim Sang; E.; and Vandeghinste; V. (2013). Large Scale Syntactic Annotation of Written Dutch: Lassy. In Essential Speech and Language Technology for Dutch: Resources; Tools and Applications. Springer.
van Noord; G.; Schuurman; I.; and Bouma; G. (2011). Lassy Syntactische Annotatie; Revision 19455. www.let.rug.nl/vannoord/Lassy/sa-man_lassy.pdf.