Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland
Mareike Hartmann: University of Copenhagen, Denmark; Barbara Plank: ITU Copenhagen, Denmark
Linköping Electronic Conference Proceedings
NEALT Proceedings Series
Linköping University Electronic Press, Linköpings universitet

Welcome to the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa 2019) held at the University of Turku in the beautiful city of Turku in Finland, on September 30-October 2, 2019. The aim of NoDaLiDa is to bring together researchers in the Nordic countries interested in any aspect related to human language and speech technologies. It is a great honor for me to serve as the general chair of NoDaLiDa 2019.

NoDaLiDa has a very long tradition. It stems from a working group initiative led by Sture Allén, Kolbjörn Heggstad, Baldur Jönsson, Viljo Kohonen and Bente Maegaard (as the preface of the oldest workshop proceedings in the ACL Anthology reveals). They organized the first NoDaLiDa (“Nordiska datalingvistikdagar”) in Gothenburg on October 10-11, 1977. In 2006, NEALT, the Northern European Association for Language Technology, was founded. We are very honored to bring this biennial conference to Turku this fall, 42 years after the first edition.

We solicited three different types of papers (long, short, demo papers) and received 78 valid submissions. In total, we accepted 49 papers, which will be presented as 34 oral presentations, 10 posters and 5 demo papers. A total of 4 submissions were withdrawn in the process. Each paper was reviewed by three experts. We are extremely grateful to the Programme Committee members for their detailed and helpful reviews. Overall, there are 10 oral sessions with talks and one poster session organized into themes over the two days, starting each day with a keynote talk.

We would like to thank our two keynote speakers for traveling to Turku and sharing their work. Marie-Catherine de Marneffe from Ohio State University will talk about "Do you know that there’s still a chance? Identifying speaker commitment for natural language understanding". Grzegorz Chrupala from Tilburg University will talk about "Investigating neural representations of speech and language". We are also very grateful to Fred Karlsson, who agreed to share his insights into the Finnish language in the traditional NoDaLiDa language tutorial.

The conference is preceded by 5 workshops on a diverse set of topics: Deep Learning for Natural Language Processing; NLP for Computer-Assisted Language Learning; Constraint Grammar: Methods, Tools and Applications; NLP and Pseudonymisation; and Financial Narrative Processing. This shows the breadth of topics that can be found in language technology these days, and we are extremely happy and grateful to the workshop organizers for complementing the main program this way.

There will be two social events: a reception, sponsored by the City of Turku and held at the Old Town Hall in Turku, and a conference dinner, held in the King’s Hall of Turku Castle. Two fantastic evenings are awaiting.

I would like to thank the entire team that made NoDaLiDa 2019 possible in the first place. First of all, I would like to thank Beáta Megyesi for inviting me to take up this exciting (and admittedly at times demanding) role, and for all her valuable input regarding NEALT and previous editions of NoDaLiDa. I thank Jörg Tiedemann for the smooth transition from the previous NoDaLiDa edition and for his input and work as program chair, and the program chair committee, Jurgita Kapočiūtė-Dzikienė, Hrafn Loftsson, Patrizia Paggio, and Erik Velldal, for working hard on putting the program together. I am particularly grateful to Jörg Tiedemann, Jurgita Kapočiūtė-Dzikienė, Kairit Sirts and Patrizia Paggio for leading the reviewing process. Special thanks go to the workshop chairs, Richard Johansson and Kairit Sirts, who have done an invaluable job leading the workshop selection and organization. A big thanks also to Miryam de Lhoneux for her work as social media chair, and to Mareike Hartmann for leading the publication efforts that led to this volume, as well as for coordinating the workshop proceedings. Thank you!

Finally, my ultimate thanks go to the amazing local organization committee and team. Thank you, Filip Ginter and Jenna Kanerva. With your infinite support and proactive engagement in organizing NoDaLiDa, you are the ones who make NoDaLiDa possible and surely an unforgettable experience. Thanks also to the entire local team (with special thanks to Hans Moen for help with the program): Li-Hsin Chang, Rami Ilo, Suwisa Kaewphan, Kai Hakala, Roosa Kyllönen, Veronika Laippala, Akseli Leino, Juhani Luotolahti, Farrokh Mehryary, Hans Moen, Maria Pyykönen, Sampo Pyysalo, Samuel Rönnqvist, Antti Saloranta, Antti Virtanen, Sanna Volanen.

NoDaLiDa 2019 has received financial support from our generous sponsors, whom we would also like to thank here.

This is the usual place for the greetings from the local organizers, but as we set out to write them, it turned out that Barbara had already said it all. So we really only need to add one thing: huge thanks to Barbara for all the hard work she put into NoDaLiDa. We can only wonder where you found the time for all this. We hope the Turku edition of NoDaLiDa will be a success; at least we tried our best to make it so. In two weeks we will know.

Filip, Jenna, and the local team

Danke - kiitos!

We very much hope that you will have an enjoyable and inspiring time at NoDaLiDa 2019 in Turku.

Barbara Plank


September 2019

José Carlos Rosales Nuñez, Djamé Seddah, Guillaume Wisniewski
Comparison between NMT and PBSMT Performance for Translating Noisy User-Generated Content

Prasanth Kolachina, Aarne Ranta
Bootstrapping UD treebanks for Delexicalized Parsing

Barbara Plank, Sigrid Klerke
Lexical Resources for Low-Resource PoS Tagging in Neural Times

Magnus Sahlgren, Fredrik Olsson
Gender Bias in Pretrained Swedish Embeddings

Astrid van Aggelen, Antske Fokkens, Laura Hollink, Jacco van Ossenbruggen
A larger-scale evaluation resource of terms and their shift direction for diachronic lexical semantics

Yuri Bizzoni, Marius Mosbach, Dietrich Klakow, Stefania Degaetano-Ortlieb
Some steps towards the generation of diachronic WordNets

Karolína Horenovská
An evaluation of Czech word embeddings

Adam Ek, Jean-Philippe Bernardy, Shalom Lappin
Language Modeling with Syntactic and Semantic Representation for Sentence Acceptability Predictions

Hazem Al Saied, Marie Candito, Mathieu Constant
Comparing linear and neural models for competitive MWE identification

Silvio Ricardo Cordeiro, Marie Candito
Syntax-based identification of light-verb constructions

Marina Santini, Benjamin Danielsson, Arne Jönsson
Comparing the Performance of Feature Representations for the Categorization of the Easy-to-Read Variety vs Standard Language

Michele Persiani, Thomas Hellström
Unsupervised Inference of Object Affordance from Text Corpora

Petter Mæhlum, Jeremy Barnes, Lilja Øvrelid, Erik Velldal
Annotating evaluative sentences for sentiment analysis: a dataset for Norwegian

Hans Moen, Laura-Maria Peltonen, Henry Suhonen, Hanna-Maria Matinolli, Riitta Mieronkoski, Kirsi Telen, Kirsi Terho, Tapio Salakoski, Sanna Salanterä
An Unsupervised Query Rewriting Approach Using N-gram Co-occurrence Statistics to Find Similar Phrases in Large Text Corpora

Starkaður Barkarson, Steinþór Steingrímsson
Compiling and Filtering ParIce: An English-Icelandic Parallel Corpus

Kristín Bjarnadóttir, Kristín Ingibjörg Hlynsdóttir, Steinþór Steingrímsson
DIM: The Database of Icelandic Morphology

Sardana Ivanova, Anisia Katinskaia, Roman Yangarber
Tools for supporting language learning for Sakha

Ann Lillieström, Koen Claessen, Nicholas Smallbone
Inferring morphological rules from small examples using 0/1 linear programming

Jeremy Barnes, Samia Touileb, Lilja Øvrelid, Erik Velldal
Lexicon information in neural sentiment analysis: a multi-task learning approach

Mickel Hoang, Oskar Alija Bihorac, Jacobo Rouces
Aspect-Based Sentiment Analysis using BERT

Rasmus Lehmann, Leon Derczynski
Political Stance in Danish

Anders Edelbo Lillie, Emil Refsgaard Middelboe, Leon Derczynski
Joint Rumour Stance and Veracity Prediction

Bjarte Johansen
Named-Entity Recognition for Norwegian

Jue Hou, Maximilian W. Koppatz, José María Hoya Quecedo, Roman Yangarber
Projecting named entity recognizers without annotated or parallel corpora

Jenna Kanerva, Samuel Rönnqvist, Riina Kekki, Tapio Salakoski, Filip Ginter
Template-free Data-to-Text Generation of Finnish Sports News

Eva Pettersson, Beáta Megyesi
Matching Keys and Encrypted Manuscripts

Evgeniia Rykova, Stefan Werner
Perceptual and acoustic analysis of voice similarities between parents and young children

Özge Alaçam
Enhancing Natural Language Understanding through Cross-Modal Interaction: Meaning Recovery from Acoustically Noisy Speech

Aarne Talman, Antti Suni, Hande Celikkanat, Sofoklis Kakouros, Jörg Tiedemann, Martti Vainio
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations

Veronika Laippala, Roosa Kyllönen, Jesse Egbert, Douglas Biber, Sampo Pyysalo
Toward Multilingual Identification of Online Registers

Jean-Philippe Bernardy, Stergios Chatzikyriakidis
A Wide-Coverage Symbolic Natural Language Inference System

Ilmari Kylliäinen, Miikka Silfverberg
Ensembles of Neural Morphological Inflection Models

Svanhvít Ingólfsdóttir, Hrafn Loftsson, Jón Daðason, Kristín Bjarnadóttir
Nefnir: A high accuracy lemmatizer for Icelandic

Joakim Åkerström, Adel Daoud, Richard Johansson
Natural Language Processing in Policy Evaluation: Extracting Policy Conditions from IMF Loan Agreements

David Alfter, Johannes Graën
Interconnecting lexical resources and word alignment: How do learners get on with particle verbs?

Valentin Barriere, Amaury Fouret
May I Check Again? — A simple but efficient way to generate and use contextual dictionaries for Named Entity Recognition. Application to French Legal Texts

Jean-Philippe Bernardy, Rasmus Blanck, Stergios Chatzikyriakidis, Shalom Lappin, Aleksandre Maskharashvili
Predicates as Boxes in Bayesian Semantics for Natural Language

Leon Strømberg Derczynski, Alex Speed Kjeldsen
Bornholmsk Natural Language Processing: Resources and Tools

Jeff Ens, Mika Hämäläinen, Jack Rueter, Philippe Pasquier
Morphosyntactic Disambiguation in an Endangered Language Setting

Andre Kåsen, Kristin Hagen, Anders Nøklestad, Joel Priestley
Tagging a Norwegian Dialect Corpus

Andreas Kirkedal, Barbara Plank, Leon Derczynski, Natalie Schluter
The Lacunae of Danish Natural Language Processing

Svanhvít Ingólfsdóttir, Sigurjón Þorsteinsson, Hrafn Loftsson
Towards High Accuracy Named Entity Recognition for Icelandic

Barbara Plank
Neural Cross-Lingual Transfer and Limited Annotated Data for Named Entity Recognition in Danish

Sidsel Boldsen, Manex Agirrezabal
The Seemingly (Un)systematic Linking Element in Danish

David Alfter, Therese Lindström Tiedemann, Elena Volodina
LEGATO: A flexible lexicographic annotation tool

Mikko Aulamo, Jörg Tiedemann
The OPUS Resource Repository: An Open Package for Creating Parallel Corpora and Machine Translation Services

Iben Nyholm Debess, Sandra Saxov Lamhauge, Peter Juel Henrichsen
Garnishing a phonetic dictionary for ASR intake

Marcus Klang, Pierre Nugues
Docria: Processing and Storing Linguistic Data with Wikipedia

Daniel Varab, Natalie Schluter
UniParse: A universal graph-based parsing toolkit
