Article | Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland | Towards High Accuracy Named Entity Recognition for Icelandic Linköping University Electronic Press Conference Proceedings
Göm menyn

Title:
Towards High Accuracy Named Entity Recognition for Icelandic
Author:
Svanhvít Ingólfsdóttir: Department of Computer Science, Reykjavik University, Iceland Sigurjó Þorsteinsson: Department of Computer Science, Reykjavik University, Iceland Hrafn Loftsson: Department of Computer Science, Reykjavik University, Iceland
Download:
Full text (pdf)
Year:
2019
Conference:
Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland
Issue:
167
Article no.:
042
Pages:
363--369
No. of pages:
7
Publication type:
Abstract and Fulltext
Published:
2019-10-02
ISBN:
978-91-7929-995-8
Series:
Linköping Electronic Conference Proceedings
ISSN (print):
1650-3686
ISSN (online):
1650-3740
Series:
NEALT Proceedings Series
Publisher:
Linköping University Electronic Press, Linköpings universitet


Export in BibTex, RIS or text

We report on work in progress which consists of annotating an Icelandic corpus for named entities (NEs) and using it for training a named entity recognizer based on a Bidirectional Long Short-Term Memory model. Currently, we have annotated 7,538 NEs appearing in the first 200,000 tokens of a 1 million token corpus, MIMGOLD, originally developed for serving as a gold standard for part-of-speech tagging. Our best performing model, trained on this subset of MIM-GOLD, and enriched with external word embeddings, obtains an overall F1 score of 81.3% when categorizing NEs into the following four categories: persons, locations, organizations and miscellaneous. Our preliminary results are promising, especially given the fact that 80% of MIM-GOLD has not yet been used for training.

Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland

Author:
Svanhvít Ingólfsdóttir, Sigurjó Þorsteinsson, Hrafn Loftsson
Title:
Towards High Accuracy Named Entity Recognition for Icelandic
References:
No references available

Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland

Author:
Svanhvít Ingólfsdóttir, Sigurjó Þorsteinsson, Hrafn Loftsson
Title:
Towards High Accuracy Named Entity Recognition for Icelandic
Note: the following are taken directly from CrossRef
Citations:
No citations available at the moment


Responsible for this page: Peter Berkesand
Last updated: 2019-11-06