Konferensartikel

May I Check Again? — A simple but efficient way to generate and use contextual dictionaries for Named Entity Recognition. Application to French Legal Texts

Valentin Barriere
Cour de Cassation, Palais de Justice, 5 quai de l’horloge, 75001 Paris, France

Amaury Fouret
Cour de Cassation, Palais de Justice, 5 quai de l’horloge, 75001 Paris, France

Ladda ner artikel

Ingår i: Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland

Linköping Electronic Conference Proceedings 167:36, s. 327--332

NEALT Proceedings Series 42:36, s. 327--332

Visa mer +

Publicerad: 2019-10-02

ISBN: 978-91-7929-995-8

ISSN: 1650-3686 (tryckt), 1650-3740 (online)

Abstract

In this paper we present a new method to learn a model robust to typos for a Named Entity Recognition task. Our improvement over existing methods helps the model to take into account the context of the sentence inside a justice decision in order to recognize an entity with a typo. We used state-of-the-art models and enriched the last layer of the neural network with high-level information linked with the potential of the word to be a certain type of entity. More precisely, we utilized the similarities between the word and the potential entity candidates the tagged sentence context. The experiments on a dataset of french justice decisions show a reduction of the relative F1-score error of 32\%, upgrading the score obtained with the most competitive fine-tuned state-of-the-art system from 94.85\% to 96.52\%.

Nyckelord

Named Entity Recognition BLSTM-CRF Legal Texts

Referenser

Inga referenser tillgängliga

Citeringar i Crossref