Konferensartikel

The Swedish Winogender Dataset

Saga Hansson

Konstantinos Mavromatakis

Yvonne Adesam

Gerlof Bouma

Dana Dannélls

Ladda ner artikel

Ingår i: Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), May 31-June 2, 2021.

Linköping Electronic Conference Proceedings 178:52, s. 452-459

Visa mer +

Publicerad: 2021-05-21

ISBN: 978-91-7929-614-8

ISSN: 1650-3686 (tryckt), 1650-3740 (online)

Abstract

We introduce the SweWinogender test set, a diagnostic dataset to measure gender bias in coreference resolution. It is modelled after the English Winogender benchmark, and is released with reference statistics on the distribution of men and women between occupations and the association between gender and occupation in modern corpus material. The paper discusses the design and creation of the dataset, and presents a small investigation of the supplementary statistics.

Nyckelord

coreference, gender bias, diagnostics, Swedish

Referenser

Inga referenser tillgängliga

Citeringar i Crossref