Article | Selected Papers from the CLARIN Annual Conference 2019 | Enriching and Increasing the Usability of Lexicographical Data for Less-Resourced Linköping University Electronic Press Conference Proceedings
Göm menyn

Title:
Enriching and Increasing the Usability of Lexicographical Data for Less-Resourced
Author:
Dirk Goldhahn: Natural Language Processing Group, University of Leipzig, Germany. Saxon Academy of Sciences and Humanities, Leipzig, Germany Thomas Eckart: Natural Language Processing Group, University of Leipzig, Germany. Saxon Academy of Sciences and Humanities, Leipzig, Germany Sonja Bosch: Department of African Languages, University of South Africa, South Africa
DOI:
https://doi.org/10.3384/ecp2020172004
Download:
Full text (pdf)
Year:
2020
Conference:
Selected Papers from the CLARIN Annual Conference 2019
Issue:
172
Article no.:
004
Pages:
23-32
No. of pages:
10
Publication type:
Abstract and Fulltext
Published:
2020-07-03
ISBN:
978-91-7929-807-4
Series:
Linköping Electronic Conference Proceedings
ISSN (print):
1650-3686
ISSN (online):
1650-3740
Publisher:
Linköping University Electronic Press, Linköpings universitet


Export in BibTex, RIS or text

This paper presents a use case for enriching lexicographical data for less-resourced languages employing the CLARIN infrastructure. Newly prepared lexicographical data sets for under-resourced Bantu languages spoken in southern regions of the African continent form the basis of the presented work. These datasets have been made digitally available using well-established standards of the Linguistic Linked Open Data (LLOD) community. To overcome the insufficient amount of freely available reference material, a crowdsourcing web portal for collecting textual data for less-resourced languages has been created and incorporated into the CLARIN infrastructure. Using this portal, the number of available text resources for the respective languages was significantly increased in a community effort. The collected content is used to enrich lexicographical data with real-world samples to increase the usability of the entire resource.

Keywords: minority languages, lesser resourced languages, use case, lexical resources, Bantu languages

Selected Papers from the CLARIN Annual Conference 2019

Author:
Dirk Goldhahn, Thomas Eckart, Sonja Bosch
Title:
Enriching and Increasing the Usability of Lexicographical Data for Less-Resourced
DOI:
10.3384/ecp2020172004
References:
No references available

Selected Papers from the CLARIN Annual Conference 2019

Author:
Dirk Goldhahn, Thomas Eckart, Sonja Bosch
Title:
Enriching and Increasing the Usability of Lexicographical Data for Less-Resourced
DOI:
https://doi.org10.3384/ecp2020172004
Note: the following are taken directly from CrossRef
Citations:
No citations available at the moment


Responsible for this page: Peter Berkesand
Last updated: 2019-11-06