Jacobo Rouces
Språkbanken Text, University of Gothenburg, Sweden
Lars Borin
Språkbanken Text, University of Gothenburg, Sweden
Nina Tahmasebi
Språkbanken Text, University of Gothenburg, Sweden
Stian Rødven Eide
Språkbanken Text, University of Gothenburg, Sweden
Download articlePublished in: Selected papers from the CLARIN Annual Conference 2018, Pisa, 8-10 October 2018
Linköping Electronic Conference Proceedings 159:18, p. 177-187
Published: 2019-05-28
ISBN: 978-91-7685-034-3
ISSN: 1650-3686 (print), 1650-3740 (online)
The field of sentiment analysis or opinion mining consists in automatically classifying text according to the positive or negative sentiment expressed in it, and has become very popular in the last decade. However, most data and software resources are built for English and a few other languages. In this paper we compare and test different corpus-based and lexicon-based methods for creating a sentiment lexicon. We then manually curate the results of the best performing method. The result, SenSALDO, is a comprehensive sentiment lexicon for Swedish containing 7,618 word senses as well as a full-form version of this lexicon containing 65,953 items (text word forms). SenSALDO is freely available as a research tool in the SWE-CLARIN toolbox under an open-source CC-BY license.