Conference article

Polysemy, underspecification, and aspects - Questions of lumping or splitting in the construction of Swedish FrameNet

Karin Friberg Heppin
Språkbanken, Department of Swedish, University of Gothenburg, Sweden

Dana Dannélls
Språkbanken, Department of Swedish, University of Gothenburg, Sweden

Published in: Proceedings of the Workshop on Semantic resources and Semantic Annotation for Natural Language Processing and the Digital Humanities at NODALIDA 2015, Vilnius, 11th May, 2015

Linköping Electronic Conference Proceedings 112:3, p. 12–20

NEALT Proceedings Series 27:3, p. 12–20

Published: 2015-05-06

ISBN: 978-91-7519-049-5

ISSN: 1650-3686 (print), 1650-3740 (online)

Abstract

Work on a lexical resource such as Swedish FrameNet (SweFN) rests on assumptions drawn from linguistic theory, and these assumptions set the methodological direction. That direction often has to be revised when unforeseen problems arise. One assumption made early in the development of SweFN was that each lexical entry in the reference lexicon, SALDO, would evoke only one semantic frame in SweFN. If a lexical entry evoked more than one frame, this was taken to imply more than one sense, which in turn required a new entry in the lexicon. As work progressed, this inclination towards splitting, in the perpetual discussion between lumpers and splitters (Kilgarriff, 1999), proved increasingly untenable. This paper gives an account of the problems encountered and suggests solutions to the polysemy issues that force a choice between lumping and splitting.

Keywords

polysemy; semantic frames; lexical entries; lexical units; lumping and splitting; frame semantics; Swedish FrameNet

References

Malin Ahlberg, Lars Borin, Markus Forsberg, Martin Hammarstedt, Leif-Jöran Olsson, Olof Olsson, Johan Roxendal, and Jonatan Uppström. 2013. Korp and Karp – a bestiary of language resources: the research infrastructure of Språkbanken. In Proceedings of the 19th Nordic Conference of Computational Linguistics, pages 429–433. NEALT.

Sture Allén, Daniel Berg, Sture Berg, Martin Gellerstam, Louise Holmer, Ann-Kristin Hult, Susanne Lindstrand, Sven Lövfors, Sven-Göran Malmgren, Christian Sjögreen, Emma Sköldberg, Lennart Tegner, and Maria Toporowska Gronostaj, editors. 2009. Svensk ordbok utgiven av Svenska Akademien. 1-2.

Héctor Martínez Alonso, Bolette Sandford Pedersen, and Núria Bel. 2013. Annotation of regular polysemy and underspecification. In Proceedings from the 51st annual meeting in Association for Computational Linguistics, pages 725–730. ACL.

J. Apresjan. 1974. Regular polysemy. Linguistics, 142:5–32.

Lars Borin and Markus Forsberg. 2009. All in the family: A comparison of SALDO and WordNet. In Kristiina Jokinen and Eckhard Bick, editors, Proceedings of the 17th Nordic Conference of Computational Linguistics (NODALIDA 2009), NEALT Proceedings Series, Vol. 4 (2009), Odense, Denmark.

Lars Borin, Dana Dannélls, Markus Forsberg, Dimitrios Kokkinakis, and Maria Toporowska Gronostaj. 2010. The past meets the present in Swedish FrameNet++. In 14th EURALEX International Congress, pages 269–281, Leeuwarden. EURALEX.

Lars Borin, Markus Forsberg, and Lennart Lönngren. 2013. SALDO: a touch of yin to WordNet’s yang. Language Resources and Evaluation, 47(4):1191–1211.

Dipanjan Das, Desai Chen, André F. T. Martins, Nathan Schneider, and Noah A. Smith. 2013. Frame-Semantic Parsing. Computational Linguistics, 40(1):9–56.

Christiane Fellbaum. 1998. WordNet: An electronic lexical database. MIT Press.

Charles J. Fillmore and Collin Baker. 2009. A frames approach to semantic analysis. The Oxford Handbook of Linguistic Analysis, pages 313–340.

Charles J. Fillmore, Christopher R. Johnson, and Miriam R.L. Petruck. 2003. Background to FrameNet. International Journal of Lexicography, 16(3):235–250.

Charles J. Fillmore. 1982. Frame semantics, pages 111–137. Hanshin Publishing Co., Seoul, South Korea.

Karin Friberg Heppin and Maria Toporowska Gronostaj. 2014. Exploiting FrameNet for Swedish: Mismatch? Constructions and Frames, 6(1):52–72.

Patrick Hanks. 2013. Lexical Analysis: Norms and Exploitations. MIT Press, Cambridge, Mass.

Richard Johansson and Luis Nieto Piña. 2015. Combining relational and distributional knowledge for word sense disambiguation. In Proceedings of the 20th Nordic Conference of Computational Linguistics, Vilnius, Lithuania.

Adam Kilgarriff. 1999. I don’t believe in word senses. Computers and the Humanities, 31(2):91–113.

Beth Levin. 2015. Semantics and pragmatics of argument alternations. The Annual Review of Linguistics, 1:63–83.

Bolette S. Pedersen, Sanni Nimb, and Anna Braasch. 2010. Merging specialist taxonomies and folk taxonomies in wordnets – A case study of plants, animals and foods in the Danish WordNet. In Proc. of the Seventh International Conference on Language Resources and Evaluation, Valletta, Malta. ELRA.

Josef Ruppenhofer, Michael Ellsworth, Miriam R. L. Petruck, Christopher R. Johnson, and Jan Scheffczyk. 2010. FrameNet II: Extended Theory and Practice. ICSI, Berkeley.

Anna Wierzbicka. 1984. Cups and mugs: Lexicography and conceptual analysis. Australian Journal of Linguistics, 4(2):205–255.
