Konferensartikel

Detecting and Processing Figurative Language in Discourse

Caroline Sporleder
Universität Trier, Germany

Ladda ner artikel

Ingår i: Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013); May 22-24; 2013; Oslo University; Norway. NEALT Proceedings Series 16

Linköping Electronic Conference Proceedings 85:2, s. 3-3

NEALT Proceedings Series 16:2, p. 3-3

Visa mer +

Publicerad: 2013-05-17

ISBN: 978-91-7519-589-6

ISSN: 1650-3686 (tryckt), 1650-3740 (online)

Abstract

Figurative language poses a serious challenge to NLP systems. The use of idiomatic and metaphoric expressions is not only extremely widespread in natural language; many figurative expressions; in particular idioms; also behave idiosyncratically. These idiosyncrasies are not restricted to a non-compositional meaning but often also extend to syntactic properties; selectional preferences etc. To deal appropriately with such expressions; NLP tools need to detect figurative language and assign the correct analyses to non-literal expressions. While there has been quite a bit of work on determining the general ‘idiomaticity’ of an expression (type-based approaches); this only solves part of the problem as many expressions; such as break the ice or play with fire; can also have a literal; perfectly compositional meaning (e.g. break the ice on the duck pond). Such expressions have to be disambiguated in context (token-based approaches). Token-based approaches have received increased attention recently. In this talk; I will present an unsupervised method for token-based idiom detection. The method exploits the fact that well-formed texts exhibit lexical cohesion; i.e. words are semantically related to other words in the context. I will show how cohesion can be modelled and how the cohesive structure can be used to distinguish literal and idiomatic usages and even detect newly coined figurative expressions.

Nyckelord

Discourse; Figurative Language; Token-Based Idiom Detection

Referenser

Inga referenser tillgängliga

Citeringar i Crossref