Johan Bos
Center for Language and Cognition, University of Groningen, The Netherlands
Malvina Nissim
Center for Language and Cognition, University of Groningen, The Netherlands
Download articlePublished in: Proceedings of the 20th Nordic Conference of Computational Linguistics, NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania
Linköping Electronic Conference Proceedings 109:32, p. 251-255
NEALT Proceedings Series 23:32, p. 251-255
Published: 2015-05-06
ISBN: 978-91-7519-098-3
ISSN: 1650-3686 (print), 1650-3740 (online)
Can relations described by English noun-noun compounds be adequately captured by prepositions? We attempt to answer this question in a data-driven way, using gamification to annotate a set of about a thousand noun-noun compound examples. Annotators could make a choice out of five prepositions generated with the help of paraphrases found in the Google n-gram corpus. We show that there is substantial agreement among the players of our linguistic annotation game, and that their answers differ in about 50\% of raw frequency counts of the Google n-gram corpus. Prepositions can be used to describe the majority of the implicit relations present in noun-noun compounds, but not all relations are captured by natural prepositions and some compounds are not easy to paraphrase with the use of a preposition.
Valerio Basile, Johan Bos, Kilian Evang, and Noortje Venhuizen. 2012. A platform for collaborative semantic annotation. In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pages 92–96, Avignon, France.
Corina Dima, Verena Henrich, Erhard Hinrichs, and Christina Hoppermann. 2014. How to Tell a Schneemann from a Milchmann: An Annotation Scheme for Compound-Internal Relations. In Calzolari et al., editor, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik, Iceland, may. European Language Resources Association (ELRA).
Roxana Girju. 2009. The syntax and semantics of prepositions in the task of automatic interpretation of nominal phrases and compounds: A cross-linguistic study. Computational Linguistics, 35(2):185–228.
Mark Lauer. 1995. Corpus statistics meet the noun compound: Some empirical results. In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pages 47–54, Cambridge,
Massachusetts, USA, June. Association for Computational Linguistics.
Judith Levi. 1978. The Syntax and Semantics of Complex Nominals. Academic Press.
Guido Minnen, John Carroll, and Darren Pearce. 2001. Applied morphological processing of English. Journal of Natural Language Engineering, 7(3):207–223.
Preslav Ivanov Nakov. 2007. Using the Web as an Implicit Training Set: Application to Noun Compound Syntax and Semantics. Ph.D. thesis, University of California, Berkeley.
Noortje Venhuizen, Valerio Basile, Kilian Evang, and Johan Bos. 2013. Gamification for Word Sense Labeling. In Proceedings of the 10th International Conference on Computational Semantics (IWCS 2013) – Short Papers, pages 397–403, Potsdam, Germany.