Konferensartikel

A case study on supervised classification of Swedish pseudo-coordination

Malin Ahlberg
Språkbanken, Department of Swedish, University of Gothenburg, Sweden

Peter Andersson
Språkbanken, Department of Swedish, University of Gothenburg, Sweden

Markus Forsberg
Språkbanken, Department of Swedish, University of Gothenburg, Sweden

Nina Tahmasebi
Språkbanken, Department of Swedish, University of Gothenburg, Sweden

Ladda ner artikel

Ingår i: Proceedings of the 20th Nordic Conference of Computational Linguistics, NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania

Linköping Electronic Conference Proceedings 109:5, s. 11-19

NEALT Proceedings Series 23:5, p. 11-19

Visa mer +

Publicerad: 2015-05-06

ISBN: 978-91-7519-098-3

ISSN: 1650-3686 (tryckt), 1650-3740 (online)

Abstract

We present a case study on supervised classification of Swedish pseudo-coordination (SPC). The classification is attempted on the type-level with data collected from two data sets: a blog corpus and a fiction corpus. Two small experiments were designed to evaluate the feasability of this task. The first experiment explored a classifier’s ability to discriminate pseudo-coordinations from ordinary verb coordinations, given a small labeled data set created during the experiment. The second experiment evaluated how well the classifier performed at detecting and ranking SPCs in a set of unlabeled verb coordinations, to investigate if it could be used as a semi-automatic discovery procedure to find new SPCs.

Nyckelord

Inga nyckelord är tillgängliga

Referenser

Kristian Blensenius. 2014. Maintaining contact with pseudoprogressive pseudocoordinations: Swedish verbal coordinations with ’sit’, ’stand’, and ’lie’ from a spatial perspective. Ms. Dept. of Swedish, University of Gothenburg.

Lars Borin, Markus Forsberg, and Johan Roxendal. 2012. Korp – the corpus infrastructure of Spr°akbanken. In Proceedings of LREC 2012, pages 474–478, Istanbul. ELRA.

Leo Breiman. 2001. Random forests. In Machine Learning, pages 5–32.

William Croft. 2001. Radical construction grammar : syntactic theory in typological perspective. Oxford University Press, New York.

Markus Forsberg, Richard Johansson, Linnéa Bäckström, Lars Borin, Benjamin Lyngfelt, Joel Olofsson, and Julia Prentice. 2014. From construction candidates to constructicon entries: An experiment using semi-automatic methods for identifying constructions in corpora. Constructions and Frames, 6(1):114–135.

Mirjam Fried and Jan-Ola Östman. 2004. Construction grammar in a cross-language perspective. John Benjamins Pub., Amsterdam.

Adele E. Goldberg. 1995. Constructions : a construction grammar approach to argument structure. Univ. of Chicago Press, Chicago.

Adele E. Goldberg. 2006. Constructions at work : the nature of generalization in language. Oxford Univ. Press, New York.

Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H. Witten. 2009. The weka data mining software: An update. SIGKDD Explor. Newsl., 11(1):10–18, November.

Martin Hilpert and Christian Koops. 2008. A quantitative approach to the development of complex predicates. The case of Swedish Pseudo-Coordination with sitta “sit”. Diachronica, 25(2):242–261.

Ulrika Kvist Darnell. 2008. Pseudosamordningar i svenska : särskilt såadana med verben sitta, ligga och stå. Institutionen f¨or lingvistik. Stockholms universitet, Stockholm.

Ulf Teleman, Staffan Hellberg, and Erik Andersson. 1999. Svenska Akademiens grammatik. Stockholm: Norstedts.

Yulia Tsvetkov and Shuly Wintner. 2011. Identification of multi-word expressions by combining multiple linguistic information sources. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pages 836–845, Stroudsburg, PA, USA. Association for Computational
Linguistics.

Anna-Lena Wiklund. 2007. The syntax of tenselessness : tense/mood/aspect-agreeing infinitivals. Mouton de Gruyter, Berlin.

Citeringar i Crossref