Konferensartikel

Can We Create a Tool for General Domain Event Analysis?

Siim Orasmaa
Institute of Computer Science, University of Tartu, Estonia

Heiki-Jaan Kaalep
Institute of Computer Science, University of Tartu, Estonia

Ladda ner artikel

Ingår i: Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden

Linköping Electronic Conference Proceedings 131:22, s. 192-201

NEALT Proceedings Series 29:22, p. 192-201

Visa mer +

Publicerad: 2017-05-08

ISBN: 978-91-7685-601-7

ISSN: 1650-3686 (tryckt), 1650-3740 (online)

Abstract

This study outlines a question about the possibility of creation of a tool for general domain event analysis. We provide reasons for assuming that a TimeML-based event modelling could be a suitable basis for general domain event modelling. We revise and summarise Estonian efforts on TimeML analysis, both at automatic analysis and human analysis, and provide an overview of the current challenges/limitations of applying a TimeML model in an extensive corpus annotation. We conclude with a discussion on reducing complexity of the (TimeML-based) event model.

Nyckelord

Inga nyckelord är tillgängliga

Referenser

Mieke Bal. 1997. Narratology: Introduction to the Theory of Narrative. University of Toronto Press. https://archive.org/details/ BalNarratologyIntroductionToTheTheoryOfNarrative (Date accessed: 2017-01-10).

Cosmin Adrian Bejan and Sanda M Harabagiu. 2008. A Linguistic Resource for Discovering Event Structures and Resolving Event Coreference. In LREC.

Steven Bethard, Oleksandr Kolomiyets, and Marie- Francine Moens. 2012. Annotating Story Timelines as Temporal Dependency Structures. In Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12), Istanbul, Turkey, may. European Language Resources Association (ELRA).

André Bittar. 2010. Building a TimeBank for French: a Reference Corpus Annotated According to the ISO-TimeML Standard. Ph.D. thesis, Université Paris Diderot, Paris, France.

David B Bracewell. 2015. Long nights, rainy days, and misspent youth: Automatically extracting and categorizing occasions associated with consumer products. SocialNLP 2015 @ NAACL, pages 29–38.

Roberto Casati and Achille Varzi. 2014. Events. In Edward N. Zalta, editor, The Stanford Encyclopedia of Philosophy. Fall 2014 edition. http://plato.stanford.edu/archives/fall2014/entries/events/ (Date accessed: 2017-01-20).

Tommaso Caselli, Valentina Bartalesi Lenzi, Rachele Sprugnoli, Emanuele Pianta, and Irina Prodanof. 2011. Annotating Events, Temporal Expressions and Relations in Italian: the It-Timeml Experience for the Ita-TimeBank. In Linguistic Annotation Workshop, pages 143–151. The Association for Computer Linguistics.

Hamish Cunningham. 2005. Information Extraction, Automatic. Encyclopedia of Language and Linguistics, 5:665–677.

Agata Cybulska and Piek Vossen. 2013. Semantic Relations between Events and their Time, Locations and Participants for Event Coreference Resolution. In RANLP, pages 156–163.

Tiiu Erelt, Ülle Viks, Mati Erelt, Reet Kasik, Helle Metslang, Henno Rajandi, Kristiina Ross, Henn Saari, Kaja Tael, and Silvi Vare. 1993. Eesti keele grammatika. 2., Süntaks (Grammar of Estonian: The syntax). Tallinn: Eesti TA Keele ja Kirjanduse Instituut.

Lisa Ferro, Laurie Gerber, Inderjeet Mani, Beth Sundheim, and George Wilson. 2005. TIDES 2005 standard for the annotation of temporal expressions. https://www.ldc.upenn.edu/sites/www.ldc.upenn.edu/files/english-timex2-guidelines-v0.1.pdf (Date accessed: 2017-01-15).

Walter R Fisher. 1984. Narration as a human communication paradigm: The case of public moral argument. Communications Monographs, 51(1):1–22.

Antske Fokkens, Marieke Van Erp, Piek Vossen, Sara Tonelli, Willem Robert van Hage, Luciano Serafini, Rachele Sprugnoli, and Jesper Hoeksema. 2013. GAF: A grounded annotation framework for events. In NAACL HLT, volume 2013, pages 11–20. Citeseer.

Lucian Galescu and Nate Blaylock. 2012. A corpus of clinical narratives annotated with temporal information. In Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium, pages 715–720. ACM.

Martin Haspelmath. 1997. From space to time: Temporal adverbials in the world’s languages. Lincom Europa.

Graham Katz and Fabrizio Arosio. 2001. The annotation of temporal information in natural language sentences. In Proceedings of the Workshop on Temporal and Spatial Information Processing, volume 13, pages 15–22. Association for Computational Linguistics.

Anaïs Lefeuvre-Halftermeyer, Jean-Yves Antoine, Alain Couillault, Emmanuel Schang, Lotfi Abouda, Agata Savary, Denis Maurel, Iris Eshkol-Taravella, and Delphine Battistelli. 2016. Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO-TimeML that Preserves Upward Compatibility. In LREC 2016.

G. Marsic. 2012. Syntactically Motivated Task Definition for Temporal Relation Identification. Special Issue of the TAL (Traitement Automatique des Langues) Journal on Processing of Temporal and Spatial Information in Language - Traitement automatique des informations temporelles et spatiales en langage naturel, vol. 53, no. 2:23–55.

Marie-Francine Moens, Oleksandr Kolomiyets, Emanuele Pianta, Sara Tonelli, and Steven Bethard. 2011. D3. 1: State-of-the-art and design of novel annotation languages and technologies: Updated version. Technical report, TERENCE project–ICT FP7 Programme–ICT-2010-25410. http://www.terenceproject.eu/c/document_library/get_file?p_l_id=16136&folderId=12950&name=DLFE-1910.pdf (Date accessed: 2017-01-15).

Kadri Muischnek, Kaili M¨u¨urisep, Tiina Puolakainen, Eleri Aedmaa, Riin Kirt, and Dage S¨arg. 2014. Estonian Dependency Treebank and its annotation scheme. In Proceedings of 13th Workshop on Treebanks and Linguistic Theories (TLT13), pages 285–291.

David Nadeau and Satoshi Sekine. 2007. A survey of named entity recognition and classification. Lingvisticae Investigationes, 30(1):3–26.

Joel Nothman. 2013. Grounding event references in news. Ph.D. thesis, The University of Sydney.

Siim Orasmaa. 2012. Automaatne ajav¨aljendite tuvastamine eestikeelsetes tekstides (Automatic Recognition and Normalization of Temporal Expressions in Estonian Language Texts). Eesti Rakenduslingvistika U¨ hingu aastaraamat, (8):153–169.

Siim Orasmaa. 2014a. How Availability of Explicit Temporal Cues Affects Manual Temporal Relation Annotation. In Human Language Technologies—The Baltic Perspective: Proceedings of the Sixth International Conference Baltic HLT 2014, volume 268, pages 215–218. IOS Press.

Siim Orasmaa. 2014b. Towards an Integration of Syntactic and Temporal Annotations in Estonian. In LREC, pages 1259–1266.

Siim Orasmaa. 2016. Explorations of the Problem of Broad-coverage and General Domain Event Analysis: The Estonian Experience. Ph.D. thesis, University of Tartu, Estonia.

James Pustejovsky and Jessica Moszkowicz. 2012. The Role of Model Testing in Standards Development: The Case of ISO-Space. In LREC, pages 3060–3063.

James Pustejovsky and Amber Stubbs. 2012. Natural Language Annotation for Machine Learning. O’Reilly Media, Inc.

James Pustejovsky, José Castaño, Robert Ingria, Roser Saurí, Robert Gaizauskas, Andrea Setzer, and Graham Katz. 2003a. TimeML: Robust specification of event and temporal expressions in text. In Fifth International Workshop on Computational Semantics (IWCS-5).

James Pustejovsky, Patrick Hanks, Roser Sauri, Andrew See, Robert Gaizauskas, Andrea Setzer, Dragomir Radev, Beth Sundheim, David Day, Lisa Ferro, et al. 2003b. The TimeBank corpus. In Corpus Linguistics, volume 2003, pages 647–656.

James Pustejovsky, Kiyong Lee, Harry Bunt, and Laurent Romary. 2010. ISO-TimeML: An International Standard for Semantic Annotation. In LREC.

James Pustejovsky, Jessica L Moszkowicz, and Marc Verhagen. 2011. ISO-Space: The annotation of spatial information in language. In Proceedings of the Sixth Joint ISO-ACL SIGSEM Workshop on Interoperable Semantic Annotation, pages 1–9.

Hans Reichenbach. 1947. Elements of symbolic logic. Macmillan Co.

Livio Robaldo, Tommaso Caselli, Irene Russo, and Matteo Grella. 2011. From Italian text to TimeML document via dependency parsing. In Computational Linguistics and Intelligent Text Processing, pages 177–187. Springer.

Roser Saurí, Robert Knippen, Marc Verhagen, and James Pustejovsky. 2005. Evita: a robust event recognizer for QA systems. In Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 700–707. Association for Computational Linguistics.

Roser Saurí, Jessica Littman, Robert Gaizauskas, Andrea Setzer, and James Pustejovsky. 2006. TimeML annotation guidelines, version 1.2.1. http://www.timeml.org/publications/timeMLdocs/annguide_1.2.1.pdf (Date accessed: 2017-01-20).

Roser Saur´i, Lotus Goldberg, Marc Verhagen, and James Pustejovsky. 2009. Annotating Events in English. TimeML Annotation Guidelines. http://www.timeml.org/tempeval2/tempeval2-trial/guidelines/EventGuidelines-050409.pdf (Date accessed: 2017-01-15).

Naushad UzZaman, Hector Llorens, Leon Derczynski, Marc Verhagen, James Allen, and James Pustejovsky. 2013. SemEval-2013 Task 1: TEMPEVAL-3: Evaluating Time Expressions, Events, and Temporal Relations. http://derczynski.com/sheffield/papers/tempeval-3.pdf (Date accessed: 2017-01-15).

Zeno Vendler. 1957. Verbs and times. The philosophical review, pages 143–160.

Marc Verhagen, Robert Gaizauskas, Frank Schilder, Mark Hepple, Jessica Moszkowicz, and James Pustejovsky. 2009. The TempEval challenge: identifying temporal relations in text. Language Resources and Evaluation, 43(2):161–179.

Marc Verhagen, Roser Sauri, Tommaso Caselli, and James Pustejovsky. 2010. SemEval-2010 task 13: TempEval-2. In Proceedings of the 5th international workshop on semantic evaluation, pages 57–62. Association for Computational Linguistics.

Piek Vossen, German Rigau, Luciano Serafini, Pim Stouten, Francis Irving, and Willem Robert Van Hage. 2014. Newsreader: recording history from daily news streams. In Proceedings of the 9th Language Resources and Evaluation Conference (LREC2014), Reykjavik, Iceland, May 26-31.

Patrick Henry Winston. 2011. The Strong Story Hypothesis and the Directed Perception Hypothesis. In Pat Langley, editor, Technical Report FS-11-01, Papers from the AAAI Fall Symposium, pages 345–352, Menlo Park, CA. AAAI Press.

Nianwen Xue and Yuping Zhou. 2010. Applying Syntactic, Semantic and Discourse Constraints in Chinese Temporal Annotation. In Proceedings of the 23rd International Conference on Computational Linguistics, COLING ’10, pages 1363–1372, Stroudsburg, PA, USA. Association for Computational Linguistics.

Yadollah Yaghoobzadeh, Gholamreza Ghassem-Sani, Seyed Abolghasem Mirroshandel, and Mahbaneh Eshaghzadeh. 2012. ISO-TimeML Event Extraction in Persian Text. In COLING, pages 2931–2944.

Annie Zaenen. 2006. Mark-up barking up the wrong tree. Computational Linguistics, 32(4):577–580.

Rolf A Zwaan and Gabriel A Radvansky. 1998. Situation models in language comprehension and memory. Psychological Bulletin, 123(2):162.

Citeringar i Crossref