HistoBankVis: Detecting Language Change via Data Visualization

Christin Schätzle
Department of Linguistics, University of Konstanz, Germany

Michael Hund
Department of Computer Science, University of Konstanz, Germany

Frederik L. Dennig
Department of Computer Science, University of Konstanz, Germany

Miriam Butt
Department of Linguistics, University of Konstanz, Germany

Daniel A. Keim
Department of Computer Science, University of Konstanz, Germany

Ladda ner artikel

Ingår i: Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language

Linköping Electronic Conference Proceedings 133:7, s. 32-39

NEALT Proceedings Series 32:7, s. 32-39

Visa mer +

Publicerad: 2017-05-10

ISBN: 978-91-7685-503-4

ISSN: 1650-3686 (tryckt), 1650-3740 (online)


We present HistoBankVis, a novel visualization system designed for the interactive analysis of complex, multidimensional data to facilitate historical linguistic work. In this paper, we illustrate the visualization’s efficacy and power by means of a concrete case study investigating the diachronic interaction of word order and subject case in Icelandic.


Inga nyckelord är tillgängliga


R. Harald Baayen. 2008. Analyzing Linguistic Data. A Practical Introduction to Statistics Using R. Cambridge University Press, Cambridge.

Miriam Butt, Tina Bögel, Kristina Kotcheva, Christin Schätzle, Christian Rohrdantz, Dominik Sacha, Nicole Dehe, and Daniel Keim. 2014. V1 in Icelandic: A multifactorical visualization of historical data. In Proceedings of the LREC 2014 Workshop “VisLR: Visualization as added value in the development, use and evaluation of Language Resources”, Reykjavik, Iceland.

David Dowty. 1991. Thematic proto-roles and argument selection. Language, 67(3):547–619.

Irene Franco. 2008. V1, V2 and criterial movement in Icelandic. Studies in Linguistics, 2:141 – 164.

Jane Grimshaw. 1990. Argument Structure. The MIT Press, Cambridge.

Einar Haugen. 1984. Die skandinavischen Sprachen: Eine Einführung in ihre Geschichte. Hamburg: Buske.

Martin Hilpert and Stefan Th. Gries. 2016. Quantitative approaches to diachronic corpus linguistics. In Merja Kytö and Päivi Pahta, editors, The Cambridge Handbook of English Historical Linguistics, pages 36–53. Cambridge University Press, Cambridge.

Thorbjörg Hróarsdóttir. 2000. Word Order Change in Icelandic. From OV to VO. John Benjamins, Amsterdam. Daniel Keim, Gennady Andrienko, Jean-Daniel Fekete, Carsten Görg, Jörn Kohlhammer, and Guy Melançon. 2008. Visual analytics: Definition, process, and challenges. In Information visualization, pages 154–175. Springer.

Paul Kiparsky. 1996. The shift to head-initial VP in Germanic. In H. Thráinsson, J. Peter, and S. Epstein, editors, Comparative Germanic Syntax. Kluwer.

Verena Lyding, Stefania Degaetano-Ortlieb, Ekaterina Lapshinova-Koltunski, Henrik Dittmann, and Christopher Culy. 2012. Visualising linguistic evolution in academic discourse. In Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH, pages 44–48. Association for Computational Linguistics.

Christopher D. Manning and Hinrich Schütze. 2003. Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge, 6 edition.

Mitchell P. Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini. 1993. Building a large annotated corpus of English: the Penn Treebank. Computational Linguistics, 19(2):313–330.

Eiríkur Rögnvaldsson, Anton Karl Ingason, and Einar Freyr Sigurðsson. 2011. Coping with variation in the Icelandic Parsed Historical Corpus (IcePaHC). In J.B. Johannessen, editor, Language Variation Infrastructure, volume 3 of Oslo Studies in Language, pages 97–112.

Eiríkur Rögnvaldsson. 1996. Word order variation in the VP in Old Icelandic. Working Papers in Scandinavian Syntax, 58:55–86.

Christian Rohrdantz, Annette Hautli, Thomas Mayer, Miriam Butt, Frans Plank, and Daniel A. Keim. 2011. Towards Tracking Semantic Change by Visual Analytics. In Proceedings of ACL 2011 (Short Papers), pages 305–310.

Christian Rohrdantz, Andreas Niekler, Annette Hautli, Miriam Butt, and Daniel A. Keim. 2012. Lexical Semantics and Distribution of Suffixes - A Visual Analysis. Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH, pages 7–15, April.

Christian Rohrdantz. 2014. Visual Analytics of Change in Natural Language. Ph.D. thesis, University of Konstanz.

Christin Schätzle and Dominik Sacha. 2016. Visualizing language change: Dative subjects in Icelandic. In Annette Hautli-Janisz and Verena Lyding, editors, Proceedings of the LREC 2016 Workshop “VisLR II: Visualization as Added Value in the Development, Use and Evaluation of Language Resources”, pages 8–15.

Christin Schätzle, Miriam Butt, and Kristina Kotcheva. 2015. The diachrony of dative subjects and the middle in Icelandic: A corpus study. In M. Butt and T. H. King, editors, Proceedings of the LFG15 Conference. CSLI Publications.

Halldór Ármann Sigurðsson. 1990. V1 declaratives and verb raising in icelandic. In Joan Maling and Annie Zaenen, editors, Modern Icelandic Syntax (Syntax and Semantics 24), pages 41–69. Academic Press, San Diego.

Roberto Theron and Laura Fontanillo. 2015. Diachronic-information visualization in historical dictionaries. Information Visualization, 14(2):111–136.

Höskuldur Thráinsson. 1996. Icelandic. In Ekkehard König and Johan van der Auwera, editors, The Germanic Languages, pages 142–189. Routledge, London.

Joel C. Wallenberg, Anton Karl Ingason, Einar Freyr Sigurðsson, and Eiríkur Rögnvaldsson. 2011. Icelandic Parced Historical Corpus (IcePaHC). Version 0.9.

Citeringar i Crossref