Conference article

Self Organizing Maps for the Visual Analysis of Pitch Contours

Dominik Sacha
Data Analysis and Visualization Group & Department of Linguistics, University of Konstanz, Germany

Yuki Asano
Data Analysis and Visualization Group & Department of Linguistics, University of Konstanz, Germany

Christian Rohrdantz
Data Analysis and Visualization Group & Department of Linguistics, University of Konstanz, Germany

Felix Hamborg
Data Analysis and Visualization Group & Department of Linguistics, University of Konstanz, Germany

Daniel Keim
Data Analysis and Visualization Group & Department of Linguistics, University of Konstanz, Germany

Bettina Brau
Data Analysis and Visualization Group & Department of Linguistics, University of Konstanz, Germany

Miriam Butt
Data Analysis and Visualization Group & Department of Linguistics, University of Konstanz, Germany

Download article

Published in: Proceedings of the 20th Nordic Conference of Computational Linguistics, NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania

Linköping Electronic Conference Proceedings 109:23, s. 181-189

NEALT Proceedings Series 23:23, s. 181-189

Show more +

Published: 2015-05-06

ISBN: 978-91-7519-098-3

ISSN: 1650-3686 (print), 1650-3740 (online)

Abstract

We present a novel interactive approach for the visual analysis of intonation contours. Audio data are processed algorithmically and presented to researchers through interactive visualizations. To this end, we automatically analyze the data using machine learning in order to find groups or patterns. These results are visualized with respect to meta-data. We present a flexible, interactive system for the analysis of prosodic data. Using realworld application examples, one containing preprocessed, the other raw data, we demonstrate that our system enables researchers to interact dynamically with the data at several levels and by means of different types of visualizations, thus arriving at a better understanding of the data via a cycle of hypothesis generation and testing that takes full advantage of our visual pro- cessing abilities.

Keywords

No keywords available

References

Richard Arias-Hernandez, Linda T Kaastra, Tera Marie Green, and Brian Fisher. 2011. Pair analytics: Capturing reasoning processes in collaborative visual analytics. In System Sciences (HICSS), 2011 44th Hawaii International Conference on, pages 1–10. IEEE.

Mary E. Beckman, Julia Hirschberg, and Stefanie Shattuck-Hufnagel. 2005. The original ToBI sys tem and the evolution of the ToBI framework. In S.-A. Jun, editor, Prosodic Typology – The Phonology of Intonation and Phrasing. Oxford University Press.

Paul Boersma and DavidWeenink. 2011. Praat: Doing phonetics by computer [computer program] version 5.2.20. Carl de Boor. 2001. A Practical Guide to Splines.
Springer, New York.

Michele Gubian, Yuki Asano, Salomi Asaridou, and Francesco Cangemi. 2013. Rapid and smooth pitch contour manipulation. In Proceedings of the 14th Annual Conference of the International Speech Communication Association, Lyon, France, pages 31–35.

Carlos Gussenhoven. 2004. The Phonology of Tone and Intonation. Research surveys in linguistics. Cambridge University Press, Cambridge. 2003065202 Carlos Gussenhoven. ill. ; 24 cm. Includes bibliographical references (p. 321-344) and index.

Daniel Keim, Gennady Andrienko, Jean-Daniel Fekete, Carsten G¨org, J¨orn Kohlhammer, and Guy Melanc¸on. 2008. Visual analytics: Definition, process, and challenges. Springer.

Daniel A. Keim, J¨orn Kohlhammer, Geoffrey P. Ellis, and Florian Mansmann. 2010. Mastering the Information Age - Solving Problems with Visual Analytics. Eurographics Association. Teuvo Kohonen. 2001. Self-organizing Maps, volume 30. Springer.

Rudolf Mayer, Jakob Frank, and Andreas Rauber. 2009. Analytic comparison of audio feature sets using self-organising maps. In Proceedings of the Workshop on Exploring Musical Information Spaces, in Conjunction with ECDL, pages 62–67.

Julia Moehrmann, Andre Burkovski, Evgeny Baranovskiy, Geoffrey-Alexeij Heinze, Andrej Rapoport, and Gunther Heidemann. 2011. A discussion on visual interactive data exploration using self-organizing maps. In Advances in Self-Organizing Maps - 8th International Workshop, WSOM 2011, Espoo, Finland, June 13-15, 2011. Proceedings, pages 178–187.

James Ramsay and Bernard. W. Silverman. 2009. Functional Data Analysis. Springer.

James Ramsay, Giles Hookers, and Spencer Graves. 2009. Functional Data Analysis with R and MAT-LAB. Springer.

Dominik Sacha, Andreas Stoffel, Florian Stoffel, Bum Chul Kwon, Geoffrey P. Ellis, and Daniel A. Keim. 2014. Knowledge generation model for visual analytics. IEEE Transactions on Visualization and Computer Graphics, 20(12):1604–1613.

Tobias Schreck, J¨urgen Bernard, Tatiana Tekuov, and J¨orn Kohlhammer. 2009. Visual cluster analysis of trajectory data with interactive Kohonen maps. Palgrave Macmillan Information Visualization, 8:14–29.

Tobias Schreck, 2010. Visual-Interactive AnalysisWith Self- Organizing Maps — Advances and Research Challenges, pages 83–96. Intech.

Larry Selinker. 1972. Interlanguage. IRAL-International Review of Applied Linguistics in Language Teaching, 10(1-4):209–232.

Ana Cristina C Silva, Ana Cristina P Macedo, and Guilherme A Barreto. 2011. A SOM-based analysis of early prosodic acquisition of English by Brazilian learners: preliminary results. In Advances in Self-Organizing Maps, pages 267–276. Springer.

Ryszard Tadeusiewicz, Wieslaw Wszolek, Antoni Izworski, and Tadeusz Wszolek. 1999. The methods of pathological speech visualization [using Kohonen neural networks]. In Engineering in Medicine and Biology, 1999. 21st Annual Conference and the 1999 Annual Fall Meetring of the Biomedical Engineering Society, BMES/EMBS Conference, 1999. Proceedings of the First Joint, volume 2, pages 980 vol.2–, Oct.

James J. Thomas and Kristin A. Cook. 2006. A visual analytics agenda. IEEE Computer Graphics and Applications, 26(1):10–13.

Juha Vesanto. 1999. SOM-based data visualization methods. Intelligent Data Analysis, 3(2):111–126.

Nigel G Ward and Joshua L Mccartney. 2010. Visualization to support the discovery of prosodic contours related to turn-taking.

Nigel G Ward. 2014. Automatic discovery of simply-composable prosodic elements. In Speech Prosody, volume 2014, pages 915–919.

Citations in Crossref