Conference article

Conversion and Annotation Web Services for Spoken Language Data in CLARIN

Thomas Schmidt
Institute for the German Language (IDS), Mannheim, Germany

Hanna Hedeland
Hamburg Centre for Language Corpora (HZSK), University of Hamburg, Germany

Daniel Jettka
Hamburg Centre for Language Corpora (HZSK), University of Hamburg, Germany

Download article

Published in: Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26–28 October 2016, CLARIN Common Language Resources and Technology Infrastructure

Linköping Electronic Conference Proceedings 136:9, p. 113-130

Show more +

Published: 2017-05-23

ISBN: 978-91-7685-499-0

ISSN: 1650-3686 (print), 1650-3740 (online)

Abstract

We present an approach to making existing CLARIN web services usable for spoken language transcriptions. Our approach is based on a new TEI-based ISO standard for such transcriptions. We show how existing tool formats can be transformed to this standard, how an encoder/decoder pair for the TCF format enables users to feed this type of data through a WebLicht tool chain, and why and how web services operating directly on the standard format would be useful.

Keywords

Standardization, spoken language, web service, toolchain

References

[Banski et al. 2016] Banski, Piotr, Bertrand Gaiffe, Patrice Lopez, Simon Meoni, Laurent Romary, Thomas Schmidt, Peter Stadler, and Andreas Witt. 2016. In Claudia Resch, Vanessa Hannesschläger, and Tanja Wissik, editors, TEI Conference and Members’ Meeting 2016 – Book of Abstracts, Vienna, pages 35-37.http://tei2016.acdh.oeaw.ac.at/sites/default/files/TEIconf2016_BookOfAbstracts.pdf

[Barras et al. 2000] Barras, Claude, Edouard Geoffrois, Zhibiao Wu, and Mark Liberman. 2001. Transcriber: development and use of a tool for assisting speech corpora production. Speech Communicatin – Special issue on Speech Annotation and Corpus Tools, 33(1-2):5–22. http://languagelog.ldc.upenn.edu/myl/Barras2001.pdf. DOI: 10.1016/S0167-6393(00)00067-4.

[Bird and Liberman 2001] Bird, Steven and Mark Liberman. 2001. A formal framework for linguistic annotation. Speech Communicatin – Special issue on Speech Annotation and Corpus Tools, 33(1-2):23–60. http://languagelog.ldc.upenn.edu/myl/BirdLiberman2001.pdf. DOI: 10.1016/S0167-6393(00)00068-6.

[Boersma 2014] Boersma, Paul. 2014. The Use of Praat in Corpus Research. In Jacques Durand, Ulrike Gut, and Gjert Kristoffersen, editors, The Oxford Handbook of Corpus Phonology. Oxford University Press, Oxford, UK. DOI: 10.1093/oxfordhb/9780199571932.013.016.

[Brouwer and Kemps-Snijders 2016] Brouwer, Maththijs and Kemps-Snijders, Marc. 2016. A SOLR/Lucene based Multi Tier Annotation Search solution. Proceedings of the CLARIN Annual Conference (CAC), 2016, Aix, France. https://www.clarin.eu/sites/default/files/brouwer-kempssnijdersCLARIN2016_paper_21.pdf

[Eckart de Castilho et. al 2014] Eckart de Castilho, Richard, Biemann, Chris, Gurevych, Irina and Yimam, S.M. 2014. WebAnno: a flexible, web-based annotation tool for CLARIN. In Proceedings of the CLARIN Annual Conference (CAC) 2014, Soesterberg, Netherlands.

[Eshkol-Taravella et. al 2012] Eshkol-Taravella, Iris, Oliver Baude, Denis Maurel, Linda Hriba, Céline Dugua, and Isabelle Tellier. 2012. Un grand corpus oral «disponible»: le corpus d’Orléans 1968-2012. Ressources linguistiques libres, 52(3/2011):17–46. https://www.atala.org/IMG/pdf/Eshkol-TAL52-3.pdf.

[Groupe ICOR 2017] GROUPE ICOR (Heike Baldauf-Quilliatre, Isabel Colón de Carvajal, Carole Etienne, Emilie Jouin-Chardon, Sandra Teston-Bonnard, and Véronique Traverso) (in press). CLAPI, une base de données multimodale pour la parole en interaction : apports et dilemmes. In Mathieu Avanzi, Marie-José Béguelin, and Federica Diémoz, editors, Corpus de français parlés et français parlés des corpus, Cahiers Corpus, Université de Neuchâtel.

[Hinrichs and Vogel 2010] Hinrichs, Erhard and Iris Vogel. 2010. CLARIN - Interoperability and Standards. In CLARIN devliverable D5.C-3. http://www-sk.let.uu.nl/u/D5C-3.pdf.

[Hinrichs et al. 2010] Hinrichs, Marie, Thomas Zastrow, and Erhard Hinrichs. 2010. WebLicht: Web-based LRT Services in a Distributed eScience Infrastructure. In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10), Valetta, Malta. http://www.lrec-conf.org/proceedings/lrec2010/pdf/270_Paper.pdf.

[IETF] Internet Engineering Task Force. 2013. Media Type Specifications and Registration Procedures – Best Practices. https://tools.ietf.org/html/rfc6838#section-3.4.

[ISO 2016] ISO 2462:2016. Language resource management – Transcription of spoken language. http://www.iso.org/iso/catalogue_detail.htm?csnumber=37338.

[Jettka & Stein 2014] Jettka, Daniel, Daniel Stein. 2014. The HZSK Repository: Implementation, Features, and Use Cases of a Repository for Spoken Language Corpora. D-Lib Magazine, 20(9/10). http://www.dlib.org/dlib/september14/jettka/09jettka.html. DOI: 10.1045/september2014-jettka.

[Kipp 2014] Kipp, Michael. 2014. ANVIL: A Universal Video Research Tool. In Jacques Durand, Ulrike Gut, and Gjert Kristoffersen, editors, The Oxford Handbook of Corpus Phonology. Oxford University Press, Oxford, UK. DOI: 10.1093/oxfordhb/9780199571932.013.024.

[Kisler et al. 2010] Kisler, Thomas, Florian Schiel, and Han Sloetjes. 2012. Signal processing via web services: the use case WebMAUS. In Proceedings of Digital Humanities 2012, Hamburg, pages 30–34. http://clarind.de/images/workshops/proceedingssoasforthehumanities.pdf.

[Klein and Schütte 2001] Klein, Wolfgang, Wilfried Schütte. 2001. Transkriptionsrichtlinien für die Eingabe in DIDA. Institut für Deutsche Sprache, Mannheim, Germany. http://agd.ids-mannheim.de/download/dida-trl.pdf.

[Krause and Zeldes 2016] Krause, Thomas and Zeldes, Amir. 2016. ANNIS3: A new architecture for generic corpus query and visualization. In Digital Scholarship in the Humanities 2016 (31). http://dsh.oxfordjournals.org/content/31/1/118.

[MacWhinney 2000] MacWhinney, Brian. 2000. The CHILDES Project: Tools for Analyzing Talk. Psychology Press, New York, USA.

[Menke et al. 2015] Menke, Peter, Farina Freigang, Thomas Kronenberg, Sören Klett, and Kirsten Bergmann. 2015. First Steps towards a Tool Chain for Automatic Processing of Multimodal Corpora. Journal of Multimodal Communication Studies, 2:30–43. http://jmcs.home.amu.edu.pl/wp-content/uploads/2015/09/Menke_et_al_2014_JMCS.pdf.

[Parisse and Morgenstern 2010] Parisse, Christophe and Aliyah Morgenstern. 2010. A multi-software integration platform and support for multimedia transcripts of language. In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10) [Workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality], Valetta, Malta. http://www.lrec-conf.org/proceedings/lrec2010/.

[Rehbein et al. 2004] Rehbein, Jochen, Thomas Schmidt, Bernd Meyer, Franziska Watzke, and Annette Herkenrath. 2004. Handbuch für das computergestützte Transkribieren nach HIAT. In Arbeiten zur Mehrsprachigkeit, volume 56. http://www.exmaralda.org/files/azm_56.pdf.

[Schmidt 2005] Schmidt, Thomas. 2005. Time-based data models and the Text Encoding Initiative’s guidelines
for transcription of speech. In Arbeiten zur Mehrsprachigkeit (Folge B), volume 62. http://www.exmaralda.org/files/SFB_AzM62.pdf.

[Schmidt et al. 2009] Schmidt, Thomas, Susan Duncan, Oliver Ehmer, Jeffrey Hoyt, Michael Kipp, Dan Loehr, Magnus Magnusson, Travis Rose, and Han Sloetjes. 2009. An Exchange Format for Multimodal Annotations. In Michael Kipp, Jean-Claude Martin, Patrizia Paggio, Dirk Heylen, editors, Multimodal Corpora. Springer, Berlin, Germany. http://link.springer.com/chapter/10.1007/978-3-642-04793-0_13. DOI: 10.1007/978-3-642-04793-0_13.

[Schmidt 2011] Schmidt, Thomas. 2011. A TEI-based Approach to Standardising Spoken Language Transcription. Journal of the Text Encoding Initiative, 1:1–22. http://jtei.revues.org/142. DOI: 10.4000/jtei.142.

[Schmidt 2012] Schmidt, Thomas. 2012. EXMARaLDA and the FOLK tools. In Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12), Istanbul, Turkey. http://www.lrecconf.org/proceedings/lrec2012/pdf/529_Paper.pdf.

[Schmidt 2014] Schmidt, Thomas. 2014. The Database for Spoken German – DGD2. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik, Iceland. http://www.lrec-conf.org/proceedings/lrec2014/pdf/171_Paper.pdf.

[Schmidt and Wörner 2014] Schmidt, Thomas and Kai Wörner. 2014. EXMARaLDA. In Jacques Durand, Ulrike Gut, and Gjert Kristoffersen, editors, The Oxford Handbook of Corpus Phonology. Oxford University Press, Oxford, UK. DOI: http://dx.doi.org/10.1093/oxfordhb/9780199571932.013.030.

[Schmidt et al. 2015] Schmidt, Thomas, Wilfried Schütte, and Jenny Winterscheid. 2015. cGAT. Konventionen für das computergestützte Transkribieren in Anlehnung an das Gesprächsanalytische Transkriptionssystem 2 (GAT2). https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/4616.

[Selting et al. 2009] Selting, Margret, Peter Auer, Dagmar Barth-Weingarten, Jörg Bergmann, Pia Bergmann, Karin Birkner, Elizabeth Couper-Kuhlen, Arnulf Deppermann, Peter Gilles, Susanne Günthner, Martin Hartung, Friederike Kern, Christine Mertzlufft, Christian Meyer, Miriam Morek, Frank Oberzaucher, Jörg Peters, Uta Quasthoff, Wilfried Schütte, Anja Stukenbrock, and Susanne Uhmann. 2009. Gesprächsanalytisches Transkriptionssystem 2 (GAT 2). Gesprächsforschung - Online-Zeitschrift zur verbalen Interaktion, 10:353–402. http://www.gespraechsforschung-ozs.de/heft2009/px-gat2.pdf.

[Sloetjes 2014] Sloetjes, Han. 2014. ELAN: Multimedia Annotation Application. In Jacques Durand, Ulrike Gut, and Gjert Kristoffersen, editors, The Oxford Handbook of Corpus Phonology. Oxford University Press, Oxford, UK. DOI: 10.1093/oxfordhb/9780199571932.013.019.

[TEI] Text Encoding Initiative. 2015. Guidelines. http://www.tei-c.org/Guidelines/.

[Westpfahl and Schmidt 2016] Westpfahl, Swantje and Thomas Schmidt. 2016. FOLK-Gold – A GOLD standard for Part-of-Speech-Tagging of Spoken German. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia. http://www.lrec-conf.org/proceedings/lrec2016/pdf/397_Paper.pdf.

[Zipser and Romary 2010] Zipser, Florian and Romary, Laurent. 2010. A model oriented approach to the mapping of annotation formats using standards. In: Proceedings of the Workshop on Language Resource and Language Technology Standards, LREC 2010. Malta. http://hal.archives-ouvertes.fr/inria-00527799/en/.‘

Citations in Crossref