Konferensartikel

Services for text simplification and analysis

Johan Falkenjack
Linköping University and RISE SICS East AB, Linköping, Sweden

Evelina Rennes
Linköping University and RISE SICS East AB, Linköping, Sweden

Daniel Fahlborg
Linköping University and RISE SICS East AB, Linköping, Sweden

Vida Johansson
Linköping University and RISE SICS East AB, Linköping, Sweden

Arne Jönsson
Linköping University and RISE SICS East AB, Linköping, Sweden

Ladda ner artikel

Ingår i: Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden

Linköping Electronic Conference Proceedings 131:44, s. 309-313

NEALT Proceedings Series 29:44, s. 309-313

Visa mer +

Publicerad: 2017-05-08

ISBN: 978-91-7685-601-7

ISSN: 1650-3686 (tryckt), 1650-3740 (online)

Abstract

We present a language technology service for web editors’ work on making texts easier to understand, including tools for text complexity analysis, text simplification and text summarization. We also present a text analysis service focusing on measures of text complexity.

Nyckelord

Inga nyckelord är tillgängliga

Referenser

Sarah Albertsson, Evelina Rennes, and Arne J¨onsson. 2016. Similarity-based alignment of monolingual corpora for text simplification. In Coling 2016 Workshop on Computational Linguistics for Linguistic Complexity (CL4LC), Osaka, Japan.

Sandra Alusio, Lucia Specia, Caroline Gasperin, and Carolina Scarton. 2010. Readability assessment for text simplification. In Proceedings of the NAACL HLT 2010 Fifth Workshop on Innovative Use of NLP for Building Educational Applications, pages 1–9.

Nilhadri Chatterjee and Shiwali Mohan. 2007. Extraction-Based Single-Document Summarization Using Random Indexing. In Proceedings of the 19th IEEE international Conference on Tools with Artificial intelligence – (ICTAI 2007), pages 448–455.

Edgar Dale and Jeanne S. Chall. 1949. The concept of readability. Elementary English, 26(23).

Anna Decker. 2003. Towards automatic grammatical simplification of swedish text. Master’s thesis, Stockholm University.

Felice Dell’Orletta, Simonetta Montemagni, and Giulia Venturi. 2011. READ-IT: Assessing Readability of Italian Texts with a View to Text Simplification. In Proceedings of the 2nd Workshop on Speech and Language Processing for Assistive Technologies, pages 73–83, July.

Eva Ejerhed, Gunnel Källgren, and Benny Brodda. 2006. Stockholm Umeå Corpus version 2.0.

Daniel Fahlborg and Evelina Rennes. 2016. Introducing SAPIS - an API service for text analysis and simplification. In The second national Swe-Clarin workshop: Research collaborations for the digital age, Umeå, Sweden.

Johan Falkenjack and Arne Jönsson. 2014. Classifying easy-to-read texts without parsing. In The 3rd Workshop on Predicting and Improving Text Readability for Target Reader Populations (PITR 2014), Göteborg, Sweden.

Johan Falkenjack, Katarina HeimannM¨uhlenbock, and Arne Jönsson. 2013. Features indicating readability in Swedish text. In Proceedings of the 19th Nordic Conference of Computational Linguistics (NoDaLiDa-2013), Oslo, Norway, NEALT Proceedings Series 16.

Martin Hassel. 2007. Resource Lean and Portable Automatic Text Summarization. Ph.D. thesis, ISRNKTH/CSC/A–07/09-SE, KTH, Sweden.

Martin Hassel. 2011. Java Random Indexing toolkit, January 2011. http://www.csc.kth.se/~xmartin/java/.

Michael J. Heilman, Kevyn Collins-Thompson, Jamie Callan, and Maxine Eskenazi. 2007. Combining Lexical and Grammatical Features to Improve Readability Measures for First and Second Language Texts. In Proceedings of NAACL HLT 2007, pages 460–467.

Katarina Heimann M¨uhlenbock. 2013. I see what you mean. Assessing readability for specific target groups. Dissertation, Språkbanken, Dept of Swedish, University of Gothenburg. Derrick Higgins and Jill Burstein. 2007. Sentence similarity measures for essay coherence. In Proceedings of the 7th International Workshop on Computational Semantics (IWCS), Tilburg, The Netherlands.

Vida Johansson and Evelina Rennes. 2016. Automatic extraction of synonyms from an easy-to-read corpus. In Proceedings of the Sixth Swedish Language Technology Conference (SLTC-16), Umeå, Sweden.

Robin Keskisärkkä and Arne J¨onsson. 2013. Investigations of Synonym Replacement for Swedish. Northern European Journal of Language Technology, 3(3):41–59.

John Lee, Wenlong Zhao, and Wenxiu Xie. 2016. A customizable editor for text simplification. In Proceedings of COLING, Osaka, Japan.

Haitao Liu. 2008. Dependency distance as a metric of language comprehension difficulty. Journal of Cognitive Science, 9(2):169–191.

Rada Mihalcea. 2004. Graph-based ranking algorithms for sentence extraction, applied to text summarization. In Proceedings of the ACL 2004 on Interactive poster and demonstration sessions, ACLdemo ’04, Morristown, NJ, USA. Association for Computational Linguistics.

Thomas Morton, Joern Kottmann, Jason Baldridge, and Gann Bierner. 2005. Opennlp: A java-based nlp toolkit.

Ani Nenkova, Jieun Chae, Annie Louis, and Emily Pitler. 2010. Structural Features for Predicting the Linguistic Quality of Text Applications to Machine Translation, Automatic Summarization and Human–Authored Text. In E. Krahmer and M. Theune, editors, Empirical Methods in NLG, pages 222–241. Springer-Verlag.

Joakim Nivre, Johan Hall, Jens Nilsson, Atanas Chanev, G¨uls¸en Eryigit, Sandra K¨ubler, Svetoslav Marinov, and Erwin Marsi. 2007. MaltParser: A language-independent system for data-driven dependency parsing. Natural Language Engineering, 13(2):95–135.

Robert O¨ stling. 2013. Stagger: an open-source part of speech tagger for swedish. Northen European Journal of Language Technology, 3.

Sarah Petersen and Mari Ostendorf. 2009. A machine learning approach toreading level assessment. Computer Speech and Language, 23:89–106.

Sarah Petersen. 2007. Natural language processing tools for reading level assessment and text simplification for bilingual education. Ph.D. thesis, University of Washington, Seattle, WA.

Evelina Rennes and Arne Jönsson. 2015. A tool for automatic simplification of swedish texts,. In Proceedings of the 20th Nordic Conference of Computational Linguistics (NoDaLiDa-2015), Vilnius, Lithuania.

Evelina Rennes and Arne Jönsson. 2016. Towards a corpus of easy to read authority web texts. In Proceedings of the Sixth Swedish Language Technology Conference (SLTC-16), Umeå, Sweden.

Jonas Rybing, Christian Smith, and Annika Silvervarg. 2010. Towards a Rule Based System for Automatic Simplification of Texts. In Swedish Language Technology Conference, SLTC, Linköping, Sweden.

Horacio Saggion, Sanja Stajner, Stefan Bott, Simon Mille, Luz Rello, and Biljana Drndarevic. 2015. Making it simplext: Implementation and evaluation of a text simplification system for spanish. ACM Transactions on Accessible Computing, 6(4).

Carolina Scarton, Matheus de Oliveira, Arnaldo Candido, Jr., Caroline Gasperin, and Sandra Maria Alu´isio. 2010. Simplifica: A tool for authoring simplified texts in brazilian portuguese guided by readability assessments. In Proceedings of the NAACL HLT 2010 Demonstration Session, HLT-DEMO ’10, pages 41–44, Stroudsburg, PA, USA. Association for Computational Linguistics.

Christian Smith and Arne Jönsson. 2011a. Automatic Summarization As Means Of Simplifying Texts, An Evaluation For Swedish. In Proceedings of the 18th Nordic Conference of Computational Linguistics (NoDaLiDa-2010), Riga, Latvia.

Christian Smith and Arne Jönsson. 2011b. Enhancing extraction based summarization with outside word space. In Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand.

Citeringar i Crossref