Leveraging known Semantics for Spelling Correction

Levi King
Indiana University, Bloomington, IN USA

Markus Dickinson
Indiana University, Bloomington, IN USA

Ladda ner artikel

Ingår i: Proceedings of the third workshop on NLP for computer-assisted language learning at SLTC 2014, Uppsala University

Linköping Electronic Conference Proceedings 107:4, s. 43–58

NEALT Proceedings Series 22:4, s. 43–58

Visa mer +

Publicerad: 2014-11-11

ISBN: 978-91-7519-175-1

ISSN: 1650-3686 (tryckt), 1650-3740 (online)


Focusing on applications for analyzing learner language which evaluate semantic appropriateness and accuracy, we build from previous work which modeled some aspects of interaction, namely a picture description task (PDT), with the goal of integrating a spelling correction component in this context. After parsing a sentence and extracting semantic relations, a surprising number of analysis failures stem from misspellings, deviating from expected input in ways that can be modeled when the content of the interaction is known. We thus explore the use of spelling correction tools and language modeling to correct misspellings that often lead to errors in obtaining semantic forms, and we show that such tools can significantly reduce the number of unanalyzable cases. The work is useful for any context where image descriptions or some expected content is available, but not necessarily expected linguistic forms.


Picture description task; semantic analysis; spelling correction; language modeling


Atkinson, K. (1998). GNU Aspell. http://aspell.net.

Clarkson, P. and Rosenfeld, R. (1997). Statistical language modeling using the CMU-Cambridge Toolkit. In Eurospeech, volume 97, pages 2707–2710.

Dale, R., Anisimoff, I., and Narroway, G. (2012). HOO 2012: A report on the preposition and determiner error correction shared task. In Proceedings of the Seventh Workshop on Building Educational Applications Using NLP, pages 54–62, Montréal. de Marneffe, M.-C., MacCartney, B., and Manning, C. D. (2006). Generating typed dependency parses from phrase structure parses. In Proceedings of LREC 2006, Genoa, Italy.

DeSmedt, W. (1995). Herr Kommissar: An ICALL conversation simulator for intermediate German. In Holland, V. M., Kaplan, J., and Sams, M., editors, Intelligent Language Tutors: Theory Shaping Technology, pages 153–174. Lawrence Erlbaum, Mahwah, NJ.

Ellis, R. (2000). Task-based research and language pedagogy. Language Teaching Research, 4(3):193–220.

Fellbaum, C., editor (1998). WordNet: An Electronic Lexical Database. The MIT Press, Cambridge, MA.

Flor, M. (2012). Four types of context for automatic spelling correction. TAL, 53(2):61–99.

Flor, M. and Futagi, Y. (2012). On using context for automatic correction of non-word misspellings in student essays. In Proceedings of the Seventh Workshop on Building Educational Applications Using NLP, pages 105–115. Association for Computational Linguistics.

Flor, M., Futagi, Y., Lopez, M., and Mulholland, M. (2013). Patterns of misspellings in L2 and L1 English: A view from the ETS Spelling Corpus. In Proceedings of the Second Learner Corpus Research Conference (LCR 2013).

Forbes-McKay, K. and Venneri, A. (2005). Detecting subtle spontaneous language decline in early Alzheimer’s disease with a picture description task. Neurological Sciences, 26(4):243–254.

Graff, D., Kong, J., Chen, K., and Maeda, K. (2007). English Gigaword, Third Edition. Hahn, M. and Meurers, D. (2012). Evaluating the meaning of answers to reading comprehension questions: A semantics-based approach. In Proceedings of the 7th Workshop on Innovative Use of NLP for Building Educational Applications (BEA7), pages 326–336, Montreal, Canada. Association for Computational Linguistics.

Heift, T. and Schulze, M. (2007). Errors and Intelligence in Computer-Assisted Language Learning: Parsers and Pedagogues. Routledge.

Hovermale, D. (2008). SCALE: Spelling Correction Adapted for Learners of English. Pre-CALICO Workshop on “Automatic Analysis of Learner Language: Bridging Foreign Language Teaching Needs and NLP Possibilities”. March 18-19, 2008. San Francisco, CA.

Hovermale, D. (2010). An analysis of the spelling errors of L2 English learners. In CALICO 2010 Conference, Amherst, MA, USA.

King, L. and Dickinson, M. (2013). Shallow semantic analysis of interactive learner sentences. In Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, pages 11–21, Atlanta, Georgia.

Klein, D. and Manning, C. D. (2003). Accurate unlexicalized parsing. In Proceedings of ACL-03, Sapporo, Japan.

Lachowicz, D. (2003). Enchant. http://abisource.com/projects/enchant.

Leacock, C., Chodorow, M., Gamon, M., and Tetreault, J. (2010). Automated Grammatical Error Detection for Language Learners. Synthesis Lectures on Human Language Technologies. Morgan Claypool.

Manning, C. D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S. J., and McClosky, D. (2014). The Stanford CoreNLP natural language processing toolkit. In Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 55–60, Baltimore, Maryland. Association for Computational Linguistics.

Meurers, D. (2012). Natural language processing and language learning. In Chapelle, C. A., editor, Encyclopedia of Applied Linguistics. Blackwell.

Meurers, D., Ziai, R., Ott, N., and Bailey, S. (2011). Integrating parallel analysis modules to evaluate the meaning of answers to reading comprehension questions. Special Issue on Free-text Automatic Evaluation. International Journal of Continuing Engineering Education and Life-Long Learning (IJCEELL), 21(4):355–369.

Petersen, K. A. (2010). Implicit Corrective Feedback in Computer-Guided Interaction: Does Mode Matter? PhD thesis, Georgetown University, Washington, DC.

Somasundaran, S. and Chodorow, M. (2014). Automated measures of specific vocabulary knowledge from constructed responses (‘Use these words to write a sentence based on this picture’). In Proceedings of the Ninth Workshop on Innovative Use of NLP for Building Educational Applications, pages 1–11, Baltimore, Maryland.

Sproat, R., Black, A. W., Chen, S., Kumar, S., Ostendorf, M., and Richards, C. (2001). Normalization of non-standard words. Computer Speech & Language, 15(3):287–333.

Citeringar i Crossref