Publicerad: 2001-01-25
ISBN:
ISSN: 1650-3686 (tryckt), 1650-3740 (online)
Noun phrase internal agreement errors and erroneously split compounds are two error types in Swedish texts that have become increasingly more frequent during the last decade. In this paper we present empirical results from studying occurrences of such errors in running text. The results show interesting frequency patterns in the realization of errors. For example; occurrences of agreement errors generally seem to deviate from a possible correction in only one morphological feature. Thus; we find error examples such as ett bil (the article ett deviates in gender from bil; Eng. car); but no examples such as ett bilarna (the article ett deviates in gender; number and species from the noun bilarna; Eng. the cars). Possible explanations for the rise of such error patterns are discussed.
We also show how agreement errors and split compound errors can be detected and corrected by Granska; a Swedish grammar checker being developed at the Royal Institute of Technology in Stockholm. From a Swedish perspective; detection of these errors is an important task since they are typically not handled in their morphological richness by grammar checkers originally made for English. However; the task is far from trivial; so problems in detecting errors and possible solutions are discussed.
Arppe; A. 2000: Developing a grammar checker for Swedish. I: T. Nordgård; ed. Nodalida’ 99; Trondheim; december 1999.
Birn; J. 2000: Detecting Grammar Errors with Lingsoft’s Swedish Grammar Checker. I: T. Nordgård; ed. Nodalida’99; Trondheim; december 1999.
Carlberger; J. & Kann; V. 1999: Implementing an efficient part-of-speech tagger. I: Software - Practice and Experience; 29 (9); pp. 815-832.
Domeij; R. 1997: Datorn och språkriktigheten. I: O. Josephson; ed.; Svenskan och IT-samhället. Hallgren & Fallgren; 1997.
Domeij; R. 1998: Detecting; presenting and correcting low-level problems when editing with and without computer aids. I: TEXT Technology; vol 8; no. 1. Wright State University; Celina; USA.
Domeij; R.; Eklundh; K.; Knutsson; O.; Larsson; S. & Rex; Å. 1998: Granskaprojektet 1996- 1997. Technical Report NADA; KTH.
Domeij; R.; Knutsson; O.; Carlberger; J. & Kann; V. 2000: Granska - ett effektivt hybridsystem för kontroll av svensk grammatik. I: T. Nordgård; ed. Nodalida’99; Trondheim; december 1999.
Kann; V.; Domeij; R.; Hollman; J.; & Tillenius; M. 1999: Implementaion aspects and applications of a spelling correction algorithm. In: R. Koehler; L. Uhlirova; G. Wimmer; eds; Text as a Linguistic Paradigm: Levels; Constituents; Constructs. Festschrift in honour of Ludek Hrebicek.
Knutsson O. 1996: Agreement Error Detection. Master’s Thesis; Department of Linguistics; Stockholm University.
Kohut & Gorman 1995: The effectiveness of leading grammar/style software in analysing business students’ writing. I: Journal of Technical Writing and Communication pp. 341- 361. July 1995.
Kukich; K. 1992: Techniques for automatically correcting words in text. ACM Computing Surveys; Vol. 24; No. 4; pp. 377-439.
Larsson; S. 1998: Interaktivitet och användbarhet vid datorstödd språkgranskning och redigering i en integrerad skrivmiljö. Exjobbsrapport TRITA-NA-E9833 (IPLab-150); Nada.
Severinson Eklundh; K. 1995: Skrivmönster med ordbehandlare. I: Språkvård; nr 4; sid 11– 18.
Sågvall Hein; A. 1998: A Chart-Based Framework for Grammar Checking. Initial studies. Nodalida’98; Copenhagen; january 1998.
Öhrman; L. 1998: Felaktigt särskrivna sammansättningar. C-uppsats i datorlingvistik; Institutionen för lingvistik; Stockholms Universitet.
Öhrman; L. 2000: Datorstödd språkgranskning och andraspråksinlärare. D-uppsats i datorlingvistik; Institutionen för lingvistik; Stockholms Universitet.