Techniques for automatically correcting words in text
- 1 December 1992
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Computing Surveys
- Vol. 24 (4) , 377-439
- https://doi.org/10.1145/146370.146380
Abstract
Research aimed at correcting words in text has focused on three progressively more difficult problems:(1) nonword error detection; (2) isolated-word error correction; and (3) context-dependent work correction. In response to the first problem, efficient pattern-matching and n -gram analysis techniques have been developed for detecting strings that do not appear in a given word list. In response to the second problem, a variety of general and application-specific spelling correction techniques have been developed. Some of them were based on detailed studies of spelling error patterns. In response to the third problem, a few experiments using natural-language-processing tools or statistical-language models have been carried out. This article surveys documented findings on spelling error patterns, provides descriptions of various nonword detection and isolated-word error correction techniques, reviews the state of the art of context-dependent word correction techniques, and discusses research issues related to all three areas of automatic error correction in text.Keywords
This publication has 97 references indexed in Scilit:
- Conventional and associative memory approaches to automatic spelling correctionEngineering Applications of Artificial Intelligence, 1992
- Probability scoring for spelling correctionStatistics and Computing, 1991
- Fault-tolerant database using distributed associative memoriesInformation Sciences, 1991
- Visual text recognition through contextual processingPattern Recognition, 1988
- Contextual word recognition using probabilistic relaxation labelingPattern Recognition, 1988
- Median split treesCommunications of the ACM, 1978
- The use of context in pattern recognitionPattern Recognition, 1978
- Context in word recognitionPattern Recognition, 1976
- Order- n correction for regular languagesCommunications of the ACM, 1974
- String similarity and misspellingsCommunications of the ACM, 1967