Some experiments with large-vocabulary isolated-word sentence recognition

Abstract
This paper deals with two experiments with a large vocabulary isolated word recognizer. The first compares word error rates for 1) meaningful sentences belonging to actual documents and 2) random word lists from the same vocabulary. The error rate is considerably lower for random word lists. The second experiment investigates the performance of the recognition system on sentences containing words outside the vocabulary of the recognizer. Sentences from a 5000 word vocabulary task are recognized with a recognizer limited to a 2000 word subvocabulary. The error rate is only slightly higher than it would be if recognition of the full 5000 word vocabulary was allowed. Author(s) Bahl, L. IBM Thomas J. Watson Research Center, Yorktown Heights, NY, USA Das, S. ; de Souza, P.V. ; Jelinek, F. ; Katz, S. ; Mercer, R.L. ; Picheny, M.

This publication has 7 references indexed in Scilit: