Abstract
This paper describes a massively parallel system for word recognition. Based on the connectionist network model adopted from cognitive science and artificial intelligence, the system consists of a large number of simple neuron-like processing units, or nodes, which represent words, phonetic segments, or phonetic features. The computation consists of constant updating of activation levels of all nodes, resulting from the excitatory links and inhibitory links between the nodes. Input to the system consists of frame-by-frame scores of similarity to a set of pre-defined spectral filters, which represents the set of phonetic segments necessary for distinguishing between words in the vocabulary. These similarity scores are combined into phonetic feature indexes for each frame of speech as input to the feature nodes in the network. A linguistic knowledge base is built into the network, allowing both data-driven processing and top-down prediction to cooperate or compete in working toward the correct lexical hypothesis.

This publication has 4 references indexed in Scilit: