A Practical Approach for Representing Context and for Performing Word Sense Disambiguation Using Neural Networks

1 September 1991

journal article
Published by MIT Press in Neural Computation

Vol. 3 (3) , 293-309
https://doi.org/10.1162/neco.1991.3.3.293

Abstract

Representing and manipulating context information is one of the hardest problems in natural language processing. This paper proposes a method for representing some context information so that the correct meaning for a word in a sentence can be selected. The approach is primarily based on work by Waltz and Pollack (1985, 1984), who emphasized neutrally plausible systems. By contrast this paper focuses on computationally feasible methods applicable to full-scale natural language processing systems. There are two key elements: a collection of context vectors defined for every word used by a natural language processing system, and a context algorithm that computes a dynamic context vector at any position in a body of text. Once the dynamic context vector has been computed it is easy to choose among competing meanings for a word. This choice of definitions is essentially a neural network computation, and neural network learning algorithms should be able to improve such choices. Although context vectors do not represent all context information, their use should improve those full-scale systems that have avoided context as being too difficult to deal with. Good candidates for full-scale context vector implementations are machine translation systems and Japanese word processors. A main goal of this paper is to encourage such large-scale implementations and tests of context vector approaches. A variety of interesting directions for research in natural language processing and machine learning will be possible once a full set of context vectors has been created. In particular the development of more powerful context algorithms will be an important topic for future research.

Keywords

This publication has 14 references indexed in Scilit:

Illusory conjunctions in the perception of objects
Published by Elsevier ,2004
Perceptron-based learning algorithms
IEEE Transactions on Neural Networks, 1990
The Design and Implementation of Marker-passing Systems
Connection Science, 1989
Types and tokens in visual letter perception.
Journal of Experimental Psychology: Human Perception and Performance, 1989
Semantic interpretation and ambiguity
Artificial Intelligence, 1988
Repetition blindness: Type recognition without token individuation
Cognition, 1987
Massively parallel parsing: A strongly interactive model of natural language interpretation
Cognitive Science, 1985
Making preferences more active
Artificial Intelligence, 1978
Distinctive features, categorical perception, and probability learning: Some applications of a neural model.
Psychological Review, 1977
A preferential, pattern-seeking, Semantics for natural language inference
Artificial Intelligence, 1975