A Practical Approach for Representing Context and for Performing Word Sense Disambiguation Using Neural Networks
- 1 September 1991
- journal article
- Published by MIT Press in Neural Computation
- Vol. 3 (3) , 293-309
- https://doi.org/10.1162/neco.1991.3.3.293
Abstract
Representing and manipulating context information is one of the hardest problems in natural language processing. This paper proposes a method for representing some context information so that the correct meaning for a word in a sentence can be selected. The approach is primarily based on work by Waltz and Pollack (1985, 1984), who emphasized neutrally plausible systems. By contrast this paper focuses on computationally feasible methods applicable to full-scale natural language processing systems. There are two key elements: a collection of context vectors defined for every word used by a natural language processing system, and a context algorithm that computes a dynamic context vector at any position in a body of text. Once the dynamic context vector has been computed it is easy to choose among competing meanings for a word. This choice of definitions is essentially a neural network computation, and neural network learning algorithms should be able to improve such choices. Although context vectors do not represent all context information, their use should improve those full-scale systems that have avoided context as being too difficult to deal with. Good candidates for full-scale context vector implementations are machine translation systems and Japanese word processors. A main goal of this paper is to encourage such large-scale implementations and tests of context vector approaches. A variety of interesting directions for research in natural language processing and machine learning will be possible once a full set of context vectors has been created. In particular the development of more powerful context algorithms will be an important topic for future research.Keywords
This publication has 14 references indexed in Scilit:
- Illusory conjunctions in the perception of objectsPublished by Elsevier ,2004
- Perceptron-based learning algorithmsIEEE Transactions on Neural Networks, 1990
- The Design and Implementation of Marker-passing SystemsConnection Science, 1989
- Types and tokens in visual letter perception.Journal of Experimental Psychology: Human Perception and Performance, 1989
- Semantic interpretation and ambiguityArtificial Intelligence, 1988
- Repetition blindness: Type recognition without token individuationCognition, 1987
- Massively parallel parsing: A strongly interactive model of natural language interpretationCognitive Science, 1985
- Making preferences more activeArtificial Intelligence, 1978
- Distinctive features, categorical perception, and probability learning: Some applications of a neural model.Psychological Review, 1977
- A preferential, pattern-seeking, Semantics for natural language inferenceArtificial Intelligence, 1975