An investigation of content representation using text grammars
- 2 January 1993
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Information Systems
- Vol. 11 (1) , 51-75
- https://doi.org/10.1145/151480.151490
Abstract
We extend prior work on a model for natural language text representation and retrieval using a linguistic device called text grammar. We demonstrate the value of this approach in accessing relevant items from a collection of empirical abstracts in a medical domain. The advantage, when compared to traditional keyword retrieval, is that this approach is a significant move towards knowledge representation and retrieval. Text representation in this model includes keywords and their conceptual roles in the text. In particular, it involves extracting TOPIC predicates representing the research issue addressed and DESIGN predicates representing important methodological features of the empirical study. Preliminary experimentation shows, first, that keywords exhibit a variety of text-grammar roles in a text database. Second, as intuitively expected, retrieval using TOPIC predicates identifies a smaller subset of texts than Boolean retrieval does. These empirical results, along with the theoretical work, indicate that the proposed representation and retrieval strategies have significant potential. Finally, EMPIRICIST, a prototype system, is described. In it, the text representation predicates are implemented as a network, while retrieval is through constrained spreading activation strategies.
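The abstract gives no detail on how EMPIRICIST builds its network or which constraints it applies, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a toy collection (`TEXTS`), a two-way link between each (role, keyword) predicate node and the texts containing it, and three illustrative constraints (a role filter, a hop limit, and an energy threshold). All names, data, and parameter values are hypothetical.

```python
from collections import defaultdict

# Hypothetical toy collection: keywords grouped by text-grammar role.
TEXTS = {
    "doc1": {"TOPIC": {"aspirin", "stroke"}, "DESIGN": {"randomized"}},
    "doc2": {"TOPIC": {"stroke"}, "DESIGN": {"aspirin", "cohort"}},
    "doc3": {"TOPIC": {"diabetes"}, "DESIGN": {"randomized"}},
}


def boolean_match(texts, keyword):
    """Plain keyword retrieval: match the keyword in any role."""
    return {
        doc_id
        for doc_id, roles in texts.items()
        if any(keyword in keywords for keywords in roles.values())
    }


def build_network(texts):
    """Link each (role, keyword) node to the texts containing it, both ways."""
    graph = defaultdict(set)
    for doc_id, roles in texts.items():
        for role, keywords in roles.items():
            for keyword in keywords:
                node = (role, keyword)
                graph[node].add(doc_id)
                graph[doc_id].add(node)
    return graph


def spread(graph, seeds, allowed_roles, decay=0.5, max_hops=1, threshold=0.1):
    """Spread activation from seed nodes under three assumed constraints:
    a role filter, a hop limit, and a minimum energy threshold."""
    activation = defaultdict(float)
    frontier = {seed: 1.0 for seed in seeds if seed in graph}
    for _ in range(max_hops):
        next_frontier = defaultdict(float)
        for node, energy in frontier.items():
            passed = energy * decay
            if passed < threshold:
                continue  # too little energy left to propagate
            for neighbour in graph[node]:
                # Predicate nodes are tuples; skip roles outside the constraint.
                if isinstance(neighbour, tuple) and neighbour[0] not in allowed_roles:
                    continue
                next_frontier[neighbour] += passed
        for node, energy in next_frontier.items():
            activation[node] += energy
        frontier = next_frontier
    # Report only text nodes (plain string identifiers), not predicate nodes.
    return {node: a for node, a in activation.items() if isinstance(node, str)}


if __name__ == "__main__":
    network = build_network(TEXTS)
    print(boolean_match(TEXTS, "aspirin"))
    print(spread(network, [("TOPIC", "aspirin")], {"TOPIC"}))
```

In this toy data, "aspirin" appears as a TOPIC keyword in one text and as a DESIGN keyword in another, so the keyword match returns both texts while activation seeded at the TOPIC predicate reaches only the first. That mirrors, in miniature, the abstract's observation that TOPIC-predicate retrieval identifies a smaller subset than Boolean retrieval. Raising `max_hops` lets activation flow through shared TOPIC keywords to related texts with lower scores.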