Recent Studies in Automatic Text Analysis and Document Retrieval
- 1 April 1973
- journal article
- Published by Association for Computing Machinery (ACM) in Journal of the ACM
- Vol. 20 (2) , 258-278
- https://doi.org/10.1145/321752.321757
Abstract
Many experts in mechanized text processing now agree that useful automatic language analysis procedures are largely unavailable and that the existing linguistic methodologies generally produce disappointing results. An attempt is made in the present study to identify those automatic procedures which appear most effective as a replacement for the missing language analysis. A series of computer experiments is described, designed to simulate a conventional document retrieval environment. It is found that a simple duplication, by automatic means, of the standard, manual document indexing and retrieval operations will not produce acceptable output results. New mechanized approaches to document handling are proposed, including document ranking methods, automatic dictionary and word list generation, and user feedback searches. It is shown that the fully automatic methodology is superior in effectiveness to the conventional procedures in normal use.Keywords
This publication has 13 references indexed in Scilit:
- The function of semantics in automated language processingPublished by Association for Computing Machinery (ACM) ,1971
- Automatic parsing for content analysisCommunications of the ACM, 1970
- Automatic Text AnalysisScience, 1970
- New Methods in Automatic ExtractingJournal of the ACM, 1969
- A comparison between manual and automatic indexing methodsAmerican Documentation, 1969
- Computer Evaluation of Indexing and Text ProcessingJournal of the ACM, 1968
- STUDY AND TEST OF A METHODOLOGY FOR LABORATORY EVALUATION OF MESSAGE RETRIEVAL SYSTEMS.Published by American Psychological Association (APA) ,1966
- Statistical association methods for mechanized documentation :Published by National Institute of Standards and Technology (NIST) ,1965
- Automatic indexing :Published by National Institute of Standards and Technology (NIST) ,1965
- Searching Natural Language Text by ComputerScience, 1960