WEIRD

1 April 1979

journal article
Published by Association for Computing Machinery (ACM) in ACM SIGIR Forum

Vol. 13 (4), 32-50
https://doi.org/10.1145/1095366.1095368

Abstract

WEIRD is an automatic document retrieval system designed and implemented at Syracuse University, which attempts to advance the art of computerized retrieval from word-matching to judging conceptual similarity. WEIRD uses a vector space model to represent the relations among terms and documents. Items in the space are located according to their "meaning", which is their proximity to all other items in the data base as measured by co-occurrence frequencies. This is done without manipulating large matrices. The dimensions of the space are not used to define relations; items are defined solely by their position relative to the other items. Retrieval is determined by Euclidean distance from the plotted query. In the first section of the paper the basic characteristics of WEIRD are described. Second, the results of a preliminary evaluation are reported. Alternatives for further development of WEIRD are then considered.
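
The last retrieval step described in the abstract, ranking by Euclidean distance from the plotted query, can be illustrated with a minimal sketch. This is not WEIRD's implementation. The term coordinates in TERM_POSITIONS are invented, the rule that places a document or query at the centroid of its terms is an assumption made only for illustration, and the names centroid, rank, and DOCUMENTS are hypothetical. The sketch does not model how WEIRD derives positions from co-occurrence frequencies.

```python
import math

# Hypothetical 2-D term positions. In WEIRD these would be derived from
# co-occurrence frequencies; that step is NOT modeled here.
TERM_POSITIONS = {
    "retrieval": (0.0, 0.0),
    "search": (1.0, 0.0),
    "index": (0.0, 1.0),
    "vector": (4.0, 4.0),
    "matrix": (5.0, 4.0),
}


def centroid(terms):
    """Place an item at the mean position of its known terms (an assumption)."""
    points = [TERM_POSITIONS[t] for t in terms if t in TERM_POSITIONS]
    if not points:
        raise ValueError("none of the terms has a known position")
    dims = len(points[0])
    return tuple(sum(p[i] for p in points) / len(points) for i in range(dims))


def rank(query_terms, documents):
    """Return document ids ordered by Euclidean distance from the query."""
    query_point = centroid(query_terms)
    distances = {
        doc_id: math.dist(query_point, centroid(terms))
        for doc_id, terms in documents.items()
    }
    return sorted(distances, key=distances.get)


# Hypothetical document collection: id -> index terms.
DOCUMENTS = {
    "d1": ["retrieval", "search"],
    "d2": ["vector", "matrix"],
    "d3": ["index", "vector"],
}

if __name__ == "__main__":
    print(rank(["search", "index"], DOCUMENTS))
```

In this sketch the nearest documents come first, so a document can rank highly without sharing any word with the query, provided its terms lie close to the query's terms in the space.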

This publication has 14 references indexed in Scilit:

Automatic ranked output from boolean searches in SIRE
Journal of the American Society for Information Science, 1977
Operations Research Applied to Document Indexing and Retrieval Decisions
Journal of the ACM, 1977
An n-dimensional retrieval model
Journal of the American Society for Information Science, 1976
A probabilistic approach to automatic keyword indexing. Part I. On the Distribution of Specialty Words in a Technical Literature
Journal of the American Society for Information Science, 1975
An historical note on the use of word-frequency contiguities in content analysis
Computers and the Humanities, 1974
Theoretical foundations of thesaurus-construction and some methodological considerations for thesaurus-updating
Journal of the American Society for Information Science, 1973
A highly associative document retrieval system
Journal of the American Society for Information Science, 1970
Expected search length: A single measure of retrieval effectiveness based on the weak ordering action of retrieval systems
American Documentation, 1968
Semantic Road Maps for Literature Searchers
Journal of the ACM, 1961
On Relevance, Probabilistic Indexing and Information Retrieval
Journal of the ACM, 1960