WEIRD
- 1 April 1979
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGIR Forum
- Vol. 13 (4), 32-50
- https://doi.org/10.1145/1095366.1095368
Abstract
WEIRD is an automatic document retrieval system designed and implemented at Syracuse University, which attempts to advance the art of computerized retrieval from word-matching to judging conceptual similarity. WEIRD uses a vector space model to represent the relations among terms and documents. Items in the space are located according to their "meaning", which is their proximity to all other items in the data base as measured by co-occurrence frequencies. This is done without manipulating large matrices. The dimensions of the space are not used to define relations; items are defined solely by their position relative to the other items. Retrieval is determined by Euclidean distance from the plotted query. In the first section of the paper the basic characteristics of WEIRD are described. Second, the results of a preliminary evaluation are reported. Alternatives for further development of WEIRD are then considered.
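The retrieval step the abstract describes, ranking items by Euclidean distance from a plotted query, can be illustrated with a minimal sketch. The abstract does not say how WEIRD computes item positions, so the coordinates, the item names, and the function `rank_by_distance` below are hypothetical and are not taken from the paper.

```python
import math

def rank_by_distance(query, positions):
    """Return item names ordered from nearest to farthest from the query point.

    `positions` maps an item name to its coordinates in the space.
    Only relative distances matter here; the axes carry no meaning of their own.
    """
    return sorted(positions, key=lambda name: math.dist(query, positions[name]))

# Hypothetical 2-D positions, invented for illustration only.
positions = {
    "doc_a": (0.0, 1.0),
    "doc_b": (3.0, 4.0),
    "doc_c": (1.0, 1.0),
}
query = (0.0, 0.0)

print(rank_by_distance(query, positions))
```

Because the ordering depends only on distances between points, rotating or translating the whole space would leave the ranking unchanged, which is consistent with the abstract's statement that the dimensions themselves are not used to define relations.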
This publication has 14 references indexed in Scilit:
- Automatic ranked output from boolean searches in SIRE. Journal of the American Society for Information Science, 1977
- Operations Research Applied to Document Indexing and Retrieval Decisions. Journal of the ACM, 1977
- An n-dimensional retrieval model. Journal of the American Society for Information Science, 1976
- A probabilistic approach to automatic keyword indexing. Part I. On the Distribution of Specialty Words in a Technical Literature. Journal of the American Society for Information Science, 1975
- An historical note on the use of word-frequency contiguities in content analysis. Computers and the Humanities, 1974
- Theoretical foundations of thesaurus-construction and some methodological considerations for thesaurus-updating. Journal of the American Society for Information Science, 1973
- A highly associative document retrieval system. Journal of the American Society for Information Science, 1970
- Expected search length: A single measure of retrieval effectiveness based on the weak ordering action of retrieval systems. American Documentation, 1968
- Semantic Road Maps for Literature Searchers. Journal of the ACM, 1961
- On Relevance, Probabilistic Indexing and Information Retrieval. Journal of the ACM, 1960