Finding themes in Medline documents - probabilistic similarity search
- 1 January 2000
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 183-192
- https://doi.org/10.1109/adl.2000.848381
Abstract
Large on-line document databases, such as Medine, pose a major challenge of retrieving the few documents most relevant to the user's needs, while multimizing the return rate of nonrelevant documents. Retrieval of documents similar to a user provided example document is a promising query paradigm towards meeting this goal. We present a new theme-based probabilistic approach for finding documents relevant to a given query document, and summarizing their contents. Preliminary experiments conducted over a subset of Medline documents related to AIDS demonstrate the effectiveness of our approach.Keywords
This publication has 14 references indexed in Scilit:
- Information fusion in the context of multi-document summarizationPublished by Association for Computational Linguistics (ACL) ,1999
- Combining automatic and manual index representations in probabilistic retrievalJournal of the American Society for Information Science, 1995
- Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted RetrievalPublished by Springer Nature ,1994
- Distributional clustering of English wordsPublished by Association for Computational Linguistics (ACL) ,1993
- Scatter/Gather: a cluster-based approach to browsing large document collectionsPublished by Association for Computing Machinery (ACM) ,1992
- Improving the retrieval of information from external sourcesBehavior Research Methods, Instruments & Computers, 1991
- Indexing by latent semantic analysisJournal of the American Society for Information Science, 1990
- Inference networks for document retrievalPublished by Association for Computing Machinery (ACM) ,1989
- Using latent semantic analysis to improve access to textual informationPublished by Association for Computing Machinery (ACM) ,1988
- A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov ChainsThe Annals of Mathematical Statistics, 1970