MedlineRanker: flexible ranking of biomedical literature
Open Access
- 7 May 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (suppl_2) , W141-W146
- https://doi.org/10.1093/nar/gkp353
Abstract
The biomedical literature is represented by millions of abstracts available in the Medline database. These abstracts can be queried with the PubMed interface, which provides a keyword-based Boolean search engine. This approach shows limitations in the retrieval of abstracts related to very specific topics, as it is difficult for a non-expert user to find all of the most relevant keywords related to a biomedical topic. Additionally, when searching for more general topics, the same approach may return hundreds of unranked references. To address these issues, text mining tools have been developed to help scientists focus on relevant abstracts. We have implemented the MedlineRanker webserver, which allows a flexible ranking of Medline for a topic of interest without expert knowledge. Given some abstracts related to a topic, the program deduces automatically the most discriminative words in comparison to a random selection. These words are used to score other abstracts, including those from not yet annotated recent publications, which can be then ranked by relevance. We show that our tool can be highly accurate and that it is able to process millions of abstracts in a practical amount of time. MedlineRanker is free for use and is available at http://cbdm.mdc-berlin.de/tools/medlineranker.Keywords
This publication has 15 references indexed in Scilit:
- MScanner: a classifier for retrieving Medline citationsBMC Bioinformatics, 2008
- PubMed related articles: a probabilistic topic-based model for content similarityBMC Bioinformatics, 2007
- Biomedical knowledge navigation by literature clusteringJournal of Biomedical Informatics, 2007
- Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articlesBMC Medical Informatics and Decision Making, 2007
- Text similarity: an alternative way to search MEDLINEBioinformatics, 2006
- Protein annotation by EBIMedNature Biotechnology, 2006
- GoPubMed: exploring PubMed with the Gene OntologyNucleic Acids Research, 2005
- Ranking the whole MEDLINE database according to a large training set using text indexingBMC Bioinformatics, 2005
- ask MEDLINE: a free-text, natural language query tool for MEDLINE/PubMedBMC Medical Informatics and Decision Making, 2005
- XplorMed: a tool for exploring MEDLINE abstractsTrends in Biochemical Sciences, 2001