Frontiers of biomedical text mining: current progress
Open Access
- 20 June 2007
- journal article
- review article
- Published by Oxford University Press (OUP) in Briefings in Bioinformatics
- Vol. 8 (5), 358-375
- https://doi.org/10.1093/bib/bbm045
Abstract
It is now almost 15 years since the publication of the first paper on text mining in the genomics domain, and decades since the first paper on text mining in the medical domain. Enormous progress has been made in the areas of information retrieval, evaluation methodologies and resource construction. Some problems, such as abbreviation-handling, can essentially be considered solved problems, and others, such as identification of gene mentions in text, seem likely to be solved soon. However, a number of problems at the frontiers of biomedical text mining continue to present interesting challenges and opportunities for great improvements and interesting research. In this article we review the current state of the art in biomedical text mining or ‘BioNLP’ in general, focusing primarily on papers published within the past year.