Mining the Biomedical Literature in the Genomic Era: An Overview
- 1 December 2003
- journal article
- review article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 10 (6) , 821-855
- https://doi.org/10.1089/106652703322756104
Abstract
The past decade has seen a tremendous growth in the amount of experimental and computational biomedical data, specifically in the areas of genomics and proteomics. This growth is accompanied by an accelerated increase in the number of biomedical publications discussing the findings. In the last few years, there has been a lot of interest within the scientific community in literature-mining tools to help sort through this abundance of literature and find the nuggets of information most relevant and useful for specific analysis tasks. This paper provides a road map to the various literature-mining methods, both in general and within bioinformatics. It surveys the disciplines involved in unstructured-text analysis, categorizes current work in biomedical literature mining with respect to these disciplines, and provides examples of text analysis methods applied towards meeting some of the current challenges in bioinformatics.Keywords
This publication has 48 references indexed in Scilit:
- The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003Nucleic Acids Research, 2003
- BIND: the Biomolecular Interaction Network DatabaseNucleic Acids Research, 2003
- Automatic scientific text classification using local patternsACM SIGKDD Explorations Newsletter, 2002
- Rule-based extraction of experimental evidence in the biomedical domainACM SIGKDD Explorations Newsletter, 2002
- The frame-based module of the SUISEKI information extraction systemIEEE Intelligent Systems and their Applications, 2002
- Predicting Subcellular Localization of Proteins Based on their N-terminal Amino Acid SequenceJournal of Molecular Biology, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- An information measure of retrieval performanceInformation Systems, 1992
- Indexing by latent semantic analysisJournal of the American Society for Information Science, 1990
- A THEORETICAL BASIS FOR THE USE OF CO‐OCCURRENCE DATA IN INFORMATION RETRIEVALJournal of Documentation, 1977