Text Detective: a rule-based system for gene annotation in biomedical texts
Open Access
- 24 May 2005
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 6 (S1) , S10
- https://doi.org/10.1186/1471-2105-6-s1-s10
Abstract
Background: The identification of mentions of gene or gene products in biomedical texts is a critical step in the development of text mining applications in biosciences. The complexity and ambiguity of gene nomenclature makes this a very difficult task. Methods: Here we present a novel approach based on a combination of carefully designed rules and several lexicons of biological concepts, implemented in the Text Detective system. Text Detective is able to normalize the results of gene mentions found by offering the appropriate database reference. Results: In BioCreAtIvE evaluation, Text Detective achieved results of 84% precision, 71% recall for task 1A, and 79% precision, 71% recall for mouse genes in task 1B.Keywords
This publication has 10 references indexed in Scilit:
- A critical assessment of text mining methods in molecular biology. Proceedings of a workshop. March 28-31, 2004. Granada, Spain.2005
- Recognizing names in biomedical texts: a machine learning approachBioinformatics, 2004
- Tough MiningPLoS Biology, 2003
- Extracting synonymous gene and protein terms from biological literatureBioinformatics, 2003
- Tagging gene and protein names in biomedical textBioinformatics, 2002
- What's in a name?Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- An Introduction to Support Vector Machines and Other Kernel-based Learning MethodsPublished by Cambridge University Press (CUP) ,2000
- An Algorithm that Learns What's in a NameMachine Learning, 1999
- Toward information extraction: identifying protein names from biological papers.1998
- The Nature of Statistical Learning TheoryPublished by Springer Nature ,1995