Overview of BioCreative II gene mention recognition
Open Access
- 1 September 2008
- journal article
- research article
- Published by Springer Nature in Genome Biology
- Vol. 9 (S2), S2
- https://doi.org/10.1186/gb-2008-9-s2-s2
Abstract
Nineteen teams presented results for the Gene Mention Task at the BioCreative II Workshop. In this task participants designed systems to identify substrings in sentences corresponding to gene name mentions. A variety of different methods were used and the results varied with a highest achieved F1 score of 0.8721. Here we present brief descriptions of all the methods used and a statistical analysis of the results. We also demonstrate that, by combining the results from all submissions, an F score of 0.9066 is feasible, and furthermore that the best result makes use of the lowest scoring submissions.
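For readers unfamiliar with the metric quoted above, the following is a minimal sketch of how an F1 score can be computed for span-level mention recognition. It assumes exact matching of (sentence id, start, end) spans; the official task scoring may differ in its matching rules, and the function name `span_f1` and the toy data are hypothetical, not taken from the paper.

```python
def span_f1(gold, predicted):
    """Return (precision, recall, F1) for two collections of mention spans.

    Each span is a hashable tuple such as (sentence_id, start, end).
    Exact-match scoring is an assumption made for illustration only.
    """
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy data (hypothetical): three gold mentions, four predicted mentions.
gold_spans = {("s1", 0, 4), ("s1", 10, 15), ("s2", 3, 8)}
predicted_spans = {("s1", 0, 4), ("s2", 3, 8), ("s2", 20, 25), ("s3", 1, 2)}

precision, recall, f1 = span_f1(gold_spans, predicted_spans)
print(f"P={precision:.4f} R={recall:.4f} F1={f1:.4f}")
```

In this toy case two of the four predictions match a gold span, so precision is 2/4, recall is 2/3, and F1 is 4/7.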