EXTRACTION OF GENE-DISEASE RELATIONS FROM MEDLINE USING DOMAIN DICTIONARIES AND MACHINE LEARNING
- 1 December 2005
- proceedings article
- Published by World Scientific Pub Co Pte Ltd
Abstract
We describe a system that extracts gene-disease relations from MEDLINE. We constructed a dictionary of disease and gene names from six public databases and extracted relation candidates by dictionary matching. Because dictionary matching produces a large number of false positives, we developed a machine learning-based named entity recognition (NER) method to filter out false recognitions of disease and gene names. We found that the performance of relation extraction depends heavily on the performance of NER filtering, and that the filtering improves the precision of relation extraction by 26.7% at the cost of a small reduction in recall.
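
The two-stage idea in the abstract (dictionary matching to propose candidates, then a filter to reject false name recognitions) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy dictionaries, the function names, and above all `looks_like_gene`, which is a crude uppercase heuristic standing in for the learned NER filter. The abstract does not specify the classifier or its features, so this should not be read as the authors' method.

```python
import re
from itertools import product

# Toy dictionaries (hypothetical entries; the paper compiles its
# dictionary from six public databases, which are not reproduced here).
GENE_DICT = {"BRCA1", "TP53", "CAT"}
DISEASE_DICT = {"breast cancer", "anemia"}


def match_terms(sentence, dictionary):
    """Return sorted (start, end, term) tuples for case-insensitive whole-word hits."""
    hits = []
    for term in dictionary:
        pattern = r"\b" + re.escape(term) + r"\b"
        for m in re.finditer(pattern, sentence, re.IGNORECASE):
            hits.append((m.start(), m.end(), term))
    return sorted(hits)


def candidate_relations(sentence):
    """Pair every gene hit with every disease hit in the same sentence."""
    genes = match_terms(sentence, GENE_DICT)
    diseases = match_terms(sentence, DISEASE_DICT)
    return list(product(genes, diseases))


def looks_like_gene(sentence, hit):
    """Stand-in for the learned NER filter (assumption): keep a gene hit
    only if its surface form in the text is written in uppercase."""
    start, end, _ = hit
    return sentence[start:end].isupper()


def filtered_relations(sentence):
    """Keep only candidates whose gene mention passes the filter."""
    return [
        (gene[2], disease[2])
        for gene, disease in candidate_relations(sentence)
        if looks_like_gene(sentence, gene)
    ]


sentence = "The cat model showed that BRCA1 mutations raise breast cancer risk."
candidates = candidate_relations(sentence)
kept = filtered_relations(sentence)
```

In this toy sentence, case-insensitive matching proposes both "cat" and "BRCA1" as gene mentions, so two candidate pairs are produced; the filter discards the pair built on the ordinary word "cat". That mirrors the trade-off the abstract reports: filtering raises precision, while a filter that is too strict would also drop genuine mentions and lower recall.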