EBIMed—text crunching to gather facts for proteins from Medline
- 15 January 2007
- journal article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (2) , e237-e244
- https://doi.org/10.1093/bioinformatics/btl302
Abstract
Summary: To allow efficient and systematic retrieval of statements from Medline we have developed EBIMed, a service that combines document retrieval with co-occurrence-based analysis of Medline abstracts. Upon keyword query, EBIMed retrieves the abstracts from EMBL-EBI's installation of Medline and filters for sentences that contain biomedical terminology maintained in public bioinformatics resources. The extracted sentences and terminology are used to generate an overview table on proteins, Gene Ontology (GO) annotations, drugs and species used in the same biological context. All terms in retrieved abstracts and extracted sentences are linked to their entries in biomedical databases. We assessed the quality of the identification of terms and relations in the retrieved sentences. More than 90% of the protein names found indeed represented a protein. According to the analysis of four protein–protein pairs from the Wnt pathway we estimated that 37% of the statements containing such a pair mentioned a meaningful interaction and clarified the interaction of Dkk with LRP. We conclude that EBIMed improves access to information where proteins and drugs are involved in the same biological process, e.g. statements with GO annotations of proteins, protein–protein interactions and effects of drugs on proteins.Availability: Available atSupplementary Data: Supplementary Data are available at Bioinformatics online.Contact: Rebholz@ebi.ac.ukKeywords
This publication has 24 references indexed in Scilit:
- Distributed modules for text annotation and IE applied to the biomedical domainInternational Journal of Medical Informatics, 2006
- Resolving abbreviations to their senses in MedlineBioinformatics, 2005
- A simple approach for protein name identification: prospects and limitsBMC Bioinformatics, 2005
- Facts from Text—Is Text Mining Ready to Deliver?PLoS Biology, 2005
- BioIE: extracting informative sentences from the biomedical literatureBioinformatics, 2005
- Co-occurrence based meta-analysis of scientific texts: retrieving biological relationships between genesBioinformatics, 2005
- Textpresso: An Ontology-Based Information Retrieval and Extraction System for Biological LiteraturePLoS Biology, 2004
- GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway dataJournal of Biomedical Informatics, 2003
- Involvement of NLK and Sox11 in neural induction in Xenopus developmentGenes to Cells, 2002
- Regulation of β-Catenin Signaling by the B56 Subunit of Protein Phosphatase 2AScience, 1999