Text processing through Web services: calling Whatizit
Top Cited Papers
Open Access
- 15 November 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (2) , 296-298
- https://doi.org/10.1093/bioinformatics/btm557
Abstract
Motivation: Text-mining (TM) solutions are developing into efficient services to researchers in the biomedical research community. Such solutions have to scale with the growing number and size of resources (e.g. available controlled vocabularies), with the amount of literature to be processed (e.g. about 17 million documents in PubMed) and with the demands of the user community (e.g. different methods for fact extraction). These demands motivated the development of a server-based solution for literature analysis. Whatizit is a suite of modules that analyse text for contained information, e.g. any scientific publication or Medline abstracts. Special modules identify terms and then link them to the corresponding entries in bioinformatics databases such as UniProtKb/Swiss-Prot data entries and gene ontology concepts. Other modules identify a set of selected annotation types like the set produced by the EBIMed analysis pipeline for proteins. In the case of Medline abstracts, Whatizit offers access to EBI's in-house installation via PMID or term query. For large quantities of the user's own text, the server can be operated in a streaming mode (http://www.ebi.ac.uk/webservices/whatizit). Contact: rebholz@ebi.ac.ukKeywords
This publication has 8 references indexed in Scilit:
- EBIMed—text crunching to gather facts for proteins from MedlineBioinformatics, 2007
- Taverna: a tool for building and running workflows of servicesNucleic Acids Research, 2006
- Distributed modules for text annotation and IE applied to the biomedical domainInternational Journal of Medical Informatics, 2006
- Annotation and disambiguation of semantic types in biomedical textPublished by Association for Computational Linguistics (ACL) ,2006
- Implementing the iHOP concept for navigation of biomedical literatureBioinformatics, 2005
- Overview of BioCreAtIvE task 1B: normalized gene listsBMC Bioinformatics, 2005
- GENIES: a natural-language processing system for the extraction of molecular pathways from journal articlesBioinformatics, 2001
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000