PIE: an online prediction system for protein-protein interactions from text
Open Access
- 19 May 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (Web Server) , W411-W415
- https://doi.org/10.1093/nar/gkn281
Abstract
Protein–protein interaction (PPI) extraction has been an important research topic in bio-text mining area, since the PPI information is critical for understanding biological processes. However, there are very few open systems available on the Web and most of the systems focus on keyword searching based on predefined PPIs. PIE (Protein Interaction information Extraction system) is a configurable Web service to extract PPIs from literature, including user-provided papers as well as PubMed articles. After providing abstracts or papers, the prediction results are displayed in an easily readable form with essential, yet compact features. The PIE interface supports more features such as PDF file extraction, PubMed search tool and network communication, which are useful for biologists and bio-system developers. The PIE system utilizes natural language processing techniques and machine learning methodologies to predict PPI sentences, which results in high precision performance for Web users. PIE is freely available at http://bi.snu.ac.kr/pie/ .Keywords
This publication has 9 references indexed in Scilit:
- Negation of protein–protein interactions: analysis and extractionBioinformatics, 2007
- CARGO: a web portal to integrate customized biological informationNucleic Acids Research, 2007
- Finding the evidence for protein-protein interactions from PubMed abstractsBioinformatics, 2006
- Literature mining for the biologist: from information retrieval to biological discoveryNature Reviews Genetics, 2006
- A survey of current work in biomedical text miningBriefings in Bioinformatics, 2005
- Text-mining and information-retrieval services for molecular biologyGenome Biology, 2005
- Content-rich biological network constructed by mining PubMed abstractsBMC Bioinformatics, 2004
- A gene network for navigating the literatureNature Genetics, 2004
- GENIA corpus—a semantically annotated corpus for bio-textminingBioinformatics, 2003