PIE: an online prediction system for protein-protein interactions from text

Open Access

19 May 2008

journal article
research article
Published by Oxford University Press (OUP) in Nucleic Acids Research

Vol. 36 (Web Server) , W411-W415
https://doi.org/10.1093/nar/gkn281

Abstract

Protein–protein interaction (PPI) extraction has been an important research topic in bio-text mining area, since the PPI information is critical for understanding biological processes. However, there are very few open systems available on the Web and most of the systems focus on keyword searching based on predefined PPIs. PIE (Protein Interaction information Extraction system) is a configurable Web service to extract PPIs from literature, including user-provided papers as well as PubMed articles. After providing abstracts or papers, the prediction results are displayed in an easily readable form with essential, yet compact features. The PIE interface supports more features such as PDF file extraction, PubMed search tool and network communication, which are useful for biologists and bio-system developers. The PIE system utilizes natural language processing techniques and machine learning methodologies to predict PPI sentences, which results in high precision performance for Web users. PIE is freely available at http://bi.snu.ac.kr/pie/ .

Keywords

This publication has 9 references indexed in Scilit:

Negation of protein–protein interactions: analysis and extraction
Bioinformatics, 2007
CARGO: a web portal to integrate customized biological information
Nucleic Acids Research, 2007
Finding the evidence for protein-protein interactions from PubMed abstracts
Bioinformatics, 2006
Literature mining for the biologist: from information retrieval to biological discovery
Nature Reviews Genetics, 2006
A survey of current work in biomedical text mining
Briefings in Bioinformatics, 2005
Text-mining and information-retrieval services for molecular biology
Genome Biology, 2005
Content-rich biological network constructed by mining PubMed abstracts
BMC Bioinformatics, 2004
A gene network for navigating the literature
Nature Genetics, 2004
GENIA corpus—a semantically annotated corpus for bio-textmining
Bioinformatics, 2003