Annotation-based inference of transporter function
Open Access
- 1 July 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (13) , i259-i267
- https://doi.org/10.1093/bioinformatics/btn180
Abstract
Motivation: We present a method for inferring and constructing transport reactions for transporter proteins based primarily on the analysis of the names of individual proteins in the genome annotation of an organism. Transport reactions are declarative descriptions of transporter activities, and thus can be manipulated computationally, unlike free-text protein names. Once transporter activities are encoded as transport reactions, a number of computational analyses are possible including database queries by transporter activity; inclusion of transporters into an automatically generated metabolic-map diagram that can be painted with omics data to aid in their interpretation; detection of anomalies in the metabolic and transport networks, such as substrates that are transported into the cell but are not inputs to any metabolic reaction or pathway; and comparative analyses of the transport capabilities of different organisms. Results: On randomly selected organisms, the method achieves precision and recall rates of 0.93 and 0.90, respectively in identifying transporter proteins by name within the complete genome. The method obtains 67.5% accuracy in predicting complete transport reactions; if allowance is made for predictions that are overly general yet not incorrect, reaction prediction accuracy is 82.5%. Availability: The method is implemented as part of PathoLogic, the inference component of the Pathway Tools software. Pathway Tools is freely available to researchers at non-commercial institutions, including source code; a fee applies to commercial institutions. Contact: tomlee@ai.sri.com Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 19 references indexed in Scilit:
- Expanded protein information at SGD: new pages and proteome browserNucleic Acids Research, 2007
- A genome‐scale metabolic reconstruction for Escherichia coli K‐12 MG1655 that accounts for 1260 ORFs and thermodynamic informationMolecular Systems Biology, 2007
- TransportDB: a comprehensive database resource for cytoplasmic membrane transport systems and outer membrane channelsNucleic Acids Research, 2006
- The Pathway Tools cellular overview diagram and Omics ViewerNucleic Acids Research, 2006
- dictyBase, the model organism database for Dictyostelium discoideumNucleic Acids Research, 2006
- MetaCyc: a multiorganism database of metabolic pathways and enzymesNucleic Acids Research, 2006
- TCDB: the Transporter Classification Database for membrane transport protein analyses and informationNucleic Acids Research, 2006
- Prediction of transporter family from protein sequence by support vector machine approachProteins-Structure Function and Bioinformatics, 2005
- Querying and computing with BioCyc databasesBioinformatics, 2005
- Using functional and organizational information to improve genome-wide computational prediction of transcription units on pathway-genome databasesBioinformatics, 2004