MITOPRED: a genome-scale method for prediction of nucleus-encoded mitochondrial proteins
Open Access
- 22 March 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 20 (11) , 1785-1794
- https://doi.org/10.1093/bioinformatics/bth171
Abstract
Motivation: Currently available methods for the prediction of subcellular location of mitochondrial proteins rely largely on the presence of mitochondrial targeting signals in the protein sequences. However, a large fraction of mitochondrial proteins lack such signals, making those tools ineffective for genome-scale prediction of mitochondria-targeted proteins. Here, we propose a method for genome-scale prediction of nucleus-encoded mitochondrial proteins. The new method, MITOPRED, is based on the Pfam domain occurrence patterns and the amino acid compositional differences between mitochondrial and non-mitochondrial proteins. Results: MITOPRED could predict mitochondrial proteins with 100% specificity at a 44% sensitivity rate and with 67% specificity at 99% sensitivity. Additionally, it was sufficiently robust to predict mitochondrial proteins across different eukaryotic species with similar accuracy. Based on Matthews correlation coefficient measure, the prediction performance of MITOPRED is clearly superior (0.73) to those of the two popular methods TargetP (0.51) and PSORT (0.53). Using this method, we predicted the nucleus-encoded mitochondrial proteins from six complete genomes (three invertebrate, two vertebrate and one plant species) and estimated the total number in each genome. In human, our method estimated the existence of 1362 mitochondrial proteins corresponding to 4.8% of the total proteome. Availability: MITOPRED program is freely accessible at http://mitopred.sdsc.edu. Source code is available on request from the authors. Supplementary information: Training data sets are also available at http://mitopred.sdsc.eduKeywords
This publication has 8 references indexed in Scilit:
- Global analysis of protein localization in budding yeastNature, 2003
- Comparison of the genomes of human and mouse lays the foundation of genome zoologyHuman Molecular Genetics, 2003
- Characterization of the human heart mitochondrial proteomeNature Biotechnology, 2003
- Predicting Protein Cellular Localization Using a Domain Projection MethodGenome Research, 2002
- Subcellular localization of the yeast proteomeGenes & Development, 2002
- A graphic representation of protein sequence and predicting the subcellular locations of prokaryotic proteinsThe International Journal of Biochemistry & Cell Biology, 2002
- Prediction of the subcellular location of prokaryotic proteins based on the hydrophobicity index of amino acidsInternational Journal of Biological Macromolecules, 2001
- Prediction of Protein Subcellular Locations by Incorporating Quasi-Sequence-Order EffectBiochemical and Biophysical Research Communications, 2000