MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition
Open Access
- 20 January 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 22 (10) , 1158-1165
- https://doi.org/10.1093/bioinformatics/btl002
Abstract
Motivation: Functional annotation of unknown proteins is a major goal in proteomics. A key annotation is the prediction of a protein's subcellular localization. Numerous prediction techniques have been developed, typically focusing on a single underlying biological aspect or predicting a subset of all possible localizations. An important step is taken towards emulating the protein sorting process by capturing and bringing together biologically relevant information, and addressing the clear need to improve prediction accuracy and localization coverage. Results: Here we present a novel SVM-based approach for predicting subcellular localization, which integrates N-terminal targeting sequences, amino acid composition and protein sequence motifs. We show how this approach improves the prediction based on N-terminal targeting sequences, by comparing our method TargetLoc against existing methods. Furthermore, MultiLoc performs considerably better than comparable methods predicting all major eukaryotic subcellular localizations, and shows better or comparable results to methods that are specialized on fewer localizations or for one organism. Availability: Contact:hoeglund@informatik.uni-tuebingen.deKeywords
This publication has 37 references indexed in Scilit:
- Mimicking Cellular Sorting Improves Prediction of Subcellular LocalizationJournal of Molecular Biology, 2005
- PSLpred: prediction of subcellular localization of bacterial proteinsBioinformatics, 2005
- Predicting protein localization in budding YeastBioinformatics, 2004
- Predicting subcellular localization of proteins in a hybridization spaceBioinformatics, 2004
- Automatic prediction of protein functionCellular and Molecular Life Sciences, 2003
- Using Functional Domain Composition and Support Vector Machines for Prediction of Protein Subcellular LocationJournal of Biological Chemistry, 2002
- Predicting Subcellular Localization of Proteins Based on their N-terminal Amino Acid SequenceJournal of Molecular Biology, 2000
- Adaptation of protein surfaces to subcellular location 1 1Edited by F. E. CohenJournal of Molecular Biology, 1998
- Protein transport via amino-terminal targeting sequences: common themes in diverse systems (Review)Molecular Membrane Biology, 1995
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994