LocTree2 predicts localization for all domains of life
Open Access
- 3 September 2012
- journal article
- conference paper
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 28 (18) , i458-i465
- https://doi.org/10.1093/bioinformatics/bts390
Abstract
Motivation: Subcellular localization is one aspect of protein function. Despite advances in high-throughput imaging, localization maps remain incomplete. Several methods accurately predict localization, but many challenges remain to be tackled. Results: In this study, we introduced a framework to predict localization in life's three domains, including globular and membrane proteins (3 classes for archaea; 6 for bacteria and 18 for eukaryota). The resulting method, LocTree2, works well even for protein fragments. It uses a hierarchical system of support vector machines that imitates the cascading mechanism of cellular sorting. The method reaches high levels of sustained performance (eukaryota: Q18=65%, bacteria: Q6=84%). LocTree2 also accurately distinguishes membrane and non-membrane proteins. In our hands, it compared favorably with top methods when tested on new data. Availability: Online through PredictProtein (predictprotein.org); as standalone version at http://www.rostlab.org/services/loctree2. Contact: localization@rostlab.org Supplementary Information: Supplementary data are available at Bioinformatics online.This publication has 50 references indexed in Scilit:
- LocDB: experimental annotations of localization for Homo sapiens and Arabidopsis thalianaNucleic Acids Research, 2010
- PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotesBioinformatics, 2010
- MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization predictionBMC Bioinformatics, 2009
- Prediction of membrane-protein topology from first principlesProceedings of the National Academy of Sciences, 2008
- WoLF PSORT: protein localization predictorNucleic Acids Research, 2007
- Feature-based prediction of non-classical and leaderless protein secretionProtein Engineering, Design and Selection, 2004
- Automatic prediction of protein functionCellular and Molecular Life Sciences, 2003
- Prediction of protein cellular attributes using pseudo‐amino acid compositionProteins-Structure Function and Bioinformatics, 2001
- The Protein Data BankNucleic Acids Research, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997