Fishing new proteins in the twilight zone of genomes: The test case of outer membrane proteins in Escherichia coli K12, Escherichia coli O157:H7, and other Gram‐negative bacteria
Open Access
- 1 June 2003
- journal article
- Published by Wiley in Protein Science
- Vol. 12 (6) , 1158-1168
- https://doi.org/10.1110/ps.0223603
Abstract
We address the problem of clustering the whole protein content of genomes into three different categories—globular, all‐α, and all‐β membrane proteins—with the aim of fishing new membrane proteins in the pool of nonannotated proteins (twilight zone). The focus is then mainly on outer membrane proteins. This is performed by using an integrated suite of programs (Hunter) specifically developed for predicting the occurrence of signal peptides in proteins of Gram‐negative bacteria and the topography of all‐α and all‐β membrane proteins. Hunter is tested on the well and partially annotated proteins (2160 and 760, respectively) of Escherichia coli K 12 scoring as high as 95.6% in the correct assignment of each chain to the category. Of the remaining 1253 nonannotated sequences, 1099 are predicted globular, 136 are all‐α, and 18 are all‐β membrane proteins. In Escherichia coli 0157:H7 we filtered 1901 nonannotated proteins. Our analysis classifies 1564 globular chains, 327 inner membrane proteins, and 10 outer membrane proteins. With Hunter, new membrane proteins are added to the list of putative membrane proteins of Gram‐negative bacteria. The content of outer membrane proteins per genome (nine are analyzed) ranges from 1.5% to 2.4%, and it is one order of magnitude lower than that of inner membrane proteins. The finding is particularly relevant when it is considered that this is the first large‐scale analysis based on validated tools that can predict the content of outer membrane proteins in a genome and can allow cross‐comparison of the same protein type between different species.Keywords
This publication has 36 references indexed in Scilit:
- Comparative assessment of large-scale data sets of protein–protein interactionsNature, 2002
- Enzyme Function Less Conserved than AnticipatedJournal of Molecular Biology, 2002
- Comparing function and structure between entire proteomesProtein Science, 2001
- Prediction of the transmembrane regions of β‐barrel membrane proteins with a neural network‐based predictorProtein Science, 2001
- Automated discovery of structural signatures of protein fold and functionJournal of Molecular Biology, 2001
- Predicting transmembrane protein topology with a hidden markov model: application to complete genomes11Edited by F. CohenJournal of Molecular Biology, 2001
- Enhanced genome annotation using structural profiles in the program 3D-PSSM 1 1Edited by J. ThorntonJournal of Molecular Biology, 2000
- High-resolution structure of the OmpA membrane domainJournal of Molecular Biology, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Transmembrane helices predicted at 95% accuracyProtein Science, 1995