Transmembrane helices predicted at 95% accuracy
Open Access
- 1 March 1995
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 4 (3) , 521-533
- https://doi.org/10.1002/pro.5560040318
Abstract
We describe a neural network system that predicts the locations of transmembrane helices in integral membrane proteins. By using evolutionary information as input to the network system, the method significantly improved on a previously published neural network prediction method that had been based on single sequence information. The input data were derived from multiple alignments for each position in a window of 13 adjacent residues: amino acid frequency, conservation weights, number of insertions and deletions, and position of the window with respect to the ends of the protein chain. Additional input was the amino acid composition and length of the whole protein. A rigorous cross-validation test on 69 proteins with experimentally determined locations of transmembrane segments yielded an overall two-state per-residue accuracy of 95%. About 94% of all segments were predicted correctly. When applied to known globular proteins as a negative control, the network system incorrectly predicted fewer than 5% of globular proteins as having transmembrane helices. The method was applied to all 269 open reading frames from the complete yeast VIII chromosome. For 59 of these, at least two transmembrane helices were predicted. Thus, the prediction is that about one-fourth of all proteins from yeast VIII contain one transmembrane helix, and some 20%, more than one.Keywords
This publication has 57 references indexed in Scilit:
- Prediction of Transmembrane Segments in Proteins Utilising Multiple Sequence AlignmentsJournal of Molecular Biology, 1994
- Redefining the goals of protein secondary structure predictionJournal of Molecular Biology, 1994
- Prediction of Protein Secondary Structure at Better than 70% AccuracyJournal of Molecular Biology, 1993
- Quadratic Minimization of Predictors for Protein Secondary StructureJournal of Molecular Biology, 1993
- Non-random Distribution of Amino Acids in the Transmembrane Segments of Human Type I Single Span Membrane ProteinsJournal of Molecular Biology, 1993
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992
- Membrane protein structure predictionJournal of Molecular Biology, 1992
- Model for the structure of bacteriorhodopsin based on high-resolution electron cryo-microscopyJournal of Molecular Biology, 1990
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977