On filtering false positive transmembrane protein predictions
- 1 September 2002
- journal article
- Published by Oxford University Press (OUP) in Protein Engineering, Design and Selection
- Vol. 15 (9) , 745-752
- https://doi.org/10.1093/protein/15.9.745
Abstract
While helical transmembrane (TM) region prediction tools achieve high (>90%) success rates for real integral membrane proteins, they produce a considerable number of false positive hits in sequences of known nontransmembrane queries. We propose a modification of the dense alignment surface (DAS) method that achieves a substantial decrease in the false positive error rate. Essentially, a sequence that includes possible transmembrane regions is compared in a second step with TM segments in a sequence library of documented transmembrane proteins. If the performance of the query sequence against the library of documented TM segment-containing sequences in this test is lower than an empirical threshold, it is classified as a non-transmembrane protein. The probability of false positive prediction for trusted TM region hits is expressed in terms of E-values. The modified DAS method, the DAS-TMfilter algorithm, has an unchanged high sensitivity for TM segments ( approximately 95% detected in a learning set of 128 documented transmembrane proteins). At the same time, the selectivity measured over a non-redundant set of 526 soluble proteins with known 3D structure is approximately 99%, mainly because a large number of falsely predicted single membrane-pass proteins are eliminated by the DAS-TMfilter algorithm.Keywords
This publication has 41 references indexed in Scilit:
- Post-translational GPI lipid anchor modification of proteins in kingdoms of life: analysis of protein sequence data from complete genomesProtein Engineering, Design and Selection, 2001
- Use of immobilized PCR primers to generate covalently immobilized DNAs for in vitro transcription/translation reactionsNucleic Acids Research, 2000
- The Protein Data BankNucleic Acids Research, 2000
- Prediction of Potential GPI-modification Sites in Proprotein SequencesJournal of Molecular Biology, 1999
- SOSUI: classification and secondary structure prediction system for membrane proteins.Bioinformatics, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Prediction of transmembrane alpha-helices in prokaryotic membrane proteins: the dense alignment surface methodProtein Engineering, Design and Selection, 1997
- Protein Structure Prediction: Recognition of Primary, Secondary, and Tertiary Structural Features from Amino Acid SequenceCritical Reviews in Biochemistry and Molecular Biology, 1995
- New Alignment Strategy for Transmembrane ProteinsJournal of Molecular Biology, 1994
- IDENTIFYING NONPOLAR TRANSBILAYER HELICES IN AMINO ACID SEQUENCES OF MEMBRANE PROTEINSAnnual Review of Biophysics, 1986