Fold recognition by combining sequence profiles derived from evolution and from depth‐dependent structural alignment of fragments
- 2 November 2004
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 58 (2) , 321-328
- https://doi.org/10.1002/prot.20308
Abstract
Recognizing structural similarity without significant sequence identity has proved to be a challenging task. Sequence‐based and structure‐based methods as well as their combinations have been developed. Here, we propose a fold‐recognition method that incorporates structural information without the need of sequence‐to‐structure threading. This is accomplished by generating sequence profiles from protein structural fragments. The structure‐derived sequence profiles allow a simple integration with evolution‐derived sequence profiles and secondary‐structural information for an optimized alignment by efficient dynamic programming. The resulting method (called SP3) is found to make a statistically significant improvement in both sensitivity of fold recognition and accuracy of alignment over the method based on evolution‐derived sequence profiles alone (SP) and the method based on evolution‐derived sequence profile and secondary structure profile (SP2). SP3 was tested in SALIGN benchmark for alignment accuracy and Lindahl, PROSPECTOR 3.0, and LiveBench 8.0 benchmarks for remote‐homology detection and model accuracy. SP3 is found to be the most sensitive and accurate single‐method server in all benchmarks tested where other methods are available for comparison (although its results are statistically indistinguishable from the next best in some cases and the comparison is subjected to the limitation of time‐dependent sequence and/or structural library used by different methods.). In LiveBench 8.0, its accuracy rivals some of the consensus methods such as ShotGun‐INBGU, Pmodeller3, Pcons4, and ROBETTA. SP3 fold‐recognition server is available on http://theory.med.buffalo.edu. Proteins 2005.Keywords
This publication has 80 references indexed in Scilit:
- Single‐body residue‐level knowledge‐based energy score combined with sequence‐profile and secondary structure information for fold recognitionProteins-Structure Function and Bioinformatics, 2004
- Within the twilight zone: a sensitive profile-profile comparison tool based on information theoryJournal of Molecular Biology, 2002
- Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structureJournal of Molecular Biology, 2001
- Pairwise sequence alignment below the twilight zone11Edited by B. HonigJournal of Molecular Biology, 2001
- Assessment of the CASP4 fold recognition categoryProteins-Structure Function and Bioinformatics, 2001
- Protein secondary structure prediction based on position-specific scoring matrices 1 1Edited by G. Von HeijneJournal of Molecular Biology, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- A new approach to protein fold recognitionNature, 1992
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983