Analysis of TASSER-based CASP7 protein structure prediction results
- 1 January 2007
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 69 (S8) , 90-97
- https://doi.org/10.1002/prot.21649
Abstract
An improved TASSER (Threading/ASSEmbly/Refinement) methodology is applied to predict the tertiary structure for all CASP7 targets. TASSER employs template identification by threading, followed by tertiary structure assembly by rearranging continuous template fragments, where conformational space is searched via Parallel Hyperbolic Monte Carlo sampling with an optimized force‐field that includes knowledge‐based statistical potentials and restraints derived from threading templates. The final models are selected by clustering structures from the low temperature replicas. Improvements in TASSER over CASP6 involve use of better templates from 3D‐jury applied to three threading programs, PROSPECTOR_3, SP3, and SPARKS, and a fragment comparison method for better model ranking. For targets with no reliable templates, a variant of TASSER (chunk‐TASSER) is also applied with potentials and restraints extracted from ab initio folded supersecondary chunks of the target to build full‐length models. For all 124 CASP targets/domains, the average root‐mean‐square‐deviation (RMSD) from native and alignment coverage of the best initial threading models from 3D‐jury are 6.2 Å and 93%, respectively. Following TASSER reassembly, the average RMSD of the best model in the template aligned region decreases to 4.9 Å and the average TM‐score increases from 0.617 for the template to 0.678 for the best full‐length model. Based on target difficulty, the average TM‐scores of the final model to native are 0.904, 0.671, and 0.307 for high‐accuracy template‐based modeling, template‐based modeling, and free modeling targets/domains, respectively. For the more difficult targets, TASSER with modest human intervention performed better in comparison to its server counterpart, MetaTASSER, which used a limited time simulation. Proteins 2007.Keywords
This publication has 30 references indexed in Scilit:
- Developing a move-set for protein model refinementBioinformatics, 2006
- SPARKS 2 and SP3 servers in CASP6Proteins-Structure Function and Bioinformatics, 2005
- PROTEIN REFINEMENT: A NEW CHALLENGE FOR CASP IN ITS 10TH ANNIVERSARYBioinformatics, 2005
- Single‐body residue‐level knowledge‐based energy score combined with sequence‐profile and secondary structure information for fold recognitionProteins-Structure Function and Bioinformatics, 2004
- SPICKER: A clustering approach to identify near‐native protein foldsJournal of Computational Chemistry, 2004
- GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequencesJournal of Molecular Biology, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Sequence alignment and penalty choiceJournal of Molecular Biology, 1994
- A new approach to protein fold recognitionNature, 1992
- Basic local alignment search toolJournal of Molecular Biology, 1990