MM-align: a quick algorithm for aligning multiple-chain protein complex structures using iterative dynamic programming
Open Access
- 14 May 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (11) , e83
- https://doi.org/10.1093/nar/gkp318
Abstract
Structural comparison of multiple-chain protein complexes is essential in many studies of protein–protein interactions. We develop a new algorithm, MM-align, for sequence-independent alignment of protein complex structures. The algorithm is built on a heuristic iteration of a modified Needleman–Wunsch dynamic programming (DP) algorithm, with the alignment score specified by the inter-complex residue distances. The multiple chains in each complex are first joined, in every possible order, and then simultaneously aligned with cross-chain alignments prevented. The alignments of interface residues are enhanced by an interface-specific weighting factor. MM-align is tested on a large-scale benchmark set of 205 × 3897 non-homologous multiple-chain complex pairs. Compared with a naïve extension of the monomer alignment program of TM-align, the alignment accuracy of MM-align is significantly higher as judged by the average TM-score of the physically-aligned residues. MM-align is about two times faster than TM-align because of omitting the cross-alignment zone of the DP matrix. It also shows that the enhanced alignment of the interfaces helps in identifying biologically relevant protein complex pairs.Keywords
This publication has 33 references indexed in Scilit:
- Protein structure prediction: when is it useful?Current Opinion in Structural Biology, 2009
- MultiBind and MAPPIS: webservers for multiple alignment of protein 3D-binding sites and their interactionsNucleic Acids Research, 2008
- Alignment of Non-Covalent Interactions at Protein-Protein InterfacesPLOS ONE, 2008
- SABERTOOTH: protein structural alignment based on a vectorial structure representationBMC Bioinformatics, 2007
- The impact of translocations and gene fusions on cancer causationNature Reviews Cancer, 2007
- SCOP: A structural classification of proteins database for the investigation of sequences and structuresPublished by Elsevier ,2006
- The Protein Data BankNucleic Acids Research, 2000
- The refined crystal structure of Drosophila lebanonensis alcohol dehydrogenase at 1.9 å resolution 1 1Edited by R. HuberJournal of Molecular Biology, 1998
- CATH – a hierarchic classification of protein domain structuresPublished by Elsevier ,1997
- Isolation, crystallization, crystal structure analysis and refinement of allophycocyanin from the cyanobacterium Spirulina platensis at 2.3 Å resolutionJournal of Molecular Biology, 1995