MM-align: a quick algorithm for aligning multiple-chain protein complex structures using iterative dynamic programming

Open Access

14 May 2009

journal article
research article
Published by Oxford University Press (OUP) in Nucleic Acids Research

Vol. 37 (11) , e83
https://doi.org/10.1093/nar/gkp318

Abstract

Structural comparison of multiple-chain protein complexes is essential in many studies of protein–protein interactions. We develop a new algorithm, MM-align, for sequence-independent alignment of protein complex structures. The algorithm is built on a heuristic iteration of a modified Needleman–Wunsch dynamic programming (DP) algorithm, with the alignment score specified by the inter-complex residue distances. The multiple chains in each complex are first joined, in every possible order, and then simultaneously aligned with cross-chain alignments prevented. The alignments of interface residues are enhanced by an interface-specific weighting factor. MM-align is tested on a large-scale benchmark set of 205 × 3897 non-homologous multiple-chain complex pairs. Compared with a naïve extension of the monomer alignment program of TM-align, the alignment accuracy of MM-align is significantly higher as judged by the average TM-score of the physically-aligned residues. MM-align is about two times faster than TM-align because of omitting the cross-alignment zone of the DP matrix. It also shows that the enhanced alignment of the interfaces helps in identifying biologically relevant protein complex pairs.

Keywords

HEURISTICS

This publication has 33 references indexed in Scilit:

Protein structure prediction: when is it useful?
Current Opinion in Structural Biology, 2009
MultiBind and MAPPIS: webservers for multiple alignment of protein 3D-binding sites and their interactions
Nucleic Acids Research, 2008
Alignment of Non-Covalent Interactions at Protein-Protein Interfaces
PLOS ONE, 2008
SABERTOOTH: protein structural alignment based on a vectorial structure representation
BMC Bioinformatics, 2007
The impact of translocations and gene fusions on cancer causation
Nature Reviews Cancer, 2007
SCOP: A structural classification of proteins database for the investigation of sequences and structures
Published by Elsevier ,2006
The Protein Data Bank
Nucleic Acids Research, 2000
The refined crystal structure of Drosophila lebanonensis alcohol dehydrogenase at 1.9 å resolution 1 1Edited by R. Huber
Journal of Molecular Biology, 1998
CATH – a hierarchic classification of protein domain structures
Published by Elsevier ,1997
Isolation, crystallization, crystal structure analysis and refinement of allophycocyanin from the cyanobacterium Spirulina platensis at 2.3 Å resolution
Journal of Molecular Biology, 1995