Rapid motif-based prediction of circular permutations in multi-domain proteins

Open Access

1 April 2005

journal article
research article
Published by Oxford University Press (OUP) in Bioinformatics

Vol. 21 (7) , 932-937
https://doi.org/10.1093/bioinformatics/bti085

Abstract

Motivation: Rearrangements of protein domains and motifs such as swaps and circular permutations (CPs) can produce erroneous results in searching sequence databases when using traditional methods based on linear sequence alignments. Circular permutations are also of biological relevance because they can help to better understand both protein evolution and functionality. Results: We have developed an algorithm, RASPODOM, which is based on the classical recursive alignment scheme. Sequences are represented as strings of domains taken from precompiled resources of domain (motif) databases such as ProDom. The algorithm works several orders of magnitude faster than a reimplementation of the existing CP detection algorithm working on strings of amino acids, produces virtually no false positives and allows the discrimination of true CPs from ‘intermediate’ CPs (iCPs). Several true CPs which have not been reported in literature so far could be identified from Swiss-Prot/TrEMBL within minutes. Availability: Source codes, additional scripts, data and a web-based interface can be found on: http://www.uni-muenster.de/Biologie.Botanik/ebb/projects/raspodom/ Contact:ebb@uni-muenster.de

Keywords

This publication has 26 references indexed in Scilit:

Identification of common molecular subsequences
Published by Elsevier ,2004
Structure, function and evolution of multidomain proteins
Current Opinion in Structural Biology, 2004
More than the sum of their parts: On the evolution of proteins from peptides
BioEssays, 2003
Circularly permuted proteins in the protein structure database
Protein Science, 2001
Random circular permutation of DsbA reveals segments that are essential for protein folding and stability
Journal of Molecular Biology, 1999
Modular arrangement of proteins as inferred from analysis of homology
Protein Science, 1994
Basic Local Alignment Search Tool
Journal of Molecular Biology, 1990
Basic local alignment search tool
Journal of Molecular Biology, 1990
Circular and circularly permuted forms of bovine pancreatic trypsin inhibitor
Journal of Molecular Biology, 1983
A general method applicable to the search for similarities in the amino acid sequence of two proteins
Journal of Molecular Biology, 1970