Rapid motif-based prediction of circular permutations in multi-domain proteins
Open Access
- 1 April 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (7) , 932-937
- https://doi.org/10.1093/bioinformatics/bti085
Abstract
Motivation: Rearrangements of protein domains and motifs such as swaps and circular permutations (CPs) can produce erroneous results in searching sequence databases when using traditional methods based on linear sequence alignments. Circular permutations are also of biological relevance because they can help to better understand both protein evolution and functionality.
Results: We have developed an algorithm, RASPODOM, which is based on the classical recursive alignment scheme. Sequences are represented as strings of domains taken from precompiled resources of domain (motif) databases such as ProDom. The algorithm works several orders of magnitude faster than a reimplementation of the existing CP detection algorithm working on strings of amino acids, produces virtually no false positives and allows the discrimination of true CPs from ‘intermediate’ CPs (iCPs). Several true CPs which have not been reported in literature so far could be identified from Swiss-Prot/TrEMBL within minutes.
Availability: Source codes, additional scripts, data and a web-based interface can be found on: http://www.uni-muenster.de/Biologie.Botanik/ebb/projects/raspodom/
Contact: ebb@uni-muenster.de
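The idea of treating a protein as a string of domains rather than a string of amino acids can be illustrated with a minimal sketch. This is not the RASPODOM algorithm, which the abstract describes as based on a recursive alignment scheme; it is a simplified exact-match check, assuming each protein is already given as an ordered list of domain identifiers. The function name `find_rotation` and the labels `"A"` to `"D"` are hypothetical placeholders, not real ProDom entries.

```python
from typing import Optional, Sequence


def find_rotation(a: Sequence[str], b: Sequence[str]) -> Optional[int]:
    """Return k (1 <= k < len(a)) such that rotating `a` left by k
    domains gives `b`, or None if no such rotation exists.

    Assumption: domains are compared by exact identifier equality,
    with no scoring, gaps, or partial matches.
    """
    a, b = list(a), list(b)
    # A circular permutation needs equal domain counts and at least two domains.
    if len(a) != len(b) or len(a) < 2:
        return None
    # Skip k = 0: an identical domain order is not a permutation.
    for k in range(1, len(a)):
        if a[k:] + a[:k] == b:
            return k
    return None


# Hypothetical domain arrangements of two proteins.
protein_1 = ["A", "B", "C", "D"]
protein_2 = ["C", "D", "A", "B"]
print(find_rotation(protein_1, protein_2))  # prints 2
```

Because a multi-domain protein typically contains only a handful of domains while its amino acid sequence runs to hundreds of residues, comparisons at the domain level operate on much shorter strings, which is consistent with the speed-up the abstract reports.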