Domain deletions and substitutions in the modular protein evolution
Open Access
- 26 April 2006
- journal article
- research article
- Published by Wiley in The FEBS Journal
- Vol. 273 (9) , 2037-2047
- https://doi.org/10.1111/j.1742-4658.2006.05220.x
Abstract
The main mechanisms shaping the modular evolution of proteins are gene duplication, fusion and fission, recombination and loss of fragments. While a large body of research has focused on duplications and fusions, we concentrated, in this study, on how domains are lost. We investigated motif databases and introduced a measure of protein similarity that is based on domain arrangements. Proteins are represented as strings of domains and comparison was based on the classic dynamic alignment scheme. We found that domain losses and duplications were more frequent at the ends of proteins. We showed that losses can be explained by the introduction of start and stop codons which render the terminal domains nonfunctional, such that further shortening, until the whole domain is lost, is not evolutionarily selected against. We demonstrated that domains which also occur as single‐domain proteins are less likely to be lost at the N terminus and in the middle, than at the C terminus. We conclude that fission/fusion events with single‐domain proteins occur mostly at the C terminus. We found that domain substitutions are rare, in particular in the middle of proteins.We also showed that many cases of substitutions or losses result from erroneous annotations, but we were also able to find courses of evolutionary events where domains vanish over time. This is explained by a case study on the bacterial formate dehydrogenases.Keywords
This publication has 28 references indexed in Scilit:
- Evolution of Circular Permutations in Multidomain ProteinsMolecular Biology and Evolution, 2006
- The evolution of domain arrangements in proteins and interaction networksCellular and Molecular Life Sciences, 2005
- Structure, function and evolution of multidomain proteinsCurrent Opinion in Structural Biology, 2004
- Supra-domains: Evolutionary Units Larger than Single Protein DomainsJournal of Molecular Biology, 2004
- The geometry of domain combination in proteins 1 1Edited by J. ThorntonJournal of Molecular Biology, 2002
- Domain combinations in archaeal, eubacterial and eukaryotic proteomesJournal of Molecular Biology, 2001
- Crystal Structure of Formate Dehydrogenase H: Catalysis Involving Mo, Molybdopterin, Selenocysteine, and an Fe 4 S 4 ClusterScience, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Evolutionarily Mobile Modules in ProteinsScientific American, 1993
- A general method applicable to the search for similarities in the amino acid sequence of two proteinsJournal of Molecular Biology, 1970