Reassessing Domain Architecture Evolution of Metazoan Proteins: The Contribution of Different Evolutionary Mechanisms
Open Access
- 4 August 2011
- Vol. 2 (3) , 578-598
- https://doi.org/10.3390/genes2030578
Abstract
In the accompanying papers we have shown that sequence errors of public databases and confusion of paralogs and epaktologs (proteins that are related only through the independent acquisition of the same domain types) significantly distort the picture that emerges from comparison of the domain architecture (DA) of multidomain Metazoan proteins since they introduce a strong bias in favor of terminal over internal DA change. The issue of whether terminal or internal DA changes occur with greater probability has very important implications for the DA evolution of multidomain proteins since gene fusion can add domains only at terminal positions, whereas domain-shuffling is capable of inserting domains both at internal and terminal positions. As a corollary, overestimation of terminal DA changes may be misinterpreted as evidence for a dominant role of gene fusion in DA evolution. In this manuscript we show that in several recent studies of DA evolution of Metazoa the authors used databases that are significantly contaminated with incomplete, abnormal and mispredicted sequences (e.g., UniProtKB/TrEMBL, EnsEMBL) and/or the authors failed to separate paralogs and epaktologs, explaining why these studies concluded that the major mechanism for gains of new domains in metazoan proteins is gene fusion. In contrast with the latter conclusion, our studies on high quality orthologous and paralogous Swiss-Prot sequences confirm that shuffling of mobile domains had a major role in the evolution of multidomain proteins of Metazoa and especially those formed in early vertebrates.Keywords
This publication has 31 references indexed in Scilit:
- Reassessing Domain Architecture Evolution of Metazoan Proteins: Major Impact of Errors Caused by Confusing Paralogs and EpaktologsGenes, 2011
- Quantifying the mechanisms of domain gain in animal proteinsGenome Biology, 2010
- Agrin Binds BMP2, BMP4 and TGFβ1PLOS ONE, 2010
- How do proteins gain new domains?Genome Biology, 2010
- Domain mobility in proteins: functional and evolutionary implicationsBriefings in Bioinformatics, 2008
- Sequence Similarity Network Reveals Common Ancestry of Multidomain ProteinsPLoS Computational Biology, 2008
- Evolution of protein domain promiscuity in eukaryotesGenome Research, 2008
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- Global Discriminative Learning for Higher-Accuracy Computational Gene PredictionPLoS Computational Biology, 2007
- Modeling the Evolution of Protein Domain Architectures Using Maximum ParsimonyJournal of Molecular Biology, 2006