Gene Family Evolution by Duplication, Speciation, and Loss
- 1 October 2008
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 15 (8) , 1043-1062
- https://doi.org/10.1089/cmb.2008.0054
Abstract
We consider two algorithmic questions related to the evolution of gene families. First, given a gene tree for a gene family, can the evolutionary history of this family be explained with only speciation and duplication events? Such gene trees are called DS-trees. We show that this question can be answered in linear time, and that a DS-tree induces a single species tree. We then study a natural extension of this problem: what is the minimum number of gene losses involved in an evolutionary history leading to an observed gene tree or set of gene trees? Based on our characterization of DS-trees, we propose a heuristic for this problem, and evaluate it on a dataset of plants gene families and on simulated data.Keywords
This publication has 30 references indexed in Scilit:
- Comparing Genomes with Duplications: A Computational Complexity Point of ViewIEEE/ACM Transactions on Computational Biology and Bioinformatics, 2007
- Inferring a Duplication, Speciation and Loss History from a Gene Tree (Extended Abstract)Published by Springer Nature ,2007
- Optimal Gene Trees from Sequences and Species Trees Using a Soft Interpretation of ParsimonyJournal of Molecular Evolution, 2006
- The gain and loss of genes during 600 million years of vertebrate evolutionGenome Biology, 2006
- Eleven ancestral gene families lost in mammals and vertebrates while otherwise universally conserved in animalsBMC Ecology and Evolution, 2006
- Reconciling Gene Trees with Apparent PolytomiesPublished by Springer Nature ,2006
- Reconciling a gene tree to a species tree under the duplication cost modelTheoretical Computer Science, 2005
- Rates and patterns of gene duplication and loss in the human genomeProceedings Of The Royal Society B-Biological Sciences, 2005
- NOTUNG: A Program for Dating Gene Duplications and Optimizing Gene Family TreesJournal of Computational Biology, 2000
- The computational complexity of inferring rooted phylogenies by parsimonyMathematical Biosciences, 1986