New algorithms for the duplication-loss model
- 8 April 2000
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 138-146
- https://doi.org/10.1145/332306.332359
Abstract
We consider the problem of constructing a species tree given a number of gene trees. In the frameworks introduced by Goodman et al. [3], Page [10], and Guigó, Muchnik, and Smith [5], this is formulated as an optimization problem: finding the species tree that requires the minimum number of duplications and/or losses to explain the gene trees. In this paper, we introduce the WIDTH k DUPLICATION-LOSS and WIDTH k DUPLICATION problems. A gene tree has width k with respect to a species tree if the species tree can be reconciled with the gene tree using at most k simultaneously active copies of the gene along its branches. We explain, with respect to the underlying biological model, why this width is typically very small in comparison to the total number of duplications and losses. We show polynomial-time algorithms for finding optimal species trees having bounded width with respect to at least one of the input gene trees. Furthermore, we present the first algorithm for input gene trees that are unrooted. Lastly, we apply our algorithms to a dataset from [5] and show a species tree requiring significantly fewer duplications and fewer duplications/losses than the trees given in the original paper.
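As an illustration of the optimization problem stated in the abstract, the following is a minimal sketch of the duplication cost only, assuming the standard least-common-ancestor (LCA) reconciliation used in this line of work. It is not the bounded-width algorithm of the paper: losses and width are not computed, and the search over species trees is a plain enumeration of a hand-supplied candidate list. The nested-tuple tree encoding and all function names are assumptions made for this example.

```python
# Trees are nested tuples; a leaf is a species name (str).
# A gene-tree leaf is labelled with the species the gene was sampled from.
# All names below are hypothetical helpers for illustration only.

def leaf_set(tree):
    """Set of species labels appearing below a node."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))

def species_clusters(species_tree):
    """Leaf sets of every subtree of the species tree."""
    result = [leaf_set(species_tree)]
    if not isinstance(species_tree, str):
        for child in species_tree:
            result.extend(species_clusters(child))
    return result

def lca_map(gene_node, clusters):
    """Smallest species cluster containing all species below gene_node."""
    species = leaf_set(gene_node)
    return min((c for c in clusters if species <= c), key=len)

def count_duplications(gene_tree, species_tree):
    """Count gene nodes mapped to the same species node as one of their children."""
    clusters = species_clusters(species_tree)

    def visit(node):
        if isinstance(node, str):
            return 0
        here = lca_map(node, clusters)
        is_dup = any(lca_map(child, clusters) == here for child in node)
        return int(is_dup) + sum(visit(child) for child in node)

    return visit(gene_tree)

def best_species_tree(gene_trees, candidates):
    """Pick the candidate species tree with the fewest total duplications."""
    return min(
        candidates,
        key=lambda s: sum(count_duplications(g, s) for g in gene_trees),
    )

gene_trees = [(("a", "b"), "c"), (("a", "b"), "c"), (("a", "c"), "b")]
candidates = [(("a", "b"), "c"), (("a", "c"), "b"), (("b", "c"), "a")]
best = best_species_tree(gene_trees, candidates)
```

With these toy inputs, two of the three gene trees agree with the first candidate, so it needs only one duplication in total. Enumerating candidates like this is exponential in the number of species in general, which is the cost the bounded-width formulation is meant to avoid.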
This publication has 7 references indexed in Scilit:
- Predicting function: from genes to genomes and back. Journal of Molecular Biology, 1998
- GeneTree: comparing gene and species phylogenies using reconciled trees. Bioinformatics, 1998
- On reconstructing species trees from gene trees in term of duplications and losses. Association for Computing Machinery (ACM), 1998
- A Genomic Perspective on Protein Families. Science, 1997
- From Gene to Organismal Phylogeny: Reconciled Trees and the Gene Tree/Species Tree Problem. Molecular Phylogenetics and Evolution, 1997
- A Biologically Consistent Model for Comparing Molecular Phylogenies. Journal of Computational Biology, 1995
- Fitting the Gene Lineage into its Species Lineage, a Parsimony Strategy Illustrated by Cladograms Constructed from Globin Sequences. Systematic Zoology, 1979