Efficient reconstruction of phylogenetic networks with constrained recombination
- 1 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 111, 363-374
- https://doi.org/10.1109/csb.2003.1227337
Abstract
A phylogenetic network is a generalization of a phylogenetic tree, allowing structural properties that are not treelike. With the growth of genomic data, much of which does not fit ideal tree models, there is greater need to understand the algorithmics and combinatorics of phylogenetic networks. We consider the problem of determining whether a given set of sequences can be derived on a phylogenetic network where the recombination cycles are node disjoint. In this paper, we call such a phylogenetic network a "galled-tree". By more deeply analysing the combinatorial constraints on cycle-disjoint phylogenetic networks, we obtain an efficient algorithm that is guaranteed to be both a necessary and sufficient test for the existence of a galled-tree for the data. If there is a galled-tree, the algorithm constructs one and obtains an implicit representation of all the galled-trees for the data, each of which can then be generated in linear time. We also note two additional results related to galled-trees: first, any set of sequences that can be derived on a galled-tree can be derived on a true tree (without recombination cycles), where at most one back mutation is allowed per site; second, the site compatibility problem (which is NP-hard in general) can be solved in linear time for any set of sequences that can be derived on a galled-tree. The combinatorial constraints we develop apply (for the most part) to node-disjoint cycles in any phylogenetic network (not just galled-trees), and can be used, for example, to prove that a given site cannot be on a node-disjoint cycle in any phylogenetic network. Perhaps more important than the specific results about galled-trees, we introduce an approach that can be used to study recombination in phylogenetic networks that go beyond galled-trees.
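The abstract refers to site compatibility without defining it. As background, a minimal sketch of the standard pairwise check (the four-gamete test) is given here: two binary sites are incompatible when the sequences show all four combinations 00, 01, 10 and 11 at those two positions. This is not the paper's galled-tree algorithm; the function names `sites_conflict` and `conflict_pairs` and the sample data are hypothetical, and the sequences are assumed to be equal-length strings over `0` and `1`.

```python
from itertools import combinations


def sites_conflict(sequences, i, j):
    """Return True if sites i and j show all four gametes 00, 01, 10, 11.

    Assumes binary sequences (characters '0' and '1') of equal length.
    """
    gametes = {(seq[i], seq[j]) for seq in sequences}
    return len(gametes) == 4


def conflict_pairs(sequences):
    """List every pair of site indices (i, j), i < j, that fails the test."""
    n_sites = len(sequences[0])
    return [
        (i, j)
        for i, j in combinations(range(n_sites), 2)
        if sites_conflict(sequences, i, j)
    ]


# Hypothetical sample data: sites 0 and 1 show 00, 10, 01 and 11.
sample = ["000", "100", "010", "110", "001"]
print(conflict_pairs(sample))
```

A set of sites with no conflicting pair fits a tree without recombination; conflicting pairs such as the one in `sample` are what force either a recombination cycle or a repeated mutation.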