Using PQ Structures for Genomic Rearrangement Phylogeny
- 1 December 2006
- journal article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 13 (10) , 1685-1700
- https://doi.org/10.1089/cmb.2006.13.1685
Abstract
Various international efforts are underway to catalog the genomic similarities and variations in the human population. Some key discoveries such as inversions and transpositions within the members of the species have also been made over the years. The task of constructing a phylogeny tree of the members of the same species, given this knowledge and data, is an important problem. In this context, a key observation is that the "distance" between two members, or member and ancestor, within the species is small. In this paper, we pose the tree reconstruction problem exploiting some of these peculiarities. The central idea of the paper is based on the notion of minimal consensus PQ tree T of sequences. We use a modified PQ structure (termed oPQ) and show that both the number and size of each T is O(1). We further show that the tree reconstruction problem is statistically well-defined (Theorem 7) and give a simple scheme to construct the phylogeny tree and the common ancestors. Our preliminary experiments with simulated data look very promising.Keywords
This publication has 21 references indexed in Scilit:
- The International HapMap ProjectNature, 2003
- Molecular Characterization of the Pericentric Inversion That Causes Differences Between Chimpanzee Chromosome 19 and Human Chromosome 17American Journal of Human Genetics, 2002
- Use of Comparative Physical and Sequence Mapping to Annotate Mouse Chromosome 16 and Human Chromosome 21Genomics, 2001
- Olfactory Receptor–Gene Clusters, Genomic-Inversion Polymorphisms, and Common Chromosome RearrangementsAmerican Journal of Human Genetics, 2001
- Multiple Genome Rearrangement and Breakpoint PhylogenyJournal of Computational Biology, 1998
- Paracentric inversions in humans: A review of 446 paracentric inversions with presentation of 120 new casesAmerican Journal of Medical Genetics, 1995
- Identification of novel genes, SYT and SSX, involved in the t(X;18)(p11.2;q11.2) translocation found in human synovial sarcomaNature Genetics, 1994
- Inversions disrupting the factor VIII gene are a common cause of severe haemophilia ANature Genetics, 1993
- Chromosomal translocation t(15;17) in human acute promyelocytic leukemia fuses RARα with a novel putative transcription factor, PMLCell, 1991
- Testing for the consecutive ones property, interval graphs, and graph planarity using PQ-tree algorithmsJournal of Computer and System Sciences, 1976