Genome rearrangement with gene families
Open Access
- 1 November 1999
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 15 (11) , 909-917
- https://doi.org/10.1093/bioinformatics/15.11.909
Abstract
Motivation: The theory and practice of genome rearrangement analysis breaks down in the biologically widespread contexts where each gene may be present in a number of copies, not necessarily contiguous. In some of these contexts it is, however, appropriate to ask which members of each gene family in two genomes G and H, lengths lG and lH, are its true exemplars , i.e. which best reflect the original position of the ancestral gene in the common ancestor genome. This entails a search for the two exemplar strings of same length n (= number of gene families, including singletons), having the smallest possible rearrangement distance: the exemplar distance . Results: A branch and bound algorithm calculates these distances efficiently when based on easily calculated traditional rearrangement distances, such as signed reversals distance or breakpoint distance, which also satisfy a property of monotonicity in the number of genes. Simulations show that in two random genomes, the expected exemplar distance/n is sensitive to the number and size of gene families, but approaches 1 as the number of singleton families increases. When the basic rearrangement distance is just the number of breakpoints, the expected cost of computing the exemplar breakpoints distance (EBD), as measured by total calls to the underlying breakpoint distance routine, is highly dependent on both n and the configuration of gene families. On the other hand, basing exemplar distance on exemplar reversals distance (ERD), the expected computing cost depends on the configuration of gene families but is not sensitive to n. Availability: Code for EBD and ERD is available from the author or may be accessed at http://www.crm.umontreal.ca/~viart/exemplar˙dis.html Contact: sankoff@ere.umontreal.caKeywords
This publication has 0 references indexed in Scilit: