Genome Rearrangements and Sorting by Reversals
- 1 April 1996
- journal article
- Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Journal on Computing
- Vol. 25 (2) , 272-289
- https://doi.org/10.1137/s0097539793250627
Abstract
Sequence comparison in molecular biology is in the beginning of a major paradigm shift---a shift from gene comparison based on local mutations (i.e., insertions, deletions, and substitutions of nucleotides) to chromosome comparison based on global rearrangements (i.e., inversions and transpositions of fragments). The classical methods of sequence comparison do not work for global rearrangements, and little is known in computer science about the edit distance between sequences if global rearrangements are allowed. In the simplest form, the problem of gene rearrangements corresponds to sorting by reversals, i.e., sorting of an array using reversals of arbitrary fragments. Recently, Kececioglu and Sankoff gave the first approximation algorithm for sorting by reversals with guaranteed error bound 2 and identified open problems related to chromosome rearrangements. One of these problems is Gollan's conjecture on the reversal diameter of the symmetric group. This paper proves the conjecture. Further, the problem of expected reversal distance between two random permutations is investigated. The reversal distance between two random permutations is shown to be very close to the reversal diameter, thereby indicating that reversal distance provides a good separation between related and nonrelated sequences in molecular evolution studies. The gene rearrangement problem forces us to consider reversals of signed permutations, as the genes in DNA could be positively or negatively oriented. An approximation algorithm for signed permutation is presented, which provides a performance guarantee of ${3 \over 2}$. Finally, using the signed permutations approach, an approximation algorithm for sorting by reversals is described which achieves a performance guarantee of ${7 \over 4}$.
Keywords
This publication has 22 references indexed in Scilit:
- Exact and approximation algorithms for the inversion distance between two chromosomesPublished by Springer Nature ,2005
- DNA physical mapping and alternating Eulerian cycles in colored graphsAlgorithmica, 1995
- Analytical approaches to genomic evolutionBiochimie, 1993
- Evolution and Taxonomy of Positive-Strand RNA Viruses: Implications of Comparative Analysis of Amino Acid SequencesCritical Reviews in Biochemistry and Molecular Biology, 1993
- Chloroplast DNA Evidence on the Ancient Evolutionary Split in Vascular Land PlantsScience, 1992
- Reversing trains: A turn of the century sorting problemJournal of Algorithms, 1989
- Nucleotide sequence of regions homologous tonifH (nitrogenase Fe protein) from the nitrogen-fixing archaebacteriaMethanococcus thermolithotrophicus andMethanobacterium ivanovii: Evolutionary implicationsJournal of Molecular Evolution, 1988
- The chromosome inversion problemJournal of Theoretical Biology, 1982
- The minimum-length generator sequence problem is NP-hardJournal of Algorithms, 1981
- Bounds for sorting by prefix reversalDiscrete Mathematics, 1979