Algorithms for Computing and Integrating Physical Maps Using Unique Probes

1 January 1997

journal article
research article
Published by Mary Ann Liebert Inc in Journal of Computational Biology

Vol. 4 (4) , 449-466
https://doi.org/10.1089/cmb.1997.4.449

Abstract

Current physical mapping projects based on STS-probes involve additional clues such as the fact that some probes are anchored to a known map and that others come from the ends of clones. Because of the disparate combinatorial contributions of these varied data items, it is difficult to design a "tailored" algorithm that incorporates them all. Moreover, it is inevitable that new experiments will provide new kinds of data, making obsolete any such algorithm. We show how to convert the physical mapping problem into a 0/1 linear programming (LP) problem. We further show how one can incorporate additional clues as additional constraints in the LP formulation. We give a simple relaxation of the 0/1 LP problem, which solves problems of the same scale as previously reported tailored algorithms, to equal or greater optimization levels. We also present a theorem proving that when the data is 100% accurate, then the relaxed and integer solutions coincide. The LP algorithm suffices to solve problems on the order of 80–100 probes—the typical size of the 2- or 3-connected contigs of Arratia et al. (1991). We give a heuristic algorithm which attempts to order and link the set of LP-solved contigs. Unlike previous work, this algorithm only links and orders contigs when the join is 90% or more likely to be correct. It is our view that there is no value in computing an optimal solution with respect to some criteria over very noisy data as this optimal solution rarely corresponds to the true solution. The paper involves extensive empirical trials over real and simulated data.

Keywords

This publication has 10 references indexed in Scilit:

Combinatorial optimization
ACM Computing Surveys, 1996
Practical problem solving with cutting plane algorithms in combinatorial optimization
Published by American Mathematical Society (AMS) ,1995
Computing near-optimal solutions to combinatorial optimization problems
Published by American Mathematical Society (AMS) ,1995
A Note on Scoring Clones Given a Probe Ordering
Journal of Computational Biology, 1995
On the Complexity of DNA Physical Mapping
Advances in Applied Mathematics, 1994
Genomic mapping by anchoring random clones: A mathematical analysis
Genomics, 1991
Randomized rounding: A technique for provably good algorithms and algorithmic proofs
Combinatorica, 1987
Total Ordering Problem
SIAM Journal on Computing, 1979
Polynomial Complete Consecutive Information Retrieval Problems
SIAM Journal on Computing, 1977
Testing for the consecutive ones property, interval graphs, and graph planarity using PQ-tree algorithms
Journal of Computer and System Sciences, 1976