Algorithms for Computing and Integrating Physical Maps Using Unique Probes
- 1 January 1997
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 4 (4) , 449-466
- https://doi.org/10.1089/cmb.1997.4.449
Abstract
Current physical mapping projects based on STS-probes involve additional clues such as the fact that some probes are anchored to a known map and that others come from the ends of clones. Because of the disparate combinatorial contributions of these varied data items, it is difficult to design a "tailored" algorithm that incorporates them all. Moreover, it is inevitable that new experiments will provide new kinds of data, making obsolete any such algorithm. We show how to convert the physical mapping problem into a 0/1 linear programming (LP) problem. We further show how one can incorporate additional clues as additional constraints in the LP formulation. We give a simple relaxation of the 0/1 LP problem, which solves problems of the same scale as previously reported tailored algorithms, to equal or greater optimization levels. We also present a theorem proving that when the data is 100% accurate, then the relaxed and integer solutions coincide. The LP algorithm suffices to solve problems on the order of 80–100 probes—the typical size of the 2- or 3-connected contigs of Arratia et al. (1991). We give a heuristic algorithm which attempts to order and link the set of LP-solved contigs. Unlike previous work, this algorithm only links and orders contigs when the join is 90% or more likely to be correct. It is our view that there is no value in computing an optimal solution with respect to some criteria over very noisy data as this optimal solution rarely corresponds to the true solution. The paper involves extensive empirical trials over real and simulated data.Keywords
This publication has 10 references indexed in Scilit:
- Combinatorial optimizationACM Computing Surveys, 1996
- Practical problem solving with cutting plane algorithms in combinatorial optimizationPublished by American Mathematical Society (AMS) ,1995
- Computing near-optimal solutions to combinatorial optimization problemsPublished by American Mathematical Society (AMS) ,1995
- A Note on Scoring Clones Given a Probe OrderingJournal of Computational Biology, 1995
- On the Complexity of DNA Physical MappingAdvances in Applied Mathematics, 1994
- Genomic mapping by anchoring random clones: A mathematical analysisGenomics, 1991
- Randomized rounding: A technique for provably good algorithms and algorithmic proofsCombinatorica, 1987
- Total Ordering ProblemSIAM Journal on Computing, 1979
- Polynomial Complete Consecutive Information Retrieval ProblemsSIAM Journal on Computing, 1977
- Testing for the consecutive ones property, interval graphs, and graph planarity using PQ-tree algorithmsJournal of Computer and System Sciences, 1976