Physical Mapping by STS Hybridization: Algorithmic Strategies and the Challenge of Software Evaluation
- 1 January 1995
- journal article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 2 (2) , 219-273
- https://doi.org/10.1089/cmb.1995.2.219
Abstract
An important tool in the analysis of genomic sequences is the physical map. In this paper we examine the construction of physical maps from hybridization data between sequence tag sites (STS) probes and clones of genomic fragments. An algorithmic theory of the mapping process, a proposed performance evaluation procedure, and several new algorithmic strategies for mapping are given. A unifying theme for these developments is the idea of a "conservative extension." An algorithm, measure of algorithm quality, or description of physical map is a conservative extension if it is a generalization for data with errors of a corresponding concept in the error-free case. In our algorithmic theory we show that the nature of hybridization experiments imposes inherent limitations on the mapping information recorded in the experimental data. We prove that only certain types of mapping information can be reliably calculated by any algorithm. A test generator is then presented along with quantitative measures for determining how much of the possible information is being computed by a given algorithm. Weaknesses and strengths of these measures are discussed. Each of the new algorithms presented in this paper is based on combinatorial optimizations. Despite the fact that all the optimizations are NP-complete, we have developed algorithmic tools for the design of competitive approximation algorithms. We apply our performance evaluation program to our algorithms and obtain solid evidence that the algorithms are capable of retrieving high-level reliable mapping information.Keywords
This publication has 18 references indexed in Scilit:
- An Algorithm to Detect Chimeric Clones and Random Noise in Genomic MappingGenomics, 1994
- A first-generation physical map of the human genomeNature, 1993
- The Human Y Chromosome: A 43-Interval Map Based on Naturally Occurring DeletionsScience, 1992
- The Human Y Chromosome: Overlapping DNA Clones Spanning the Euchromatic RegionScience, 1992
- Mapping the way aheadNature, 1992
- Continuum of overlapping clones spanning the entire human chromosome 21qNature, 1992
- Detection and characterization of chimeric yeast artificial-chromosome clonesGenomics, 1991
- Genomic mapping by fingerprinting random clones: A mathematical analysisGenomics, 1988
- Testing for the consecutive ones property, interval graphs, and graph planarity using PQ-tree algorithmsJournal of Computer and System Sciences, 1976
- Incidence matrices and interval graphsPacific Journal of Mathematics, 1965