An algorithm for assembly of ordered restriction maps from single DNA molecules
Open Access
- 24 October 2006
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 103 (43), 15770-15775
- https://doi.org/10.1073/pnas.0604040103
Abstract
The restriction mapping of a massive number of individual DNA molecules by optical mapping enables assembly of physical maps spanning mammalian and plant genomes; however, not through computational means permitting completely de novo assembly. Existing algorithms are not practical for genomes larger than lower eukaryotes due to their high time and space complexity. In many ways, sequence assembly parallels map assembly, so that the overlap–layout–consensus strategy, recently shown effective in assembling very large genomes in feasible time, sheds new light on solving map construction issues associated with single molecule substrates. Accordingly, we report an adaptation of this approach as the formal basis for de novo optical map assembly and demonstrate its computational feasibility for assembly of very large genomes. As such, we discuss assembly results for a series of genomes: human, plant, lower eukaryote and bacterial. Unlike sequence assembly, the optical map assembly problem is actually more complex because restriction maps from single molecules are constructed, manifesting errors stemming from: missing cuts, false cuts, and high variance of estimated fragment sizes; chimeric maps resulting from artifactually merged molecules; and true overlap scores that are “in the noise” or “slightly above the noise.” We address these problems, fundamental to many single molecule measurements, by an effective error correction method using global overlap information to eliminate spurious overlaps and chimeric maps that are otherwise difficult to identify.
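The error sources named in the abstract (missing cuts, false cuts, and fragment sizing variance) can be illustrated with a toy comparison of two restriction maps. The sketch below is not the scoring function used in the paper, which the abstract does not specify. It is a minimal dynamic program under assumed parameters: the name `map_distance`, the relative sizing deviation `rel_sd`, the per-cut penalty `cut_penalty`, and the block limit `max_block` are all hypothetical. It also compares two maps end to end, whereas overlap detection would need to leave the map ends free.

```python
import math


def map_distance(a, b, max_block=3, rel_sd=0.1, cut_penalty=1.0):
    """Toy global alignment cost between two maps given as fragment-size lists.

    Assumptions (not from the paper): fragment sizes are positive, sizing
    error scales with fragment size (rel_sd), and every cut that has no
    partner in the other map costs a fixed cut_penalty.
    """
    n, m = len(a), len(b)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Align the last p fragments of a[:i] with the last q of b[:j].
            for p in range(1, min(max_block, i) + 1):
                size_a = sum(a[i - p:i])
                for q in range(1, min(max_block, j) + 1):
                    prev = cost[i - p][j - q]
                    if prev == math.inf:
                        continue
                    size_b = sum(b[j - q:j])
                    sd = rel_sd * (size_a + size_b) / 2
                    sizing = ((size_a - size_b) / sd) ** 2
                    # Merging k fragments hides k - 1 cuts that the other
                    # map does not show (a missing or a false cut).
                    unmatched = (p - 1) + (q - 1)
                    cand = prev + sizing + cut_penalty * unmatched
                    if cand < cost[i][j]:
                        cost[i][j] = cand
    return cost[n][m]


reference = [10, 20, 30, 40]
one_missing_cut = [10, 50, 40]  # the cut between 20 and 30 was not observed
print(map_distance(reference, one_missing_cut))
print(map_distance(reference, reference))
```

With these assumed parameters, the map that lacks one cut is charged a single cut penalty and no sizing error, while unrelated maps accumulate large sizing terms. A real assembler would need a statistically grounded score and a threshold separating true overlaps from those "in the noise".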