An Eulerian path approach to DNA fragment assembly
Top Cited Papers
- 14 August 2001
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 98 (17) , 9748-9753
- https://doi.org/10.1073/pnas.171285098
Abstract
For the last 20 years, fragment assembly in DNA sequencing followed the “overlap–layout–consensus” paradigm that is used in all currently available assembly tools. Although this approach proved useful in assembling clones, it faces difficulties in genomic shotgun assembly. We abandon the classical “overlap–layout–consensus” approach in favor of a new euler algorithm that, for the first time, resolves the 20-year-old “repeat problem” in fragment assembly. Our main result is the reduction of the fragment assembly to a variation of the classical Eulerian path problem that allows one to generate accurate solutions of large-scale sequencing problems. euler , in contrast to the celera assembler, does not mask such repeats but uses them instead as a powerful fragment assembly tool.Keywords
This publication has 22 references indexed in Scilit:
- The Sequence of the Human GenomeScience, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- Complete DNA sequence of a serogroup A strain of Neisseria meningitidis Z2491Nature, 2000
- A Whole-Genome Assembly of DrosophilaScience, 2000
- Whole-Genome Random Sequencing and Assembly of Haemophilus influenzae RdScience, 1995
- Combinatorial algorithms for DNA sequence assemblyAlgorithmica, 1995
- A New Algorithm for DNA Sequence AssemblyJournal of Computational Biology, 1995
- Toward Simplifying and Accurately Formulating Fragment AssemblyJournal of Computational Biology, 1995
- TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing ProjectsGenome Science and Technology, 1995
- The accuracy of DNA sequences: Estimating sequence qualityGenomics, 1992