The GNUMAP algorithm: unbiased probabilistic mapping of oligonucleotides from next-generation sequencing
Open Access
- 27 October 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 26 (1), 38-45
- https://doi.org/10.1093/bioinformatics/btp614
Abstract
Motivation: The advent of next-generation sequencing technologies has increased the accuracy and quantity of sequence data, opening the door to greater opportunities in genomic research.
Results: In this article, we present GNUMAP (Genomic Next-generation Universal MAPper), a program capable of overcoming two major obstacles in the mapping of reads from next-generation sequencing runs. First, we have created an algorithm that probabilistically maps reads to repeat regions in the genome on a quantitative basis. Second, we have developed a probabilistic Needleman–Wunsch algorithm which utilizes _prb.txt and _int.txt files produced in the Solexa/Illumina pipeline to improve the mapping accuracy for lower quality reads and increase the amount of usable data produced in a given experiment.
Availability: The source code for the software can be downloaded from http://dna.cs.byu.edu/gnumap.
Contact: nathanlclement@gmail.com
Supplementary information: Supplementary data are available at Bioinformatics online.
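The abstract gives no implementation details for the probabilistic Needleman–Wunsch step, so the following is only an illustrative sketch of the general idea, not GNUMAP's actual code. It assumes each read position is represented as a probability distribution over the four bases (the kind of per-base information a _prb.txt file is described as carrying) and scores a read position against a reference base by its expected match/mismatch score. The function names, the scoring values, and the use of plain global alignment are all assumptions made for illustration.

```python
from typing import Dict, List

BASES = "ACGT"


def expected_score(probs: Dict[str, float], ref_base: str,
                   match: float = 1.0, mismatch: float = -1.0) -> float:
    """Expected substitution score of an uncertain read base against ref_base."""
    return sum(p * (match if base == ref_base else mismatch)
               for base, p in probs.items())


def probabilistic_nw(read_probs: List[Dict[str, float]], ref: str,
                     gap: float = -2.0, match: float = 1.0,
                     mismatch: float = -1.0) -> float:
    """Global alignment score of a probabilistic read against a reference.

    Hypothetical helper: standard Needleman-Wunsch recurrence, except that
    the diagonal step uses the expected score over the base distribution.
    """
    n, m = len(read_probs), len(ref)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = score[i - 1][j - 1] + expected_score(
                read_probs[i - 1], ref[j - 1], match, mismatch)
            up = score[i - 1][j] + gap      # gap in the reference
            left = score[i][j - 1] + gap    # gap in the read
            score[i][j] = max(diag, up, left)
    return score[n][m]


def certain(seq: str) -> List[Dict[str, float]]:
    """Represent a fully confident read as one-hot base distributions."""
    return [{b: 1.0 if b == c else 0.0 for b in BASES} for c in seq]


# A read whose last base is equally likely to be A or T.
uncertain_read = certain("ACG") + [{"A": 0.5, "C": 0.0, "G": 0.0, "T": 0.5}]
print(probabilistic_nw(certain("ACGT"), "ACGT"))
print(probabilistic_nw(uncertain_read, "ACGT"))
```

In this sketch a low-confidence base contributes a score between the match and mismatch values instead of a hard penalty, which is one way a lower quality read could still align usefully.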