MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery
Top Cited Papers
Open Access
- 28 August 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 38 (18) , e178
- https://doi.org/10.1093/nar/gkq622
Abstract
The accurate mapping of reads that span splice junctions is a critical component of all analytic techniques that work with RNA-seq data. We introduce a second generation splice detection algorithm, MapSplice, whose focus is high sensitivity and specificity in the detection of splices as well as CPU and memory efficiency. MapSplice can be applied to both short (<75 bp) and long reads (≥75 bp). MapSplice is not dependent on splice site features or intron length, consequently it can detect novel canonical as well as non-canonical splices. MapSplice leverages the quality and diversity of read alignments of a given splice to increase accuracy. We demonstrate that MapSplice achieves higher sensitivity and specificity than TopHat and SpliceMap on a set of simulated RNA-seq data. Experimental studies also support the accuracy of the algorithm. Splice junctions derived from eight breast cancer RNA-seq datasets recapitulated the extensiveness of alternative splicing on a global level as well as the differences between molecular subtypes of breast cancer. These combined results indicate that MapSplice is a highly accurate algorithm for the alignment of RNA-seq reads to splice junctions. Software download URL: http://www.netlab.uky.edu/p/bioinfo/MapSplice.Keywords
This publication has 36 references indexed in Scilit:
- ASTD: The Alternative Splicing and Transcript Diversity databaseGenomics, 2009
- TopHat: discovering splice junctions with RNA-SeqBioinformatics, 2009
- Ultrafast and memory-efficient alignment of short DNA sequences to the human genomeGenome Biology, 2009
- Statistical inferences for isoform expression in RNA-SeqBioinformatics, 2009
- Alternative isoform regulation in human tissue transcriptomesNature, 2008
- Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencingNature Genetics, 2008
- Expression of 24,426 human alternative splicing events and predicted cis regulation in 48 tissues and cell linesNature Genetics, 2008
- Mapping short DNA sequencing reads and calling variants using mapping quality scoresGenome Research, 2008
- Optimal spliced alignments of short sequence readsBioinformatics, 2008
- Mapping and quantifying mammalian transcriptomes by RNA-SeqNature Methods, 2008