Supersplat—spliced RNA-seq alignment
Open Access
- 21 April 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 26 (12) , 1500-1505
- https://doi.org/10.1093/bioinformatics/btq206
Abstract
Motivation: High-throughput sequencing technologies have recently made deep interrogation of expressed transcript sequences practical, both economically and temporally. Identification of intron/exon boundaries is an essential part of genome annotation, yet remains a challenge. Here, we present supersplat, a method for unbiased splice-junction discovery through empirical RNA-seq data. Results: Using a genomic reference and RNA-seq high-throughput sequencing datasets, supersplat empirically identifies potential splice junctions at a rate of ∼11.4 million reads per hour. We further benchmark the performance of the algorithm by mapping Illumina RNA-seq reads to identify introns in the genome of the reference dicot plant Arabidopsis thaliana and we demonstrate the utility of supersplat for de novo empirical annotation of splice junctions using the reference monocot plant Brachypodium distachyon. Availability: Implemented in C++, supersplat source code and binaries are freely available on the web at http://mocklerlab-tools.cgrb.oregonstate.edu/ Contact:tmockler@cgrb.oregonstate.eduKeywords
This publication has 10 references indexed in Scilit:
- Genome sequencing and analysis of the model grass Brachypodium distachyonNature, 2010
- Genome-wide mapping of alternative splicing in Arabidopsis thalianaGenome Research, 2009
- Applications of Ultra-high-Throughput SequencingPublished by Springer Nature ,2009
- TopHat: discovering splice junctions with RNA-SeqBioinformatics, 2009
- RNA-Seq: a revolutionary tool for transcriptomicsNature Reviews Genetics, 2009
- Annotating genomes with massive-scale RNA sequencingGenome Biology, 2008
- Next-generation DNA sequencingNature Biotechnology, 2008
- A Global View of Gene Activity and Alternative Splicing by Deep Sequencing of the Human TranscriptomeScience, 2008
- Optimal spliced alignments of short sequence readsBioinformatics, 2008
- A Fast and Symmetric DUST Implementation to Mask Low-Complexity DNA SequencesJournal of Computational Biology, 2006