Identifying differentially expressed transcripts from RNA-seq data with biological variation
Open Access
- 3 May 2012
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 28 (13) , 1721-1728
- https://doi.org/10.1093/bioinformatics/bts260
Abstract
Motivation: High-throughput sequencing enables expression analysis at the level of individual transcripts. The analysis of transcriptome expression levels and differential expression (DE) estimation requires a probabilistic approach to properly account for ambiguity caused by shared exons and finite read sampling as well as the intrinsic biological variance of transcript expression. Results: We present Bayesian inference of transcripts from sequencing data (BitSeq), a Bayesian approach for estimation of transcript expression level from RNA-seq experiments. Inferred relative expression is represented by Markov chain Monte Carlo samples from the posterior probability distribution of a generative model of the read data. We propose a novel method for DE analysis across replicates which propagates uncertainty from the sample-level model while modelling biological variance using an expression-level-dependent prior. We demonstrate the advantages of our method using simulated data as well as an RNA-seq dataset with technical and biological replication for both studied conditions. Availability: The implementation of the transcriptome expression estimation and differential expression analysis, BitSeq, has been written in C++ and Python. The software is available online from http://code.google.com/p/bitseq/, version 0.4 was used for generating results presented in this article. Contact:glaus@cs.man.ac.uk, antti.honkela@hiit.fi or m.rattray@sheffield.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
All Related Versions
This publication has 32 references indexed in Scilit:
- The developmental transcriptome of Drosophila melanogasterNature, 2010
- Using non-uniform read distribution models to improve isoform expression inference in RNA-SeqBioinformatics, 2010
- Analysis and design of RNA sequencing experiments for identifying isoform regulationNature Methods, 2010
- Single base–resolution methylome of the silkworm reveals a sparse epigenomic mapNature Biotechnology, 2010
- Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiationNature Biotechnology, 2010
- RNA-Seq gene expression estimation with read mapping uncertaintyBioinformatics, 2009
- edgeR: a Bioconductor package for differential expression analysis of digital gene expression dataBioinformatics, 2009
- RNA-Seq: a revolutionary tool for transcriptomicsNature Reviews Genetics, 2009
- Mapping and quantifying mammalian transcriptomes by RNA-SeqNature Methods, 2008
- The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurementsNature Biotechnology, 2006