Computation for ChIP-seq and RNA-seq studies
Top Cited Papers
- 15 October 2009
- journal article
- review article
- Published by Springer Nature in Nature Methods
- Vol. 6 (S11) , S22-S32
- https://doi.org/10.1038/nmeth.1371
Abstract
Genome-wide measurements of protein-DNA interactions and transcriptomes are increasingly done by deep DNA sequencing methods (ChIP-seq and RNA-seq). The power and richness of these counting-based measurements comes at the cost of routinely handling tens to hundreds of millions of reads. Whereas early adopters necessarily developed their own custom computer code to analyze the first ChIP-seq and RNA-seq datasets, a new generation of more sophisticated algorithms and software tools are emerging to assist in the analysis phase of these projects. Here we describe the multilayered analyses of ChIP-seq and RNA-seq datasets, discuss the software packages currently available to perform tasks at each layer and describe some upcoming challenges and features for future analysis tools. We also discuss how software choices and uses are affected by specific aspects of the underlying biology and data structure, including genome size, positional clustering of transcription factor binding sites, transcript discovery and expression quantification.Keywords
This publication has 47 references indexed in Scilit:
- De novo transcriptome assembly with ABySSBioinformatics, 2009
- SOAP: short oligonucleotide alignment programBioinformatics, 2008
- Sequence census methods for functional genomicsNature Methods, 2007
- Genome-wide maps of chromatin state in pluripotent and lineage-committed cellsNature, 2007
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencingNature Methods, 2007
- Genome-Wide Mapping of in Vivo Protein-DNA InteractionsScience, 2007
- High-Resolution Profiling of Histone Methylations in the Human GenomePublished by Elsevier ,2007
- Chromosome Conformation Capture Carbon Copy (5C): A massively parallel solution for mapping interactions between genomic elementsGenome Research, 2006
- Variance stabilization applied to microarray data calibration and to the quantification of differential expressionBioinformatics, 2002