Manipulation of FASTQ data with Galaxy
Top Cited Papers
Open Access
- 18 June 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 26 (14) , 1783-1785
- https://doi.org/10.1093/bioinformatics/btq281
Abstract
Summary: Here, we describe a tool suite that functions on all of the commonly known FASTQ format variants and provides a pipeline for manipulating next generation sequencing data taken from a sequencing machine all the way through the quality filtering steps. Availability and Implementation: This open-source toolset was implemented in Python and has been integrated into the online data analysis platform Galaxy (public web access: http://usegalaxy.org; download: http://getgalaxy.org). Two short movies that highlight the functionality of tools described in this manuscript as well as results from testing components of this tool suite against a set of previously published files are available at http://usegalaxy.org/u/dan/p/fastq Contact:james.taylor@emory.edu; anton@bx.psu.edu Supplementary information:Supplementary data are available at Bioinformatics online.This publication has 4 references indexed in Scilit:
- Galaxy: A Web‐Based Genome Analysis Tool for ExperimentalistsCurrent Protocols in Molecular Biology, 2010
- The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variantsNucleic Acids Research, 2009
- Using Galaxy to Perform Large‐Scale Interactive Data AnalysesCurrent Protocols in Bioinformatics, 2007
- A framework for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendlyGenome Research, 2007