Tabix: fast retrieval of sequence features from generic TAB-delimited files
Top Cited Papers
Open Access
- 5 January 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 27 (5) , 718-719
- https://doi.org/10.1093/bioinformatics/btq671
Abstract
Summary: Tabix is the first generic tool that indexes position sorted files in TAB-delimited formats such as GFF, BED, PSL, SAM and SQL export, and quickly retrieves features overlapping specified regions. Tabix features include few seek function calls per query, data compression with gzip compatibility and direct FTP/HTTP access. Tabix is implemented as a free command-line tool as well as a library in C, Java, Perl and Python. It is particularly useful for manually examining local genomic features on the command line and enables genome viewers to support huge data files and remote custom tracks over networks. Availability and Implementation: http://samtools.sourceforge.net. Contact: hengli@broadinstitute.orgKeywords
This publication has 7 references indexed in Scilit:
- BigWig and BigBed: enabling browsing of large distributed datasetsBioinformatics, 2010
- The UCSC Genome Browser database: update 2010Nucleic Acids Research, 2009
- The Sequence Alignment/Map format and SAMtoolsBioinformatics, 2009
- Accurate whole human genome sequencing using reversible terminator chemistryNature, 2008
- Nested Containment List (NCList): a new algorithm for accelerating interval query of genome alignment and interval databasesBioinformatics, 2007
- The Generic Genome Browser: A Building Block for a Model Organism System DatabaseGenome Research, 2002
- The Human Genome Browser at UCSCGenome Research, 2002