VDJtools: Unifying Post-analysis of T Cell Receptor Repertoires
Top Cited Papers
Open Access
- 25 November 2015
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 11 (11) , e1004503
- https://doi.org/10.1371/journal.pcbi.1004503
Abstract
Despite the growing number of immune repertoire sequencing studies, the field still lacks software for analysis and comprehension of this high-dimensional data. Here we report VDJtools, a complementary software suite that solves a wide range of T cell receptor (TCR) repertoires post-analysis tasks, provides a detailed tabular output and publication-ready graphics, and is built on top of a flexible API. Using TCR datasets for a large cohort of unrelated healthy donors, twins, and multiple sclerosis patients we demonstrate that VDJtools greatly facilitates the analysis and leads to sound biological conclusions. VDJtools software and documentation are available at https://github.com/mikessh/vdjtools. High-throughput profiling of T- and B-cell antigen receptor repertoires promises great advances in our understanding of the mechanisms underlying adaptive immune system function, treatment of autoimmune and infectious diseases, and development of novel approaches in cancer immunotherapy. A number of recently developed software tools aim at processing immune repertoire data by mapping Variable (V), Diversity (D) and Joining (J) antigen receptor segments to sequencing reads and assembling T- and B-cell clonotypes. Nevertheless, there still exists a major gap in common methods of data post-analysis in the field: there is no standardized data format so far, and most of data comparative analysis is carried out using a variety of in-house scripts. Here we present VDJtools, a software framework that can analyze output of most commonly used TCR repertoire processing tools and allows applying a diverse set of post-analysis strategies. The main aims of our framework are: To ensure consistency of post-analysis methods and reproducibility of obtained results; to save the time of bioinformaticians analyzing TCR repertoire data by providing comprehensive tabular output and open-source API; and to provide a simple enough command line tool so that immunologists and biologists with little computational background could use it to generate publication-ready results.Keywords
This publication has 61 references indexed in Scilit:
- Biased usage and preferred pairing of α- and β-chains of TCRs specific for an immunodominant gluten epitope in coeliac diseaseInternational Immunology, 2013
- MiTCR: software for T-cell receptor sequencing data analysisNature Methods, 2013
- IgBLAST: an immunoglobulin variable domain sequence analysis toolNucleic Acids Research, 2013
- Deep-sequencing approach for minimal residual disease detection in acute lymphoblastic leukemiaBlood, 2012
- Bioinformatic approaches for functional annotation and pathway inference in metagenomics dataBriefings in Bioinformatics, 2012
- Detection and quantification of rare mutations with massively parallel sequencingProceedings of the National Academy of Sciences, 2011
- Quantitative tracking of T cell clones after haematopoietic stem cell transplantationEMBO Molecular Medicine, 2011
- Overlap and Effective Size of the Human CD8 + T Cell Receptor RepertoireScience Translational Medicine, 2010
- Comprehensive assessment of T-cell receptor β-chain diversity in αβ T cellsBlood, 2009
- Quantifying biodiversity: procedures and pitfalls in the measurement and comparison of species richnessEcology Letters, 2001