Automated analysis of high-throughput B-cell sequencing data reveals a high frequency of novel immunoglobulin V gene segment alleles
- 9 February 2015
- journal article
- Published in Proceedings of the National Academy of Sciences
- Vol. 112 (8), E862-70
- https://doi.org/10.1073/pnas.1417683112
Abstract
Individual variation in germline and expressed B-cell immunoglobulin (Ig) repertoires has been associated with aging, disease susceptibility, and differential response to infection and vaccination. Repertoire properties can now be studied at large scale through next-generation sequencing of rearranged Ig genes. Accurate analysis of these repertoire-sequencing (Rep-Seq) data requires identifying the germline variable (V), diversity (D), and joining (J) gene segments used by each Ig sequence. Current V(D)J assignment methods work by aligning sequences to a database of known germline V(D)J segment alleles. However, existing databases are likely to be incomplete and novel polymorphisms are hard to differentiate from the frequent occurrence of somatic hypermutations in Ig sequences. Here we develop a Tool for Ig Genotype Elucidation via Rep-Seq (TIgGER). TIgGER analyzes mutation patterns in Rep-Seq data to identify novel V segment alleles, and also constructs a personalized germline database containing the specific set of alleles carried by a subject. This information is then used to improve the initial V segment assignments from existing tools, like IMGT/HighV-QUEST. The application of TIgGER to Rep-Seq data from seven subjects identified 11 novel V segment alleles, including at least one in every subject examined. These novel alleles constituted 13% of the total number of unique alleles in these subjects, and impacted 3% of V(D)J segment assignments. These results reinforce the highly polymorphic nature of human Ig V genes, and suggest that many novel alleles remain to be discovered. The integration of TIgGER into Rep-Seq processing pipelines will increase the accuracy of V segment assignments, thus improving B-cell repertoire analyses.
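To make the abstract's central difficulty concrete (telling a germline polymorphism apart from somatic hypermutation), here is a toy Python sketch. It is not the TIgGER implementation, and the abstract does not specify the method at this level of detail. The function names, the `max_mutations` and `min_fraction` thresholds, and the example sequences are all hypothetical. It assumes reads are already aligned to the assigned germline allele, have the same length, and contain no indels. The idea illustrated is only this: a mismatch that recurs at the same position in almost every lightly mutated read is more plausibly inherited than somatic.

```python
from collections import Counter


def mutation_positions(germline, read):
    """Return the indices where an aligned read differs from the germline allele."""
    return [i for i, (g, r) in enumerate(zip(germline, read)) if g != r and r != "N"]


def candidate_polymorphisms(germline, reads, max_mutations=3, min_fraction=0.8):
    """Flag (position, base) mismatches shared by most lightly mutated reads.

    Hypothetical thresholds: reads with more than `max_mutations` mismatches are
    ignored, and a mismatch is reported only if it appears in at least
    `min_fraction` of the remaining reads.
    """
    # Keep only lightly mutated reads, where somatic hypermutation is sparse.
    lightly_mutated = []
    for read in reads:
        positions = mutation_positions(germline, read)
        if len(positions) <= max_mutations:
            lightly_mutated.append((read, positions))
    if not lightly_mutated:
        return {}

    # Count how often each specific mismatch occurs across those reads.
    counts = Counter()
    for read, positions in lightly_mutated:
        for i in positions:
            counts[(i, read[i])] += 1

    total = len(lightly_mutated)
    return {key: n / total for key, n in counts.items() if n / total >= min_fraction}


# Made-up example data: every lightly mutated read carries T at index 4.
germline = "ACGTACGTAC"
reads = [
    "ACGTTCGTAC",
    "ACGTTCGTAC",
    "ACGTTCGAAC",
    "ACGTTCGTAC",
    "CCGTTCGTAC",
    "TTTTTTTTTT",  # heavily mutated, so it is excluded from the count
]
print(candidate_polymorphisms(germline, reads))
```

In this made-up input, the mismatch at index 4 is shared by all five lightly mutated reads and is reported, while the sporadic mismatches at indices 0 and 7 fall below the threshold. A real analysis would also need to handle sequencing error, alignment gaps, and uneven allele usage, none of which this sketch attempts.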
Funding Information
- Division of Intramural Research, National Institute of Allergy and Infectious Diseases (R01AI104739)
- HHS | NIH | U.S. National Library of Medicine (T15LM07056)