Precise determination of the diversity of a combinatorial antibody library gives insight into the human immunoglobulin repertoire
Top Cited Papers
- 1 December 2009
- journal article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 106 (48) , 20216-20221
- https://doi.org/10.1073/pnas.0909775106
Abstract
Antibody repertoire diversity, potentially as high as 1011unique molecules in a single individual, confounds characterization by conventional sequence analyses. In this study, we present a general method for assessing human antibody sequence diversity displayed on phage using massively parallel pyrosequencing, a novel application of Kabat column-labeled profile Hidden Markov Models, and translated complementarity determining region (CDR) capture-recapture analysis. Pyrosequencing of domain amplicon and RCA PCR products generated 1.5 × 106reads, including more than 1.9 × 105high quality, full-length sequences of antibody variable fragment (Fv) variable domains. Novel methods for germline and CDR classification and fine characterization of sequence diversity in the 6 CDRs are presented. Diverse germline contributions to the repertoire with random heavy and light chain pairing are observed. All germline families were found to be represented in 1.7 × 104sequences obtained from repeated panning of the library. While the most variable CDR (CDR-H3) presents significant length and sequence variability, we find a substantial contribution to total diversity from somatically mutated germline encoded CDRs 1 and 2. Using a capture-recapture method, the total diversity of the antibody library obtained from a human donor Immunoglobulin M (IgM) pool was determined to be at least 3.5 × 1010. The results provide insights into the role of IgM diversification, display library construction, and productive germline usages in antibody libraries and the humoral repertoire.Keywords
This publication has 37 references indexed in Scilit:
- Profiling the T-cell receptor beta-chain repertoire by massively parallel sequencingGenome Research, 2009
- High-Throughput Sequencing of the Zebrafish Antibody RepertoireScience, 2009
- IMGT(R), the international ImMunoGeneTics information system(R)Nucleic Acids Research, 2008
- Analysis and improvements to Kabat and structurally correct numbering of antibody variable domainsMolecular Immunology, 2008
- Somatic diversification in the absence of antigen-driven responses is the hallmark of the IgM+IgD+CD27+ B cell repertoire in infantsThe Journal of Experimental Medicine, 2008
- Use of simulated data sets to evaluate the fidelity of metagenomic processing methodsNature Methods, 2007
- Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methodsJournal of Molecular Biology, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Human Antibodies with Sub-nanomolar Affinities Isolated from a Large Non-immunized Phage Display LibraryNature Biotechnology, 1996
- Maximum Discrimination Hidden Markov Models of Sequence ConsensusJournal of Computational Biology, 1995