Maximum entropy models for antibody diversity
Top Cited Papers
- 8 March 2010
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 107 (12) , 5405-5410
- https://doi.org/10.1073/pnas.1001705107
Abstract
Recognition of pathogens relies on families of proteins showing great diversity. Here we construct maximum entropy models of the sequence repertoire, building on recent experiments that provide a nearly exhaustive sampling of the IgM sequences in zebrafish. These models are based solely on pairwise correlations between residue positions but correctly capture the higher order statistical properties of the repertoire. By exploiting the interpretation of these models as statistical physics problems, we make several predictions for the collective properties of the sequence ensemble: The distribution of sequences obeys Zipf's law, the repertoire decomposes into several clusters, and there is a massive restriction of diversity because of the correlations. These predictions are completely inconsistent with models in which amino acid substitutions are made independently at each site and are in good agreement with the data. Our results suggest that antibody diversity is not limited by the sequences encoded in the genome and may reflect rapid adaptation to antigenic challenges. This approach should be applicable to the study of the global properties of other protein families.Keywords
All Related Versions
This publication has 22 references indexed in Scilit:
- Protein Sectors: Evolutionary Units of Three-Dimensional StructurePublished by Elsevier ,2009
- Inferring species interactions in tropical forestsProceedings of the National Academy of Sciences, 2009
- Maximum-entropy network analysis reveals a role for tumor necrosis factor in peripheral nerve development and functionProceedings of the National Academy of Sciences, 2009
- High-Throughput Sequencing of the Zebrafish Antibody RepertoireScience, 2009
- Identification of direct residue contacts in protein–protein interaction by message passingProceedings of the National Academy of Sciences, 2009
- Weak pairwise correlations imply strongly correlated network states in a neural populationNature, 2006
- Power laws, Pareto distributions and Zipf's lawContemporary Physics, 2005
- Network Information and Connected CorrelationsPhysical Review Letters, 2003
- Reaction-rate theory: fifty years after KramersReviews of Modern Physics, 1990
- Information Theory and Statistical MechanicsPhysical Review B, 1957