High-resolution mapping of protein sequence-function relationships
Top Cited Papers
Open Access
- 15 August 2010
- journal article
- research article
- Published by Springer Nature in Nature Methods
- Vol. 7 (9) , 741-746
- https://doi.org/10.1038/nmeth.1492
Abstract
The combination of protein display, moderate selection for protein activity and high-throughput DNA sequencing can be applied to hundreds of thousands of protein variants in parallel, enabling the derivation of sequence-function relationships. We present a large-scale approach to investigate the functional consequences of sequence variation in a protein. The approach entails the display of hundreds of thousands of protein variants, moderate selection for activity and high-throughput DNA sequencing to quantify the performance of each variant. Using this strategy, we tracked the performance of >600,000 variants of a human WW domain after three and six rounds of selection by phage display for binding to its peptide ligand. Binding properties of these variants defined a high-resolution map of mutational preference across the WW domain; each position had unique features that could not be captured by a few representative mutations. Our approach could be applied to many in vitro or in vivo protein assays, providing a general means for understanding how protein function relates to sequence.Keywords
This publication has 32 references indexed in Scilit:
- Rapid interactome profiling by massive sequencingNucleic Acids Research, 2010
- Next-Generation Phage Display: Integrating and Comparing Available Molecular Tools to Enable Cost-Effective High-Throughput AnalysisPLOS ONE, 2009
- A large genome center's improvements to the Illumina sequencing systemNature Methods, 2008
- Robustness–epistasis link shapes the fitness landscape of a randomly drifting proteinNature, 2006
- Darwinian Evolution Can Follow Only Very Few Mutational Paths to Fitter ProteinsScience, 2006
- Survey of allelic expression using EST miningGenome Research, 2005
- Probing WW Domains to Uncover and Refine Determinants of Specificity in Ligand RecognitionCytotechnology, 2003
- Statistical significance for genomewide studiesProceedings of the National Academy of Sciences, 2003
- Solution structures of the YAP65 WW domain and the variant L30 K in complex with the peptides GTPPPPYTVG, N-(n-octyl)-GPPPY and PLPPY and the application of peptide libraries reveal a minimal binding epitopeJournal of Molecular Biology, 2001
- The folding mechanism of a β-sheet: the WW domainJournal of Molecular Biology, 2001