Selecting Folded Proteins from a Library of Secondary Structural Elements
- 8 December 2007
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of the American Chemical Society
- Vol. 130 (1) , 176-185
- https://doi.org/10.1021/ja074405w
Abstract
A protein evolution strategy is described by which double-stranded DNA fragments encoding defined Escherichia coli protein secondary structural elements (α-helices, β-strands, and loops) are assembled semirandomly into sequences comprised of as many as 800 amino acid residues. A library of novel polypeptides generated from this system was inserted into an enhanced green fluorescent protein (EGFP) fusion vector. Library members were screened by fluorescence activated cell sorting (FACS) to identify those polypeptides that fold into soluble, stable structures in vivo that comprised a subset of shorter sequences (∼60 to 100 residues) from the semirandom sequence library. Approximately 108 clones were screened by FACS, a set of 1149 high fluorescence colonies were characterized by dPCR, and four soluble clones with varying amounts of secondary structure were identified. One of these is highly homologous to a domain of aspartate racemase from a marine bacterium (Polaromonas sp.) but is not homologous to any E. coli protein sequence. Several other selected polypeptides have no global sequence homology to any known protein but show significant α-helical content, limited dispersion in 1D nuclear magnetic resonance spectra, pH sensitive ANS binding and reversible folding into soluble structures. These results demonstrate that this strategy can generate novel polypeptide sequences containing secondary structure.Keywords
This publication has 59 references indexed in Scilit:
- Repeat-induced epigenetic changes in intron 1 of the frataxin gene and its consequences in Friedreich ataxiaNucleic Acids Research, 2007
- The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolutionNucleic Acids Research, 2007
- Physical Origins of Protein SuperfamiliesJournal of Molecular Biology, 2006
- Stably folded de novo proteins from a designed combinatorial libraryProtein Science, 2003
- De novo Backbone and Sequence Design of an Idealized α/β-barrel Protein: Evidence of Stable Tertiary StructureJournal of Molecular Biology, 2003
- The Protein Data BankNucleic Acids Research, 2000
- Protein folds, functions and evolutionJournal of Molecular Biology, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Basic local alignment search toolJournal of Molecular Biology, 1990