Multiple aromatic side chains within a disordered structure are critical for transcription and transforming activity of EWS family oncoproteins
- 9 January 2007
- journal article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 104 (2) , 479-484
- https://doi.org/10.1073/pnas.0607007104
Abstract
Chromosomal translocations involving the N-terminal ≈250 residues of the Ewings sarcoma (EWS) oncogene produce a group of EWS fusion proteins (EFPs) that cause several distinct human cancers. EFPs are potent transcriptional activators and interact with other proteins required for mRNA biogenesis, indicating that EFPs induce tumorigenesis by perturbing gene expression. Although EFPs were discovered more than a decade ago, molecular analysis has been greatly hindered by the repetitive EWS activation domain (EAD) structure, containing multiple degenerate hexapeptide repeats (consensus SYGQQS) with a conserved tyrosine residue. By exploiting total gene synthesis, we have been able to systematically mutagenize the EAD and determine the effect on transcriptional activation by EWS/ATF1 and cellular transformation by EWS/Fli1. In both assays, we find the following requirements for EAD function. First, multiple tyrosine residues are essential. Second, phenylalanine can effectively substitute for tyrosine, showing that an aromatic ring can confer EAD function in the absence of tyrosine phosphorylation. Third, there is little requirement for specific peptide sequences and, thus, overall sequence composition (and not the degenerate hexapeptide repeat) confers EAD activity. Consistent with the above findings, we also report that the EAD is intrinsically disordered. However, a sensitive computational predictor of natural protein disorder (PONDR VL3) identifies potential molecular recognition features that are tyrosine-dependent and that correlate well with EAD function. In summary we have uncovered several molecular features of the EAD that will impact future studies of the broader EFP family and molecular recognition by complex intrinsically disordered proteins.Keywords
This publication has 50 references indexed in Scilit:
- Intrinsic Disorder in Transcription FactorsBiochemistry, 2006
- Towards a proteome-scale map of the human protein–protein interaction networkNature, 2005
- Specificity and versatility of SH3 and other proline-recognition domains: structural basis and implications for cellular signal transductionBiochemical Journal, 2005
- Coupled Folding and Binding with α-Helix-Forming Molecular Recognition ElementsBiochemistry, 2005
- Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signalingJournal of Molecular Recognition, 2005
- IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy contentBioinformatics, 2005
- Intrinsically unstructured proteins and their functionsNature Reviews Molecular Cell Biology, 2005
- Preformed Structural Elements Feature in Partner Recognition by Intrinsically Unstructured ProteinsJournal of Molecular Biology, 2004
- Predicting intrinsic disorder from amino acid sequenceProteins-Structure Function and Bioinformatics, 2003
- Intrinsically unstructured proteins evolve by repeat expansionBioEssays, 2003