An iterative statistical approach to the identification of protein phosphorylation motifs from large-scale data sets
Top Cited Papers
- 4 November 2005
- journal article
- research article
- Published by Springer Nature in Nature Biotechnology
- Vol. 23 (11) , 1391-1398
- https://doi.org/10.1038/nbt1146
Abstract
With the recent exponential increase in protein phosphorylation sites identified by mass spectrometry, a unique opportunity has arisen to understand the motifs surrounding such sites. Here we present an algorithm designed to extract motifs from large data sets of naturally occurring phosphorylation sites. The methodology relies on the intrinsic alignment of phospho-residues and the extraction of motifs through iterative comparison to a dynamic statistical background. Results show the identification of dozens of novel and known phosphorylation motifs from recently published serine, threonine and tyrosine phosphorylation studies. When applied to a linguistic data set to test the versatility of the approach, the algorithm successfully extracted hundreds of language motifs. This method, in addition to shedding light on the consensus sequences of identified and as yet unidentified kinases and modular protein domains, may also eventually be used as a tool to determine potential phosphorylation sites in proteins of interest.Keywords
This publication has 27 references indexed in Scilit:
- SCF-mediated protein degradation and cell cycle controlOncogene, 2005
- Mass Spectrometric Contributions to the Practice of Phosphorylation Site Mapping through 2003Molecular & Cellular Proteomics, 2005
- Quantitative Phosphoproteomics Applied to the Yeast Pheromone Signaling PathwayMolecular & Cellular Proteomics, 2005
- Immunoaffinity profiling of tyrosine phosphorylation in cancer cellsNature Biotechnology, 2005
- Phosphoproteomic Analysis of the Developing Mouse BrainMolecular & Cellular Proteomics, 2004
- Subsets of the Major Tyrosine Phosphorylation Sites in Crk-associated Substrate (CAS) Are Sufficient to Promote Cell MigrationJournal of Biological Chemistry, 2004
- WebLogo: A Sequence Logo Generator: Figure 1Genome Research, 2004
- Phosphoproteome analysis by mass spectrometry and its application to Saccharomyces cerevisiaeNature Biotechnology, 2002
- Finding flexible patterns in unaligned protein sequencesProtein Science, 1995
- Sequence logos: a new way to display consensus sequencesNucleic Acids Research, 1990