ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins
Top Cited Papers
Open Access
- 1 July 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (13) , 3625-3630
- https://doi.org/10.1093/nar/gkg545
Abstract
Multidomain proteins predominate in eukaryotic proteomes. Individual functions assigned to different sequence segments combine to create a complex function for the whole protein. While on-line resources are available for revealing globular domains in sequences, there has hitherto been no comprehensive collection of small functional sites/motifs comparable to the globular domain resources, yet these are as important for the function of multidomain proteins. Short linear peptide motifs are used for cell compartment targeting, protein–protein interaction, regulation by phosphorylation, acetylation, glycosylation and a host of other post-translational modifications. ELM, the Eukaryotic Linear Motif server at http://elm.eu.org/, is a new bioinformatics resource for investigating candidate short non-globular functional motifs in eukaryotic proteins, aiming to fill the void in bioinformatics tools. Sequence comparisons with short motifs are difficult to evaluate because the usual significance assessments are inappropriate. Therefore the server is implemented with several logical filters to eliminate false positives. Current filters are for cell compartment, globular domain clash and taxonomic range. In favourable cases, the filters can reduce the number of retained matches by an order of magnitude or more.Keywords
This publication has 39 references indexed in Scilit:
- Normalization of nomenclature for peptide motifs as ligands of modular protein domainsPublished by Wiley ,2004
- Coupling of folding and binding for unstructured proteinsCurrent Opinion in Structural Biology, 2002
- DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactionsNucleic Acids Research, 2002
- The Pfam Protein Families DatabaseNucleic Acids Research, 2002
- Recent improvements to the SMART domain-based sequence annotation resourceNucleic Acids Research, 2002
- The PROSITE database, its status in 2002Nucleic Acids Research, 2002
- Database resources of the National Center for Biotechnology Information: 2002 updateNucleic Acids Research, 2002
- MINT: a Molecular INTeraction databaseFEBS Letters, 2001
- Sumo, ubiquitin's mysterious cousinNature Reviews Molecular Cell Biology, 2001
- Caenorhabditis elegans has a single pathway to target matrix proteins to peroxisomesEMBO Reports, 2000