The Arabidopsis Unannotated Secreted Peptide Database, a Resource for Plant Peptidomics
- 22 September 2006
- journal article
- Published by Oxford University Press (OUP) in Plant Physiology
- Vol. 142 (3) , 831-838
- https://doi.org/10.1104/pp.106.086041
Abstract
In the era of genomics, if a gene is not annotated, it is not investigated. Due to their small size, genes encoding peptides are often missed in genome annotations. Secreted peptides are important regulators of plant growth, development, and physiology. Identification of additional peptide signals by sequence homology searches has had limited success due to sequence heterogeneity. A bioinformatics approach was taken to find unannotated Arabidopsis (Arabidopsis thaliana) peptides. Arabidopsis chromosome sequences were searched for all open reading frames (ORFs) encoding peptides and small proteins between 25 and 250 amino acids in length. The translated ORFs were then sequentially queried for the presence of an amino-terminal cleavable signal peptide, the absence of transmembrane domains, and the absence of endoplasmic reticulum lumenal retention sequences. Next, the ORFs were filtered against the The Arabidopsis Information Resource 6.0 annotated Arabidopsis genes to remove those ORFs overlapping known genes. The remaining 33,809 ORFs were placed in a relational database to which additional annotation data were deposited. Genome-wide tiling array data were compared with the coordinates of the ORFs, supporting the possibility that many of the ORFs may be expressed. In addition, clustering and sequence similarity analyses revealed that many of the putative peptides are in gene families and/or appear to be present in the rice (Oryza sativa) genome. A subset of the ORFs was evaluated by reverse transcription-PCR and, for one-fifth of those, expression was detected. These results support the idea that the number and diversity of plant peptides is broader than currently assumed. The peptides identified and their annotation data may be viewed or downloaded through a searchable Web interface at peptidome.missouri.edu.Keywords
This publication has 39 references indexed in Scilit:
- The cell surface leucine-rich repeat receptor for At Pep1, an endogenous peptide elicitor in Arabidopsis , is functional in transgenic tobacco cellsProceedings of the National Academy of Sciences, 2006
- An endogenous peptide signal in Arabidopsis activates components of the innate immune responseProceedings of the National Academy of Sciences, 2006
- Improved Prediction of Signal Peptides: SignalP 3.0Journal of Molecular Biology, 2004
- Feature-based prediction of non-classical and leaderless protein secretionProtein Engineering, Design and Selection, 2004
- PHYTOCHELATINS AND METALLOTHIONEINS: Roles in Heavy Metal Detoxification and HomeostasisAnnual Review of Plant Biology, 2002
- Predicting transmembrane protein topology with a hidden markov model: application to complete genomes11Edited by F. CohenJournal of Molecular Biology, 2001
- Probability-based protein identification by searching sequence databases using mass spectrometry dataElectrophoresis, 1999
- Plant defense peptidesBiopolymers, 1998
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Basic local alignment search toolJournal of Molecular Biology, 1990