In silico identification of novel selenoproteins in the Drosophila melanogaster genome
- 1 August 2001
- journal article
- Published by Springer Nature in EMBO Reports
- Vol. 2 (8), 697-702
- https://doi.org/10.1093/embo-reports/kve151
Abstract
In selenoproteins, incorporation of the amino acid selenocysteine is specified by the UGA codon, usually a stop signal. The alternative decoding of UGA is conferred by an mRNA structure, the SECIS element, located in the 3′‐untranslated region of the selenoprotein mRNA. Because of the non‐standard use of the UGA codon, current computational gene prediction methods are unable to identify selenoproteins in the sequence of the eukaryotic genomes. Here we describe a method to predict selenoproteins in genomic sequences, which relies on the prediction of SECIS elements in coordination with the prediction of genes in which the strong codon bias characteristic of protein coding regions extends beyond a TGA codon interrupting the open reading frame. We applied the method to the Drosophila melanogaster genome, and predicted four potential selenoprotein genes. One of them belongs to a known family of selenoproteins, and we have tested experimentally two other predictions with positive results. Finally, we have characterized the expression pattern of these two novel selenoprotein genes.
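The gene-prediction half of the idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the `min_codons` threshold, the uniform 1/64 background, and the toy codon-usage table are all assumptions made for illustration, and SECIS element prediction is left out entirely. The sketch scans the forward strand for open reading frames that read through exactly one in-frame TGA, then scores the codons on each side of that TGA against a codon-usage table, so that a candidate is interesting only if the coding-like signal continues downstream.

```python
import math
from typing import Dict, List, Tuple

STOPS = {"TAA", "TAG", "TGA"}


def split_codons(seq: str, frame: int) -> List[str]:
    """Split seq into complete codons, starting at the given frame offset."""
    return [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]


def tga_interrupted_orfs(seq: str, min_codons: int = 2) -> List[Tuple[int, int, int]]:
    """Find forward-strand ORFs that start at ATG, read through exactly one
    in-frame TGA, and end at the next stop codon.

    Returns (start, tga_position, end) nucleotide offsets, end exclusive.
    min_codons is a hypothetical threshold on each side of the TGA.
    """
    seq = seq.upper()
    hits = []
    for frame in range(3):
        cs = split_codons(seq, frame)
        i = 0
        while i < len(cs):
            if cs[i] != "ATG":
                i += 1
                continue
            tga = None
            j = i + 1
            while j < len(cs):
                if cs[j] in STOPS:
                    if cs[j] == "TGA" and tga is None:
                        tga = j  # read through the first TGA only
                    else:
                        break  # any later stop terminates the ORF
                j += 1
            terminated = j < len(cs)
            if (terminated and tga is not None
                    and tga - i >= min_codons and j - tga - 1 >= min_codons):
                hits.append((frame + 3 * i, frame + 3 * tga, frame + 3 * j + 3))
            i = j + 1  # resume scanning after this ORF
    return hits


def flank_scores(seq: str, hit: Tuple[int, int, int],
                 usage: Dict[str, float],
                 background: float = 1 / 64) -> Tuple[float, float]:
    """Log-likelihood ratio of codon usage versus a uniform background,
    computed separately upstream and downstream of the in-frame TGA."""
    seq = seq.upper()
    start, tga, end = hit
    upstream = [seq[k:k + 3] for k in range(start, tga, 3)]
    downstream = [seq[k:k + 3] for k in range(tga + 3, end - 3, 3)]

    def score(cs: List[str]) -> float:
        return sum(math.log(usage.get(c, background) / background) for c in cs)

    return score(upstream), score(downstream)


# Toy usage table (made-up frequencies, not Drosophila values).
toy_usage = {"ATG": 0.02, "GCC": 0.05}
candidate = "ATGGCCGCCTGAGCCGCCTAA"
hits = tga_interrupted_orfs(candidate)
up, down = flank_scores(candidate, hits[0], toy_usage)
```

In a fuller implementation, a candidate would presumably be kept only when the downstream score stays positive and a predicted SECIS element lies 3′ of the terminating stop; both criteria are described only in general terms in the abstract, so the thresholds here are placeholders.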