A First Look at ARFome: Dual-Coding Genes in Mammalian Genomes
Open Access
- 18 May 2007
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 3 (5) , e91
- https://doi.org/10.1371/journal.pcbi.0030091
Abstract
Coding of multiple proteins by overlapping reading frames is not a feature one would associate with eukaryotic genes. Indeed, codependency between codons of overlapping protein-coding regions imposes a unique set of evolutionary constraints, making it a costly arrangement. Yet in cases of tightly coexpressed interacting proteins, dual coding may be advantageous. Here we show that although dual coding is nearly impossible by chance, a number of human transcripts contain overlapping coding regions. Using newly developed statistical techniques, we identified 40 candidate genes with evolutionarily conserved overlapping coding regions. Because our approach is conservative, we expect mammals to possess more dual-coding genes. Our results emphasize that the skepticism surrounding eukaryotic dual coding is unwarranted: rather than being artifacts, overlapping reading frames are often hallmarks of fascinating biology. A textbook human gene encodes a protein using a single reading frame. Alternative splicing brings some variation to that picture, but the notion of a single reading frame remains. Although this is true for most of our genes, there are exceptions. Like viral counterparts, some eukaryotic genes produce structurally unrelated proteins from overlapping reading frames. The examples are spectacular (G-protein alpha subunit [Gnas1] or INK4a tumor suppressor), but scarce. The scarcity is anthropogenic in origin: we simply do not believe that dual-coding genes can occur in eukaryotes. To challenge this assumption, we performed the first genome-wide scan for mammalian genes containing alternative reading frames located out of frame relative to the annotated protein-coding region. Using a newly developed statistical framework, we identified 40 such genes. Because our approach is very conservative, this number is likely a significant underestimate, and future studies will identify more alternative reading frame–containing genes with fascinating biology.Keywords
This publication has 27 references indexed in Scilit:
- pXBP1(U) encoded in XBP1 pre-mRNA negatively regulates unfolded protein response activator pXBP1(S) in mammalian ER stress responseThe Journal of cell biology, 2006
- A genome-wide study of dual coding regions in human alternatively spliced genesGenome Research, 2005
- Oscillating Evolution of a Mammalian Locus with Overlapping Reading Frames: An XLαs/ALEX RelayPLoS Genetics, 2005
- INK4a/ARF: A multifunctional tumor suppressor locusMutation Research - Fundamental and Molecular Mechanisms of Mutagenesis, 2005
- Functional polymorphisms in the paternally expressed XLalphas and its cofactor ALEX decrease their mutual interaction and enhance receptor-mediated cAMP formation.Human Molecular Genetics, 2003
- IRE1 couples endoplasmic reticulum load to secretory capacity by processing the XBP-1 mRNANature, 2002
- XBP1 mRNA Is Induced by ATF6 and Spliced by IRE1 in Response to ER Stress to Produce a Highly Active Transcription FactorCell, 2001
- Two overlapping reading frames in a single exon encode interacting proteins--a novel way of gene usageThe EMBO Journal, 2001
- Alternative reading frames of the INK4a tumor suppressor gene encode two unrelated proteins capable of inducing cell cycle arrestCell, 1995
- Origins of genes: "big bang" or continuous creation?Proceedings of the National Academy of Sciences, 1992