Expression of Conjoined Genes: Another Mechanism for Gene Regulation in Eukaryotes
Open Access
- 12 October 2010
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 5 (10) , e13284
- https://doi.org/10.1371/journal.pone.0013284
Abstract
From the ENCODE project, it is realized that almost every base of the entire human genome is transcribed. One class of transcripts resulting from this arises from the conjoined gene, which is formed by combining the exons of two or more distinct (parent) genes lying on the same strand of a chromosome. Only a very limited number of such genes are known, and the definition and terminologies used for them are highly variable in the public databases. In this work, we have computationally identified and manually curated 751 conjoined genes (CGs) in the human genome that are supported by at least one mRNA or EST sequence available in the NCBI database. 353 representative CGs, of which 291 (82%) could be confirmed, were subjected to experimental validation using RT-PCR and sequencing methods. We speculate that these genes are arising out of novel functional requirements and are not merely artifacts of transcription, since more than 70% of them are conserved in other vertebrate genomes. The unique splicing patterns exhibited by CGs reveal their possible roles in protein evolution or gene regulation. Novel CGs, for which no transcript is available, could be identified in 80% of randomly selected potential CG forming regions, indicating that their formation is a routine process. Formation of CGs is not only limited to human, as we have also identified 270 CGs in mouse and 227 in drosophila using our approach. Additionally, we propose a novel mechanism for the formation of CGs. Finally, we developed a database, ConjoinG, which contains detailed information about all the CGs (800 in total) identified in the human genome. In summary, our findings reveal new insights about the functionality of CGs in terms of another possible mechanism for gene regulation and genomic evolution and the mechanism leading to their formation.Keywords
This publication has 36 references indexed in Scilit:
- Nonsense-mediated mRNA decay (NMD) mechanismsNature Structural & Molecular Biology, 2009
- Short Homologous Sequences Are Strongly Associated with the Generation of Chimeric RNAs in EukaryotesJournal of Molecular Evolution, 2008
- Alternative Polyadenylation: A Twist on mRNA 3′ End FormationACS Chemical Biology, 2008
- A Neoplastic Gene Fusion Mimics Trans-Splicing of RNAs in Normal Human CellsScience, 2008
- RBM6-RBM5 transcription-induced chimeras are differentially expressed in tumoursBMC Genomics, 2007
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- Fusion transcripts and transcribed retrotransposed loci discovered through comprehensive transcriptome analysis using Paired-End diTags (PETs)Genome Research, 2007
- Prominent use of distal 5′ transcription start sites and discovery of a large number of additional exons in ENCODE regionsGenome Research, 2007
- Functional Replacement of the RING, B-Box 2, and Coiled-Coil Domains of Tripartite Motif 5α (TRIM5α) by Heterologous TRIM DomainsJournal of Virology, 2006
- What is a gene?Nature, 2006