Abundant novel transcriptional units and unconventional gene pairs on human chromosome 22
Open Access
- 12 December 2005
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 16 (1) , 45-54
- https://doi.org/10.1101/gr.3883606
Abstract
Novel transcriptional units (TUs) are EST-supported transcribed features not corresponding to known genes. Unconventional gene pairs (UGPs) are pairs of genes and/or TUs sharing exon-to-exon cis-antisense overlaps or putative bidirectional promoters. Computational TU and UGP discovery followed by manual curation was performed in the entire published 34.9-Mb human chromosome 22 euchromatic sequence. Novel TUs (n = 517) were as abundant as known genes (n = 492) and typically did not have nonprimate DNA and protein homologies. One hundred seventy-one (33%) of TUs, but only 13 (3%) of genes, both lacked nonprimate conservation and localized to gaps in the human–mouse BLASTZ alignment. Novel TUs were richer in exonic primate-specific interspersed repetitive elements (P = 0.001) and were more likely to rely on splice junctions provided by them, than were known genes: 19% of spliced TUs, versus 5% of spliced genes, had a splice site within a primate-specific repeat. Hence, novel TUs and known genes may represent different portions of the transcriptome. Two hundred nine (21%) of chromosome 22 transcripts participated in 77 cis-antisense and 42 promoter-sharing UGPs. Transcripts involved simultaneously in both UGP types were more common than was expected (P = 0.01). UGPs were nonrandomly distributed along the sequence: 89 (75%) clustered in distinct regions, the sum of which equaled 4.4 Mb (cis-regulatory potential of UGPs is well recognized, TUs and UGPs specific to the primate lineage may contribute to the genomic basis for primate-specific phenotypes.Keywords
This publication has 56 references indexed in Scilit:
- Comparison of the current RefSeq, Ensembl and EST databases for counting genes and gene discoveryFEBS Letters, 2004
- Finishing the euchromatic sequence of the human genomeNature, 2004
- Mammalian Overlapping Genes: The Comparative PerspectiveGenome Research, 2004
- Antisense Transcripts With FANTOM2 Clone Set and Their Implications for Gene RegulationGenome Research, 2003
- A Guide to the Mammalian Genome: Figure 1Genome Research, 2003
- Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAsNature, 2002
- The Human Genome Browser at UCSCGenome Research, 2002
- The non-coding Air RNA is required for silencing autosomal imprinted genesNature, 2002
- MicF : an antisense RNA gene involved in response of Escherichia coli to global stress factors 1 1Edited by D. DraperJournal of Molecular Biology, 2001
- Creation of genome-wide protein expression libraries using random activation of gene expressionNature Biotechnology, 2001