Conserved elements with potential to form polymorphic G-quadruplex structures in the first intron of human genes
Open Access
- 10 January 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (4) , 1321-1333
- https://doi.org/10.1093/nar/gkm1138
Abstract
To understand how potential for G-quadruplex formation might influence regulation of gene expression, we examined the 2 kb spanning the transcription start sites (TSS) of the 18 217 human RefSeq genes, distinguishing contributions of template and nontemplate strands. Regions both upstream and downstream of the TSS are G-rich, but the downstream region displays a clear bias toward G-richness on the nontemplate strand. Upstream of the TSS, much of the G-richness and potential for G-quadruplex formation derives from the presence of well-defined canonical regulatory motifs in duplex DNA, including CpG dinucleotides which are sites of regulatory methylation, and motifs recognized by the transcription factor SP1. This challenges the notion that quadruplex formation upstream of the TSS contributes to regulation of gene expression. Downstream of the TSS, G-richness is concentrated in the first intron, and on the nontemplate strand, where polymorphic sequence elements with potential to form G-quadruplex structures and which cannot be accounted for by known regulatory motifs are found in almost 3000 (16%) of the human RefSeq genes, and are conserved through frogs. These elements could in principle be recognized either as DNA or as RNA, providing structural targets for regulation at the level of transcription or RNA processing.Keywords
This publication has 54 references indexed in Scilit:
- Human telomere, oncogenic promoter and 5'-UTR G-quadruplexes: diverse higher order DNA and RNA targets for cancer therapeuticsNucleic Acids Research, 2007
- Intramolecular DNA quadruplexes with different arrangements of short and long loopsNucleic Acids Research, 2007
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- An RNA G-quadruplex in the 5′ UTR of the NRAS proto-oncogene modulates translationNature Chemical Biology, 2007
- G-quadruplexes in promoters throughout the human genomeNucleic Acids Research, 2006
- Quadruplex DNA: sequence, topology and structureNucleic Acids Research, 2006
- Gene function correlates with potential for G4 DNA formation in the human genomeNucleic Acids Research, 2006
- A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promotersProceedings of the National Academy of Sciences, 2006
- AID binds to transcription-induced structures in c-MYC that map to regions associated with translocation and hypermutationOncogene, 2005
- Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammalsNature, 2005