Selection for the G4 DNA motif at the 5′ end of human genes

Abstract
Formation of G4 DNA may occur in the course of replication and transcription, and contribute to genomic instability. We have quantitated abundance of G4 motifs and potential for G4 DNA formation of the nontemplate strand of 5′ exons and introns of transcripts of human genes. We find that, for all human genes, G4 motifs are enriched in 5′ regions of transcripts relative to downstream regions; and in 5′ regulatory regions relative to coding regions. Notably, although tumor suppressor genes are depleted and proto‐oncogenes enriched in G4 motifs, abundance of G4 motifs in the 5′ regions of transcripts of genes in these categories does not differ. These results support the hypothesis that G4 motifs are under selection in the human genome. They further show that for tumor suppressor genes and proto‐oncogenes, independent selection determines potential for G4 DNA formation of 5′ regulatory regions of transcripts and downstream coding regions.