Prevalence of quadruplexes in the human genome
Top Cited Papers
Open Access
- 1 January 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 33 (9) , 2908-2916
- https://doi.org/10.1093/nar/gki609
Abstract
Guanine-rich DNA sequences of a particular form have the ability to fold into four-stranded structures called G-quadruplexes. In this paper, we present a working rule to predict which primary sequences can form this structure, and describe a search algorithm to identify such sequences in genomic DNA. We count the number of quadruplexes found in the human genome and compare that with the figure predicted by modelling DNA as a Bernoulli stream or as a Markov chain, using windows of various sizes. We demonstrate that the distribution of loop lengths is significantly different from what would be expected in a random case, providing an indication of the number of potentially relevant quadruplex-forming sequences. In particular, we show that there is a significant repression of quadruplexes in the coding strand of exonic regions, which suggests that quadruplex-forming patterns are disfavoured in sequences that will form RNA.Keywords
This publication has 51 references indexed in Scilit:
- G-rich Oligonucleotide Inhibits the Binding of a Nuclear Protein to the Ki-ras Promoter and Strongly Reduces Cell Growth in Human Carcinoma Pancreatic CellsBiochemistry, 2004
- Four-Stranded DNA Structure Stabilized by a Novel G:C:A:T TetradJournal of the American Chemical Society, 2003
- An intramolecular quadruplex of (GGA)4 triplet repeat DNA with a G:G:G:G tetrad and a G(:A):G(:A):G(:A):G heptad, and its dimeric interactionJournal of Molecular Biology, 2001
- The Bloom's Syndrome Helicase Unwinds G4 DNAJournal of Biological Chemistry, 1998
- Structure–Function Correlations of the Insulin-linked Polymorphic RegionJournal of Molecular Biology, 1996
- Method for Calculation of Probability of Matching a Bounded Regular Expression in a Random Data StringJournal of Computational Biology, 1995
- Selection of single-stranded DNA molecules that bind and inhibit human thrombinNature, 1992
- Structure and function of telomeresNature, 1991
- Telomeres and Their SynthesisScience, 1990
- Methods for calculating the probabilities of finding patterns in sequencesBioinformatics, 1989