Word frequency analysis reveals enrichment of dinucleotide repeats on the human X chromosome and [GATA]nin the X escape region
Open Access
- 13 March 2006
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 16 (4) , 477-484
- https://doi.org/10.1101/gr.4627606
Abstract
Most of the human genome encodes neither protein nor known functional RNA, yet available approaches to seek meaningful information in the “noncoding” sequence are limited. The unique biology of the X chromosome, one of which is silenced in mammalian females, can yield clues into sequence motifs involved in chromosome packaging and function. Although autosomal chromatin has some capacity for inactivation, evidence indicates that sequences enriched on the X chromosome render it fully competent for silencing, except in specific regions that escape inactivation. Here we have used a linguistic approach by analyzing the frequency and distribution of nine base-pair genomic “words” throughout the human genome. Results identify previously unknown sequence differences on the human X chromosome. Notably, the dinucleotide repeats [AT]n, [AC]n, and [AG]nare significantly enriched across the X chromosome compared with autosomes. Moreover, a striking enrichment (>10-fold) of [GATA]nis revealed throughout the 10-Mb segment at Xp22 that escapes inactivation, and is confirmed by fluorescence in situ hybridization. A similar enrichment is found in other eutherian genomes. Our findings clearly demonstrate sequence differences relevant to the novel biology and evolution of the X chromosome. Furthermore, they implicate simple sequence repeats, linked to gene regulation and unusual DNA structures, in the regulation and formation of facultative heterochromatin. Results suggest a new paradigm whereby a regional escape from X inactivation is due to the presence of elements that prevent heterochromatinization, rather than the lack of other elements that promote it.Keywords
This publication has 63 references indexed in Scilit:
- A Fine-Scale Map of Recombination Rates and Hotspots Across the Human GenomeScience, 2005
- Microsatellite Instability Generates Diversity in Brain and Sociobehavioral TraitsScience, 2005
- X-inactivation profile reveals extensive variability in X-linked gene expression in femalesNature, 2005
- Microsatellites: simple sequences with complex evolutionNature Reviews Genetics, 2004
- Anything else but GAGA: a nonhistone protein complex reshapes chromatin structureTrends in Genetics, 2004
- Initial sequencing and analysis of the human genomeNature, 2001
- Evidence for Heterogeneity in Recombination in the Human Pseudoautosomal Region: High Resolution Analysis by Sperm Typing and Radiation-Hybrid MappingAmerican Journal of Human Genetics, 2000
- The Spreading of X Inactivation into Autosomal Material of an X;autosome Translocation: Evidence for a Difference between Autosomal and X-Chromosomal DNAAmerican Journal of Human Genetics, 1998
- Long-range cis effects of ectopic X-inactivation centres on a mouse autosomeNature, 1997
- Genomic simple repetitive DNAs are targets for differential binding of nuclear proteinsFEBS Letters, 1996