Evidence of Influence of Genomic DNA Sequence on Human X Chromosome Inactivation
Open Access
- 1 September 2006
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 2 (9), e113
- https://doi.org/10.1371/journal.pcbi.0020113
Abstract
A significant number of human X-linked genes escape X chromosome inactivation and are thus expressed from both the active and inactive X chromosomes. The basis for escape from inactivation and the potential role of the X chromosome primary DNA sequence in determining a gene's X inactivation status is unclear. Using a combination of the X chromosome sequence and a comprehensive X inactivation profile of more than 600 genes, two independent yet complementary approaches were used to systematically investigate the relationship between X inactivation and DNA sequence features. First, statistical analyses revealed that a number of repeat features, including long interspersed nuclear element (LINE) and mammalian-wide interspersed repeat repetitive elements, are significantly enriched in regions surrounding transcription start sites of genes that are subject to inactivation, while Alu repetitive elements and short motifs containing ACG/CGT are significantly enriched in those that escape inactivation. Second, linear support vector machine classifiers constructed using primary DNA sequence features were used to correctly predict the X inactivation status for >80% of all X-linked genes. We further identified a small set of features that are important for accurate classification, among which LINE-1 and LINE-2 content show the greatest individual discriminatory power. Finally, as few as 12 features can be used for accurate support vector machine classification. Taken together, these results suggest that features of the underlying primary DNA sequence of the human X chromosome may influence the spreading and/or maintenance of X inactivation.
Female mammals have two X chromosomes while males have one X and one Y chromosome. To equalize dosage of X chromosome genes in males and females, one X in female cells is inactivated, repressing the expression of most genes on the chromosome. Despite the chromosome-wide nature of X inactivation, at least 10%–15% of genes “escape” this inactivation in human females and are still expressed on the inactivated X. Whether a gene escapes or is subject to inactivation is thought to be determined epigenetically, and it is unknown to what extent, if at all, the underlying genomic DNA sequence of the chromosome plays a role. In this work, the authors show that the DNA sequence surrounding genes that escape inactivation is significantly different from the sequence surrounding genes that are subject to inactivation. In fact, a small number of DNA sequence features can be used to predict with high accuracy whether a gene will escape or be subject to this silencing. This establishes strong evidence that epigenetic regulation is, at least in part, dependent on genomic sequence and organization and provides a list of candidate sequence features whose role(s) in X inactivation can now be explored.
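To make the idea of a "primary DNA sequence feature" concrete, the sketch below computes one such feature: the density of the short motifs ACG and CGT in a sequence window. ACG and CGT are reverse complements of each other, so counting both covers the motif on either strand. This is an illustration only, not the authors' pipeline: the function name `motif_density`, the overlapping-window counting scheme, and the toy sequences are all assumptions made for this example.

```python
def motif_density(sequence, motifs=("ACG", "CGT")):
    """Fraction of k-mer start positions in `sequence` that match a motif.

    Hypothetical helper for illustration. Assumes all motifs share one
    length k; overlapping occurrences are each counted.
    """
    seq = sequence.upper()
    k = len(motifs[0])
    n_windows = len(seq) - k + 1
    if n_windows <= 0:
        # Sequence shorter than the motif: no windows to score.
        return 0.0
    hits = sum(1 for i in range(n_windows) if seq[i:i + k] in motifs)
    return hits / n_windows


# Toy sequences (made up, not real X chromosome data).
motif_rich = "ACGTACGT"   # 3-mers: ACG CGT GTA TAC ACG CGT
motif_poor = "AAAATTTT"   # contains neither motif

print(motif_density(motif_rich))  # 4 of 6 windows match
print(motif_density(motif_poor))  # 0 of 6 windows match
```

In a classification setting, a value like this would be one column of a per-gene feature vector, computed over a window around each transcription start site, alongside features such as repeat content. The window size and the full feature set are not specified here and would need to come from the paper itself.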