Statistical analysis of the genomic distribution and correlation of regulatory elements in the ENCODE regions
- 13 June 2007
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 17 (6) , 787-797
- https://doi.org/10.1101/gr.5573107
Abstract
The comprehensive inventory of functional elements in 44 human genomic regions carried out by the ENCODE Project Consortium enables for the first time a global analysis of the genomic distribution of transcriptional regulatory elements. In this study we developed an intuitive and yet powerful approach to analyze the distribution of regulatory elements found in many different ChIP–chip experiments on a 10∼100-kb scale. First, we focus on the overall chromosomal distribution of regulatory elements in the ENCODE regions and show that it is highly nonuniform. We demonstrate, in fact, that regulatory elements are associated with the location of known genes. Further examination on a local, single-gene scale shows an enrichment of regulatory elements near both transcription start and end sites. Our results indicate that overall these elements are clustered into regulatory rich “islands” and poor “deserts.” Next, we examine how consistent the nonuniform distribution is between different transcription factors. We perform on all the factors a multivariate analysis in the framework of a biplot, which enhances biological signals in the experiments. This groups transcription factors into sequence-specific and sequence-nonspecific clusters. Moreover, with experimental variation carefully controlled, detailed correlations show that the distribution of sites was generally reproducible for a specific factor between different laboratories and microarray platforms. Data sets associated with histone modifications have particularly strong correlations. Finally, we show how the correlations between factors change when only regulatory elements far from the transcription start sites are considered.Keywords
This publication has 40 references indexed in Scilit:
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- Mapping of transcription factor binding regions in mammalian cells by ChIP: Comparison of array- and sequencing-based technologiesGenome Research, 2007
- Polycomb complexes repress developmental regulators in murine embryonic stem cellsNature, 2006
- Genome-wide Map of Nucleosome Acetylation and Methylation in YeastPublished by Elsevier ,2005
- The role of DNA response elements as allosteric modulators of steroid receptor functionMolecular and Cellular Endocrinology, 2005
- Gene identification signature (GIS) analysis for transcriptome characterization and genome annotationNature Methods, 2005
- SUZ12 Is Required for Both the Histone Methyltransferase Activity and the Silencing Function of the EED-EZH2 ComplexMolecular Cell, 2004
- Role of Histone H3 Lysine 27 Methylation in Polycomb-Group SilencingScience, 2002
- Histone Methyltransferase Activity of a Drosophila Polycomb Group Repressor ComplexCell, 2002
- Drosophila Enhancer of Zeste/ESC Complexes Have a Histone H3 Methyltransferase Activity that Marks Chromosomal Polycomb SitesCell, 2002