Predicting tissue-specific enhancers in the human genome
- 8 January 2007
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 17 (2) , 201-211
- https://doi.org/10.1101/gr.5972507
Abstract
Determining how transcriptional regulatory signals are encoded in vertebrate genomes is essential for understanding the origins of multicellular complexity; yet the genetic code of vertebrate gene regulation remains poorly understood. In an attempt to elucidate this code, we synergistically combined genome-wide gene-expression profiling, vertebrate genome comparisons, and transcription factor binding-site analysis to define sequence signatures characteristic of candidate tissue-specific enhancers in the human genome. We applied this strategy to microarray-based gene expression profiles from 79 human tissues and identified 7187 candidate enhancers that defined their flanking gene expression, the majority of which were located outside of known promoters. We cross-validated this method for its ability to de novo predict tissue-specific gene expression and confirmed its reliability in 57 of the 79 available human tissues, with an average precision in enhancer recognition ranging from 32% to 63% and a sensitivity of 47%. We used the sequence signatures identified by this approach to successfully assign tissue-specific predictions to ∼328,000 human–mouse conserved noncoding elements in the human genome. By overlapping these genome-wide predictions with a data set of enhancers validated in vivo, in transgenic mice, we were able to confirm our results with a 28% sensitivity and 50% precision. These results indicate the power of combining complementary genomic data sets as an initial computational foray into a global view of tissue-specific gene regulation in vertebrates.
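The abstract reports its validation as precision and sensitivity obtained by overlapping predictions with enhancers validated in vivo. As a minimal sketch of how those two figures are conventionally computed from such an overlap, the Python below treats predictions and validated enhancers as sets of element identifiers. The function name, the identifiers, and the toy data are hypothetical and are not taken from the paper, whose actual evaluation procedure may differ (for example, in how tissue labels are matched).

```python
def precision_and_sensitivity(predicted, validated):
    """Return (precision, sensitivity) for predicted positives against
    a set of experimentally validated positives.

    precision   = true positives / all predicted positives
    sensitivity = true positives / all validated positives
    """
    predicted = set(predicted)
    validated = set(validated)
    true_positives = len(predicted & validated)
    # Guard against empty inputs to avoid division by zero.
    precision = true_positives / len(predicted) if predicted else 0.0
    sensitivity = true_positives / len(validated) if validated else 0.0
    return precision, sensitivity


# Hypothetical toy identifiers for illustration only, not data from the paper.
predicted_elements = {"elem1", "elem2", "elem3", "elem4"}
validated_elements = {"elem2", "elem4", "elem5", "elem6", "elem7"}

precision, sensitivity = precision_and_sensitivity(
    predicted_elements, validated_elements
)
print(f"precision={precision:.2f}, sensitivity={sensitivity:.2f}")
```

In this toy case two of the four predictions are validated (precision 0.50) and two of the five validated elements are recovered (sensitivity 0.40). A precision higher than sensitivity, as in the abstract's 50% and 28%, means the predictions are fairly reliable but miss many true enhancers.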