A User's Guide to the Encyclopedia of DNA Elements (ENCODE)

Top Cited Papers

Open Access

19 April 2011

journal article
research article
Published by Public Library of Science (PLoS) in PLoS Biology

Vol. 9 (4) , e1001046
https://doi.org/10.1371/journal.pbio.1001046

Abstract

The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome. The Encyclopedia of DNA Elements (ENCODE) Project was created to enable the scientific and medical communities to interpret the human genome sequence and to use it to understand human biology and improve health. The ENCODE Consortium, a large group of scientists from around the world, uses a variety of experimental methods to identify and describe the regions of the 3 billion base-pair human genome that are important for function. Using experimental, computational, and statistical analyses, we aimed to discover and describe genes, transcripts, and transcriptional regulatory regions, as well as DNA binding proteins that interact with regulatory regions in the genome, including transcription factors, different versions of histones and other markers, and DNA methylation patterns that define states of the genome in various cell types. The ENCODE Project has developed standards for each experiment type to ensure high-quality, reproducible data and novel algorithms to facilitate analysis. All data and derived results are made available through a freely accessible database. This article provides an overview of the complete project and the resources it is generating, as well as examples to illustrate the application of ENCODE data as a user's guide to facilitate the interpretation of the human genome.

Keywords

This publication has 122 references indexed in Scilit:

ChIP‐Seq: A Method for Global Identification of Regulatory Elements in the Genome
Current Protocols in Molecular Biology, 2010
Identification of a CpG Island Methylator Phenotype that Defines a Distinct Subgroup of Glioma
Cancer Cell, 2010
CCCTC-Binding Factor and the Transcription Factor T-bet Orchestrate T Helper 1 Cell-Specific Structure and Function at the Interferon-γ Locus
Immunity, 2009
Genomic Antagonism between Retinoic Acid and Estrogen Signaling in Breast Cancer
Published by Elsevier ,2009
High-Resolution Mapping and Characterization of Open Chromatin across the Genome
Cell, 2008
High-Resolution Profiling of Histone Methylations in the Human Genome
Published by Elsevier ,2007
Analysis of the Vertebrate Insulator Protein CTCF-Binding Sites in the Human Genome
Cell, 2007
Chromatin Modifications and Their Function
Cell, 2007
Control of Developmental Regulators by Polycomb in Human Embryonic Stem Cells
Cell, 2006
A Bivalent Chromatin Structure Marks Key Developmental Genes in Embryonic Stem Cells
Cell, 2006