Eukaryotic Regulatory Element Conservation Analysis and Identification Using Comparative Genomics
Open Access
- 1 March 2004
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 14 (3) , 451-458
- https://doi.org/10.1101/gr.1327604
Abstract
Comparative genomics is a promising approach to the challenging problem of eukaryotic regulatory element identification, because functional noncoding sequences may be conserved across species from evolutionary constraints. We systematically analyzed known human andSaccharomyces cerevisiaeregulatory elements and discovered that human regulatory elements are more conserved between human and mouse than are background sequences. AlthoughS. cerevisiaeregulatory elements do not appear to be more conserved by comparison ofS. cerevisiaetoSchizosaccharomyces pombe, they are more conserved when compared with multiple other yeast genomes (Saccharomyces paradoxus, Saccharomyces mikatae, andSaccharomyces bayanus). Based on these analyses, we developed a sequence-motif-finding algorithm called CompareProspector, which extends Gibbs sampling by biasing the search in regions conserved across species. Using human–mouse comparison, CompareProspector identified known motifs for transcription factors Mef2, Myf, Srf, and Sp1 from a set of human-muscle-specific genes. It also discovered the NFAT motif from genes up-regulated by CD28 stimulation in T-cells, which implies the direct involvement of NFAT in mediating the CD28 stimulatory signal. UsingCaenorhabditis elegans–Caenorhabditis briggsaecomparison, CompareProspector found the PHA-4 motif and the UNC-86 motif. CompareProspector outperformed many other computational motif-finding programs, demonstrating the power of comparative genomics-based biased sampling in eukaryotic regulatory element identification.Keywords
This publication has 34 references indexed in Scilit:
- Sequencing and comparison of yeast species to identify genes and regulatory elementsNature, 2003
- LAGAN and Multi-LAGAN: Efficient Tools for Large-Scale Multiple Alignment of Genomic DNAGenome Research, 2003
- Distinguishing Regulatory DNA From Neutral SitesGenome Research, 2003
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- The genome sequence of Schizosaccharomyces pombeNature, 2002
- Human-mouse genome comparisons to locate regulatory sitesNature Genetics, 2000
- Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitationNature Biotechnology, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- TRANSCRIPTION FACTORS OF THE NFAT FAMILY:Regulation and FunctionAnnual Review of Immunology, 1997