Identification of conserved regulatory elements by comparative genome analysis
Open Access
- 22 May 2003
- journal article
- research article
- Published by Springer Nature in Journal of Biology
- Vol. 2 (2) , 13
- https://doi.org/10.1186/1475-4924-2-13
Abstract
For genes that have been successfully delineated within the human genome sequence, most regulatory sequences remain to be elucidated. The annotation and interpretation process requires additional data resources and significant improvements in computational methods for the detection of regulatory regions. One approach of growing popularity is based on the preferential conservation of functional sequences over the course of evolution by selective pressure, termed 'phylogenetic footprinting'. Mutations are more likely to be disruptive if they appear in functional sites, resulting in a measurable difference in evolution rates between functional and non-functional genomic segments. We have devised a flexible suite of methods for the identification and visualization of conserved transcription-factor-binding sites. The system reports those putative transcription-factor-binding sites that are both situated in conserved regions and located as pairs of sites in equivalent positions in alignments between two orthologous sequences. An underlying collection of metazoan transcription-factor-binding profiles was assembled to facilitate the study. This approach results in a significant improvement in the detection of transcription-factor-binding sites because of an increased signal-to-noise ratio, as demonstrated with two sets of promoter sequences. The method is implemented as a graphical web application, ConSite, which is at the disposal of the scientific community at http://www.phylofoot.org/. Phylogenetic footprinting dramatically improves the predictive selectivity of bioinformatic approaches to the analysis of promoter sequences. ConSite delivers unparalleled performance using a novel database of high-quality binding models for metazoan transcription factors. With a dynamic interface, this bioinformatics tool provides broad access to promoter analysis with phylogenetic footprinting.Keywords
This publication has 41 references indexed in Scilit:
- Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomesNucleic Acids Research, 2001
- RefSeq and LocusLink: NCBI gene-centered resourcesNucleic Acids Research, 2001
- Human-mouse genome comparisons to locate regulatory sitesNature Genetics, 2000
- Alfresco—A Workbench for Comparative Genomic Sequence AnalysisGenome Research, 2000
- Human and Mouse Gene Structure: Comparative Analysis and Application to Exon PredictionGenome Research, 2000
- Identification of a Coordinate Regulator of Interleukins 4, 13, and 5 by Cross-Species Sequence ComparisonsScience, 2000
- DNA binding sites: representation and discoveryBioinformatics, 2000
- ANN-SPEC: A METHOD FOR DISCOVERING TRANSCRIPTION FACTOR BINDING SITES WITH IMPROVED SPECIFICITYPacific Symposium on Biocomputing, 1999
- Measuring Molecular InformationJournal of Theoretical Biology, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997