Algorithms for Phylogenetic Footprinting
- 1 April 2002
- journal article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 9 (2) , 211-223
- https://doi.org/10.1089/10665270252935421
Abstract
Phylogenetic footprinting is a technique that identifies regulatory elements by finding unusually well conserved regions in a set of orthologous noncoding DNA sequences from multiple species. We introduce a new motif-finding problem, the Substring Parsimony Problem, which is a formalization of the ideas behind phylogenetic footprinting, and we present an exact dynamic programming algorithm to solve it. We then present a number of algorithmic optimizations that allow our program to run quickly on most biologically interesting datasets. We show how to handle data sets in which only an unknown subset of the sequences contains the regulatory element. Finally, we describe how to empirically assess the statistical significance of the motifs found. Each technique is implemented and successfully identifies a number of known binding sites, as well as several highly conserved but uncharacterized regions. The program is available at http://bio.cs.washington.edu/software.html.Keywords
This publication has 29 references indexed in Scilit:
- Embryonic ε and γ globin genes of a prosimian primate (Galago crassicaudatus): Nucleotide and amino acid sequences, developmental regulation and phylogenetic footprintsPublished by Elsevier ,2004
- Computational identification of Cis -regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae 1 1Edited by F. E. CohenJournal of Molecular Biology, 2000
- Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitationNature Biotechnology, 1998
- Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies 1 1Edited by G. von HeijneJournal of Molecular Biology, 1998
- EVOLUTION OF LIGHT-REGULATED PLANT PROMOTERSAnnual Review of Plant Biology, 1998
- Sequences responsible for intracellular localization of beta-actin messenger RNA also affect cell phenotype.The Journal of cell biology, 1994
- Detecting Subtle Sequence Signals: a Gibbs Sampling Strategy for Multiple AlignmentScience, 1993
- A likelihood approach to character weighting and what it tells us about parsimony and compatibilityBiological Journal of the Linnean Society, 1981
- Locating the vertices of a steiner tree in an arbitrary metric spaceMathematical Programming, 1975
- Construction of Phylogenetic TreesScience, 1967