Combining statistical alignment and phylogenetic footprinting to detect regulatory elements
Open Access
- 18 March 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (10) , 1236-1242
- https://doi.org/10.1093/bioinformatics/btn104
Abstract
Motivation: Traditional alignment-based phylogenetic footprinting approaches make predictions on the basis of a single assumed alignment. The predictions are therefore highly sensitive to alignment errors or regions of alignment uncertainty. Alternatively, statistical alignment methods provide a framework for performing phylogenetic analyses by examining a distribution of alignments. Results: We developed a novel algorithm for predicting functional elements by combining statistical alignment and phylogenetic footprinting (SAPF). SAPF simultaneously performs both alignment and annotation by combining phylogenetic footprinting techniques with an hidden Markov model (HMM) transducer-based multiple alignment model, and can analyze sequence data from multiple sequences. We assessed SAPF's predictive performance on two simulated datasets and three well-annotated cis-regulatory modules from newly sequenced Drosophila genomes. The results demonstrate that removing the traditional dependence on a single alignment can significantly augment the predictive performance, especially when there is uncertainty in the alignment of functional regions. Availability: SAPF is freely available to download online at http://www.stats.ox.ac.uk/~satija/SAPF/ Contact:satija@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 28 references indexed in Scilit:
- Uncertainty in homology inferences: Assessing and improving genomic sequence alignmentGenome Research, 2007
- MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory ModulesPLoS Computational Biology, 2007
- Evolution of genes and genomes on the Drosophila phylogenyNature, 2007
- Discovery of functional elements in 12 Drosophila genomes using evolutionary signaturesNature, 2007
- Parametric Alignment of Drosophila GenomesPLoS Computational Biology, 2006
- Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomesGenome Research, 2005
- Using evolutionary Expectation Maximization to estimate indel ratesBioinformatics, 2005
- Drosophila DNase I footprint database: a systematic genome annotation of transcription factor binding sites in the fruitfly, Drosophila melanogasterBioinformatics, 2004
- The Genome Sequence of Drosophila melanogasterScience, 2000
- Regulation of a Segmentation Stripe by Overlapping Activators and Repressors in the Drosophila EmbryoScience, 1991