Combinatorial motif analysis and hypothesis generation on a genomic scale
Open Access
- 1 March 2000
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 16 (3) , 222-232
- https://doi.org/10.1093/bioinformatics/16.3.222
Abstract
Motivation: Computer-assisted methods are essential for the analysis of biosequences. Gene activity is regulated in part by the binding of regulatory molecules (transcription factors) to combinations of short motifs. The goal of our analysis is the development of algorithms to identify regulatory motifs and to predict the activity of combinations of those motifs. Approach: Our research begins with a new motif-finding method, using multiple objective functions and an improved stochastic iterative sampling strategy. Combinatorial motif analysis is accomplished by constructive induction that analyzes potential motif combinations. The hypothesis is generated by applying standard inductive learning algorithms. Results: Tests using 10 previously identified regulons from budding yeast and 14 artificial families of sequences demonstrated the effectiveness of the new motif-finding method. Motif combination and classification approaches were used in the analysis of a sample DNA array data set derived from genome-wide gene expression analysis. Availability: Programs will be available as executable files upon request. Contact: yhu@ics.uci.eduor yhu@cse.ttu.edu.twKeywords
This publication has 0 references indexed in Scilit: