An approach to identify over-represented cis-elements in related sequences
Open Access
- 1 April 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (7) , 1995-2005
- https://doi.org/10.1093/nar/gkg287
Abstract
Computational identification of transcription factor binding sites is an important research area of computational biology. Positional weight matrix (PWM) is a model to describe the sequence pattern of binding sites. Usually, transcription factor binding sites prediction methods based on PWMs require user‐defined thresholds. The arbitrary threshold and also the relatively low specificity of the algorithm prevent the result of such an analysis from being properly interpreted. In this study, a method was developed to identify over‐represented cis‐elements with PWM‐based similarity scores. Three sets of closely related promoters were analyzed, and only over‐ represented motifs with high PWM similarity scores were reported. The thresholds to evaluate the similarity scores to the PWMs of putative transcription factors binding sites can also be automatically determined during the analysis, which can also be used in further research with the same PWMs. The online program is available on the website: http://www.bioinfo.tsinghua.edu.cn/∼zhengjsh/OTFBS/.Keywords
This publication has 26 references indexed in Scilit:
- Psoriatic lesional skin exhibits an aberrant expression pattern of interferon regulatory factor-2 (IRF-2)The Journal of Pathology, 2002
- Sp1- and Sp3-mediated Transcriptional Regulation of the Fibroblast Growth Factor Receptor 1 Gene in Chicken Skeletal Muscle CellsPublished by Elsevier ,2002
- Promoter Extraction from GenBank (PEG): automatic extraction of eukaryotic promoter sequences in large sets of genesBioinformatics, 2001
- Computer-assisted identification of cell cycle-related genes: new targets for E2F transcription factorsJournal of Molecular Biology, 2001
- Assessing Clusters and Motifs from Gene Expression DataGenome Research, 2001
- Discovery and modeling of transcriptional regulatory regionsPublished by Elsevier ,2000
- Activation of human γ-globin gene expression via triplex-forming oligonucleotide (TFO)-directed mutations in the γ-globin gene 5′ flanking regionGene, 2000
- Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies 1 1Edited by G. von HeijneJournal of Molecular Biology, 1998
- Specificity, free energy and information content in protein–DNA interactionsTrends in Biochemical Sciences, 1998
- AGL1-AGL6, an Arabidopsis gene family with similarity to floral homeotic and transcription factor genes.Genes & Development, 1991