MSCAN: identification of functional clusters of transcription factor binding sites
Open Access
- 1 July 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 32 (Web Server) , W195-W198
- https://doi.org/10.1093/nar/gkh387
Abstract
Identification of functional transcription factor binding sites in genomic sequences is notoriously difficult. The critical problem is the low specificity of predictions, which directly reflects the low target specificity of DNA binding proteins. To overcome the noise produced in predictions of individual binding sites, a new generation of algorithms achieves better predictive specificity by focusing on locally dense clusters of binding sites. MSCAN is a leading method for binding site cluster detection that determines the significance of observed sites while correcting for local compositional bias of sequences. The algorithm is highly flexible, applying any set of input binding models to the analysis of a user-specified sequence. From the user's perspective, a key feature of the system is that no reference data sets of regulatory sequences from co-regulated genes are required to train the algorithm. The output from MSCAN consists of an ordered list of sequence segments that contain potential regulatory modules. We have chosen the features in MSCAN such that sequence and matrix retrieval is highly facilitated, resulting in a web server that is intuitive to use. MSCAN is available at http://mscan.cgb.ki.se/cgi-bin/MSCAN.Keywords
This publication has 12 references indexed in Scilit:
- Ensembl 2004Nucleic Acids Research, 2004
- JASPAR: an open-access database for eukaryotic transcription factor binding profilesNucleic Acids Research, 2004
- Computational detection of cis -regulatory modulesBioinformatics, 2003
- Identification of functional clusters of transcription factor binding motifs in genome sequences: the MSCAN algorithmBioinformatics, 2003
- CREME: a framework for identifying cis-regulatory modules in human-mouse conserved segmentsBioinformatics, 2003
- Target Explorer: an automated tool for the identification of new target genes for a specified set of transcription factorsNucleic Acids Research, 2003
- Cluster-Buster: finding dense clusters of motifs in DNA sequencesNucleic Acids Research, 2003
- TRANSFAC(R): transcriptional regulation, from patterns to profilesNucleic Acids Research, 2003
- TFBS: Computational framework for transcription factor binding site analysisBioinformatics, 2002
- SCORE: A computational approach to the identification of cis-regulatory modules and target genes in whole-genome sequence dataProceedings of the National Academy of Sciences, 2002