cisRED: a database system for genome-scale computational discovery of regulatory elements
Open Access
- 1 January 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (90001), D68-D73
- https://doi.org/10.1093/nar/gkj075
Abstract
We describe cisRED, a database for conserved regulatory elements that are identified and ranked by a genome-scale computational system (www.cisred.org). The database and high-throughput predictive pipeline are designed to address diverse target genomes in the context of rapidly evolving data resources and tools. Motifs are predicted in promoter regions using multiple discovery methods applied to sequence sets that include corresponding sequence regions from vertebrates. We estimate motif significance by applying the discovery and post-processing methods to randomized sequence sets that are adaptively derived from the target sequence sets. We then retain motifs with p-values below a threshold and identify groups of similar motifs and co-occurring motif patterns. The database offers information on atomic motifs, motif groups and patterns. It is web-accessible, and can be queried directly, downloaded or installed locally.
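The significance step described in the abstract can be illustrated with a minimal sketch. It is not the cisRED implementation: it assumes that each motif carries a single numeric score, that the same scoring has already been run on the randomized sequence sets to give a list of null scores, and that an empirical p-value with a standard +1 correction is an acceptable estimate. The function names, scores and threshold are hypothetical.

```python
from typing import Dict, List


def empirical_p_value(observed: float, null_scores: List[float]) -> float:
    """Estimate a p-value as the fraction of null scores >= the observed score.

    The +1 in numerator and denominator keeps the estimate above zero when
    no null score reaches the observed one (an assumed convention, not taken
    from the paper).
    """
    exceed = sum(1 for score in null_scores if score >= observed)
    return (exceed + 1) / (len(null_scores) + 1)


def retain_significant(
    motif_scores: Dict[str, float],
    null_scores: List[float],
    threshold: float,
) -> Dict[str, float]:
    """Return {motif: p-value} for motifs whose p-value is below the threshold."""
    p_values = {
        motif: empirical_p_value(score, null_scores)
        for motif, score in motif_scores.items()
    }
    return {motif: p for motif, p in p_values.items() if p < threshold}


# Made-up illustrative data: scores obtained on randomized sequence sets
# (null) and scores of two hypothetical motifs found in the target set.
null_scores = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0]
motif_scores = {"motif_a": 9.5, "motif_b": 4.0}

kept = retain_significant(motif_scores, null_scores, threshold=0.2)
print(kept)
```

With these made-up numbers only `motif_a` is retained, because no null score reaches 9.5, while six of the nine null scores are at least as high as the score of `motif_b`. How the randomized sets are "adaptively derived" from the target sets is not specified in the abstract, so the sketch takes the null scores as given.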