P-Match: transcription factor binding site search by combining patterns and weight matrices
Open Access
- 1 July 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 33 (Web Server) , W432-W437
- https://doi.org/10.1093/nar/gki441
Abstract
P-Match is a new tool for identifying transcription factor (TF) binding sites in DNA sequences. It combines pattern matching and weight matrix approaches thus providing higher accuracy of recognition than each of the methods alone. P-Match is closely interconnected with the TRANSFAC® database. In particular, P-Match uses the matrix library as well as sets of aligned known TF-binding sites collected in TRANSFAC® and therefore provides the possibility to search for a large variety of different TF binding sites. Using results of extensive tests of recognition accuracy, we selected three sets of optimized cut-off values that minimize either false negatives or false positives, or the sum of both errors. Comparison with the weight matrix approaches such as Match™ tool shows that P-Match generally provides superior recognition accuracy in the area of low false negative errors (high sensitivity). As familiar to the user of Match™, P-Match also allows to save user-specific profiles that include selected subsets of matrices with corresponding TF-binding sites or user-defined cut-off values. Furthermore, a number of tissue-specific profiles are provided that were compiled by the TRANSFAC® team. A public version of the P-Match tool is available at http://www.gene-regulation.com/cgi-bin/pub/programs/pmatch/bin/p-match.cgi.Keywords
This publication has 17 references indexed in Scilit:
- Deriving an ontology for human gene expression sources from the CYTOMER database on human organs and cell types.2005
- Genome-Wide Analysis of CREB Target Genes Reveals A Core Promoter Requirement for cAMP ResponsivenessMolecular Cell, 2003
- TRANSFAC(R): transcriptional regulation, from patterns to profilesNucleic Acids Research, 2003
- TRANSPATH(R): an integrated database on signal transduction and a tool for array analysisNucleic Acids Research, 2003
- AliBaba2: context specific identification of transcription factor binding sites.2002
- TRANSCompel(R): a database on composite regulatory elements in eukaryotic genesNucleic Acids Research, 2002
- Discovery and modeling of transcriptional regulatory regionsPublished by Elsevier ,2000
- Computer tool FUNSITE for analysis of eukaryotic regulatory genomic sequences.1995
- Matlnd and Matlnspector: new fast and versatile tools for detection of consensus matches in nucleotide sequence dataNucleic Acids Research, 1995
- Detecting Subtle Sequence Signals: a Gibbs Sampling Strategy for Multiple AlignmentScience, 1993