Score Distributions for Simultaneous Matching to Multiple Motifs
- 1 January 1997
- journal article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 4 (1) , 45-59
- https://doi.org/10.1089/cmb.1997.4.45
Abstract
Several computer algorithms now exist for discovering multiple motifs(expressed as weight matrices) that characterize a family of proteinsequences known to be homologous. This paper describes a methodfor performing similarity searches of protein sequence databases usingsuch a group of motifs. By simultaneously using all the motifsthat characterize a protein family, the sensitivity and specificity ofthe database search are increased. We define the p-value for a targetsequence to be the...Keywords
This publication has 16 references indexed in Scilit:
- Use of receiver operating characteristic (ROC) analysis to evaluate sequence matchingPublished by Elsevier ,2002
- The PROSITE database, its status in 1995Nucleic Acids Research, 1996
- [27] Local alignment statisticsPublished by Elsevier ,1996
- Unsupervised learning of multiple motifs in biopolymers using expectation maximizationMachine Learning, 1995
- Approximations to Profile Score DistributionsJournal of Computational Biology, 1994
- The ENZYME data bankNucleic Acids Research, 1994
- Automated assembly of protein blocks for database searchingNucleic Acids Research, 1991
- Basic local alignment search toolJournal of Molecular Biology, 1990
- [9] Profile analysisPublished by Elsevier ,1990
- Estimating probabilities for normal extremesAdvances in Applied Probability, 1980