Score Distributions for Simultaneous Matching to Multiple Motifs

Abstract
Several computer algorithms now exist for discovering multiple motifs(expressed as weight matrices) that characterize a family of proteinsequences known to be homologous. This paper describes a methodfor performing similarity searches of protein sequence databases usingsuch a group of motifs. By simultaneously using all the motifsthat characterize a protein family, the sensitivity and specificity ofthe database search are increased. We define the p-value for a targetsequence to be the...

This publication has 16 references indexed in Scilit: