Quick, Practical Selection of Effective Seeds for Homology Search
- 1 November 2005
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 12 (9) , 1137-1152
- https://doi.org/10.1089/cmb.2005.12.1137
Abstract
It has been observed that in homology search gapped seeds have better sensitivity than ungapped ones for the same cost (weight). In this paper, we propose a probability leakage model (a dissipative Markov system) to elucidate the mechanism that confers power to spaced seeds. Based on this model, we identify desirable features of gapped search seeds and formulate an extremely efficient procedure for seed design: it samples from the set of spaced seed exhibiting those features, evaluates their sensitivity, and then selects the best. The sensitivity of the constructed seeds is negligibly less than that of the corresponding known optimal seeds. While the challenging mathematical question of characterizing optimal search seeds remains open, we believe that our eminently efficient and effective approach represents a satisfactory solution from a practitioner's viewpoint.Keywords
This publication has 21 references indexed in Scilit:
- tPatternHunter: gapped, fast and sensitive translated homology searchBioinformatics, 2004
- Sensitivity analysis and efficient method for identifying optimal spaced seedsJournal of Computer and System Sciences, 2004
- Good spaced seeds for homology searchBioinformatics, 2004
- Human–Mouse Alignments with BLASTZGenome Research, 2002
- A Greedy Algorithm for Aligning DNA SequencesJournal of Computational Biology, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Multiple filtration and approximate pattern matchingAlgorithmica, 1995
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Rapid and Sensitive Protein Similarity SearchesScience, 1985
- PATRICIA—Practical Algorithm To Retrieve Information Coded in AlphanumericJournal of the ACM, 1968