Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Open Access
- 1 September 1997
- journal article
- review article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 25 (17) , 3389-3402
- https://doi.org/10.1093/nar/25.17.3389
Abstract
The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSI-BLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.
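The idea of a position-specific score matrix mentioned in the abstract can be illustrated with a small sketch. This is not the PSI-BLAST procedure itself: it assumes gap-free aligned sequences of equal length, a uniform background frequency, and a fixed pseudocount, all of which are simplifications chosen for illustration. The function names `build_pssm` and `best_window` are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pssm(aligned, pseudocount=1.0):
    """Build per-column log-odds scores from gap-free aligned sequences.

    Assumptions (not taken from the paper): uniform background
    frequencies and a constant pseudocount added to every residue count.
    """
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("aligned sequences must have equal length")

    background = 1.0 / len(AMINO_ACIDS)
    total = len(aligned) + pseudocount * len(AMINO_ACIDS)

    pssm = []
    for col in range(length):
        counts = Counter(seq[col] for seq in aligned)
        # Score = log2(observed frequency / background frequency).
        pssm.append({
            aa: math.log2((counts[aa] + pseudocount) / total / background)
            for aa in AMINO_ACIDS
        })
    return pssm


def best_window(pssm, sequence):
    """Return (score, start) of the best ungapped window, or None if
    the sequence is shorter than the matrix."""
    width = len(pssm)
    best = None
    for start in range(len(sequence) - width + 1):
        score = sum(pssm[i][sequence[start + i]] for i in range(width))
        if best is None or score > best[0]:
            best = (score, start)
    return best


if __name__ == "__main__":
    matrix = build_pssm(["ACDE", "ACDE", "ACDF"])
    print(best_window(matrix, "GGACDEGG"))
```

Each column of the matrix scores residues by how often they appear at that position in the aligned set, so a database sequence is rewarded for matching the family's conserved positions rather than a single query. Scanning here is ungapped and exhaustive; the heuristics described in the abstract (word-hit extension, gapped alignment) are not modeled.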