Approximate P-Values for Local Sequence Alignments: Numerical Studies
- 1 October 2001
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 8 (5) , 549-556
- https://doi.org/10.1089/106652701753216530
Abstract
Siegmund and Yakir (2000) have given an approximate p-value when two independent, identically distributed sequences from a finite alphabet are optimally aligned based on a scoring system that rewards similarities according to a general scoring matrix and penalizes gaps (insertions and deletions). The approximation involves an infinite sequence of difficult-to-compute parameters. In this paper, it is shown by numerical studies that these reduce to essentially two numerically distinct parameters, which can be computed as one-dimensional numerical integrals. For an arbitrary scoring matrix and affine gap penalty, this modified approximation is easily evaluated. Comparison with published numerical results shows that it is reasonably accurate.