Applications and statistics for multiple high-scoring segments in molecular sequences.
- 15 June 1993
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 90 (12) , 5873-5877
- https://doi.org/10.1073/pnas.90.12.5873
Abstract
Score-based measures of molecular-sequence features provide versatile aids for the study of proteins and DNA. They are used by many sequence data base search programs, as well as for identifying distinctive properties of single sequences. For any such measure, it is important to know what can be expected to occur purely by chance. The statistical distribution of high-scoring segments has been described elsewhere. However, molecular sequences will frequently yield several high-scoring segments for which some combined assessment is in order. This paper describes the statistical distribution for the sum of the scores of multiple high-scoring segments and illustrates its application to the identification of possible transmembrane segments and the evaluation of sequence similarity.Keywords
This publication has 28 references indexed in Scilit:
- Identification of common molecular subsequencesPublished by Elsevier ,2004
- Identification of protein coding regions by database similarity searchNature Genetics, 1993
- Amino acid substitution matrices from protein blocks.Proceedings of the National Academy of Sciences, 1992
- Chance and Statistical Significance in Protein and DNA Sequence AnalysisScience, 1992
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992
- The SWISS-PROT protein sequence data bankNucleic Acids Research, 1992
- The PIR-International Protein Sequence DatabaseNucleic Acids Research, 1992
- Protein database searches for multiple alignments.Proceedings of the National Academy of Sciences, 1990
- The ovalbumin gene family: Structure of the X gene and evolution of duplicated split genesCell, 1980
- Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c551Journal of Molecular Biology, 1971