Profile analysis: detection of distantly related proteins.
- 1 July 1987
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 84 (13) , 4355-4358
- https://doi.org/10.1073/pnas.84.13.4355
Abstract
Profile analysis is a method for detecting distantly related proteins by sequence comparison. The basis for comparison is not only the customary Dayhoff mutational-distance matrix but also the results of structural studies and information implicit in the alignments of the sequences of families of similar proteins. This information is expressed in a position-specific scoring table (profile), which is created from a group of sequences previously aligned by structural or sequence similarity. The similarity of any other sequence (target) to the group of aligned sequences (probe) can be tested by comparing the target to the profile using dynamic programming algorithms. The profile method differs in two major respects from methods of sequence comparison in common use: (i) Any number of known sequences can be used to construct the profile, allowing more information to be used in the testing of the target than is possible with pairwise alignment methods. (ii) The profile includes the penalties for insertion or deletion at each position, which allow one to include the probe secondary structure in the testing scheme. Tests with globin and immunoglobulin sequences show that profile analysis can distinguish all members of these families from all other sequences in a database containing 3800 protein sequences.This publication has 24 references indexed in Scilit:
- Evolutionary similarity among peptide segments is a basis for prediction of protein foldingBiopolymers, 1986
- Identification of protein sequence homology by consensus template alignmentJournal of Molecular Biology, 1986
- Prediction of the occurrence of the ADP-binding βαβ-fold in proteins, using an amino acid sequence fingerprintJournal of Molecular Biology, 1986
- Rapid and Sensitive Protein Similarity SearchesScience, 1985
- Correlation of sequence hydrophobicities measures similarity in three-dimensional protein structureJournal of Molecular Biology, 1983
- Analysis of gene duplication repeats in the myosin rodJournal of Molecular Biology, 1983
- How different amino acid sequences determine similar protein structures: The structure and evolutionary dynamics of the globinsJournal of Molecular Biology, 1980
- Empirical Predictions of Protein ConformationAnnual Review of Biochemistry, 1978
- A general method applicable to the search for similarities in the amino acid sequence of two proteinsJournal of Molecular Biology, 1970
- An improved method of testing for evolutionary homologyJournal of Molecular Biology, 1966