Amino Acid Substitution Matrices from an Artificial Neural Network Model
- 1 October 2001
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 8 (5) , 471-481
- https://doi.org/10.1089/106652701753216495
Abstract
An amino acid substitution matrix specifies probabilities of substitutions for each pair of the 20 amino acids. Log-odds scores transformed from the values in substitution matrices are widely used to construct protein sequence alignments. Any given substitution matrix is suited to matching sequences diverged by a specific evolutionary distance. However, for a given set of sequences, it is not always clear what matrix should be used. We used an artificial neural network model to predict probabilities of amino acid substitutions with alignment samples of different evolutionary distances. From this internal description, substitution matrices suitable for detecting relationships at any chosen evolutionary distance can be instantly generated. By using the additional information of evolutionary distances, the average cross entropy error of our neural network model is lower than that of a series of BLOSUM and PET matrices over all testing sets. Our model is more accurate on the prediction of amino acid substitution probabilities.
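The two quantities named in the abstract, log-odds scores and average cross entropy error, can be illustrated with a minimal Python sketch. It is not the authors' neural network or their evaluation code. It assumes the conventional log-odds form, score = scale * log2(q_ab / (p_a * p_b)), with joint pair probabilities q_ab and background frequencies p_a, and it assumes cross entropy is averaged as the negative log of the predicted conditional probability over observed aligned pairs. The function names, the half-bit scale, and the two-letter toy alphabet are all hypothetical and chosen only for illustration.

```python
import math


def log_odds_matrix(target, background, scale=2.0):
    """Convert pair probabilities into rounded log-odds scores.

    target[a][b] -- assumed joint probability q_ab of residue a aligned with b
    background[a] -- assumed background frequency p_a of residue a
    scale -- 2.0 gives half-bit units (an assumption, as used by BLOSUM)
    """
    scores = {}
    for a, row in target.items():
        for b, q_ab in row.items():
            expected = background[a] * background[b]  # chance alignment
            scores[(a, b)] = round(scale * math.log2(q_ab / expected))
    return scores


def average_cross_entropy(predicted, observed_pairs):
    """Mean negative log-probability of observed substitutions.

    predicted[a][b] -- predicted conditional probability P(b | a)
    observed_pairs -- list of (a, b) aligned residue pairs from a test set
    """
    total = -sum(math.log(predicted[a][b]) for a, b in observed_pairs)
    return total / len(observed_pairs)


# Toy two-letter alphabet (hypothetical data, not from the paper).
target = {"A": {"A": 0.4, "B": 0.1}, "B": {"A": 0.1, "B": 0.4}}
background = {"A": 0.5, "B": 0.5}
scores = log_odds_matrix(target, background)

predicted = {"A": {"A": 0.8, "B": 0.2}, "B": {"A": 0.2, "B": 0.8}}
pairs = [("A", "A"), ("A", "B")]
error = average_cross_entropy(predicted, pairs)
```

In this toy case, identical residues receive a positive score and mismatches a negative one, and a model that assigns higher probability to the substitutions actually observed yields a lower cross entropy.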