Modeling Amino Acid Replacement
- 1 December 2000
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 7 (6) , 761-776
- https://doi.org/10.1089/10665270050514918
Abstract
The estimation of amino acid replacement frequencies during molecular evolution is crucial for many applications in sequence analysis. Score matrices for database search programs or phylogenetic analysis rely on such models of protein evolution. Pioneering work was done by Dayhoff et al. (1978) who formulated a Markov model of evolution and derived the famous PAM score matrices. Her estimation procedure for amino acid exchange frequencies is restricted to pairs of proteins that have a constant and small degree of divergence. Here we present an improved estimator, called the resolvent method, that is not subject to these limitations. This extension of Dayhoff's approach enables us to estimate an amino acid substitution model from alignments of varying degree of divergence. Extensive simulations show the capability of the new estimator to recover accurately the exchange frequencies among amino acids. Based on the SYSTERS database of aligned protein families (Krause and Vingron, 1998) we recompute a series of score matrices.Keywords
This publication has 17 references indexed in Scilit:
- Amino acid substitution matrices from an information theoretic perspectivePublished by Elsevier ,2005
- A set-theoretic approach to database searching and clustering.Bioinformatics, 1998
- Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitutionProceedings of the National Academy of Sciences, 1998
- Model of Amino Acid Substitution in Proteins Encoded by Mitochondrial DNAJournal of Molecular Evolution, 1996
- Amino acid substitution during functionally constrained divergent evolution of protein sequencesProtein Engineering, Design and Selection, 1994
- Amino acid substitution matrices from protein blocks.Proceedings of the National Academy of Sciences, 1992
- Exhaustive Matching of the Entire Protein Sequence DatabaseScience, 1992
- Automated assembly of protein blocks for database searchingNucleic Acids Research, 1991
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Construction of Phylogenetic TreesScience, 1967