Modeling residue usage in aligned protein sequences via maximum likelihood

1 December 1996

journal article
Published by Oxford University Press (OUP) in Molecular Biology and Evolution

Vol. 13 (10) , 1368-1374
https://doi.org/10.1093/oxfordjournals.molbev.a025583

Abstract

A computational method is presented for characterizing residue usage, i.e., site-specific residue frequencies, in aligned protein sequences. The method obtains frequency estimates that maximize the likelihood of the sequences in a simple model for sequence evolution, given a tree or a set of candidate trees computed by other methods. These maximum-likelihood frequencies constitute a profile of the sequences, and thus the method offers a rigorous alternative to sequence weighting for constructing such a profile. The ability of this method to discard misleading phylogenetic effects allows the biochemical propensities of different positions in a sequence to be more clearly observed and interpreted.

Keywords

This publication has 0 references indexed in Scilit: