Maximum-Likelihood Approach for Gene Family Evolution Under Functional Divergence

Open Access

1 April 2001

journal article
research article
Published by Oxford University Press (OUP) in Molecular Biology and Evolution

Vol. 18 (4) , 453-464
https://doi.org/10.1093/oxfordjournals.molbev.a003824

Abstract

According to the observed alignment pattern (i.e., amino acid configuration), we studied two basic types of functional divergence of a protein family. Type I functional divergence after gene duplication results in altered functional constraints (i.e., different evolutionary rate) between duplicate genes, whereas type II results in no altered functional constraints but radical change in amino acid property between them (e.g., charge, hydrophobicity, etc.). Two statistical approaches, i.e., the subtree likelihood and the whole-tree likelihood, were developed for estimating the coefficients of (type I or type II) functional divergence. Numerical algorithms for obtaining maximum-likelihood estimates are also provided. Moreover, a posterior-based site-specific profile is implemented to predict critical amino acid residues that are responsible for type I and/or type II functional divergence after gene duplication. We compared the current likelihood with a fast method developed previously by examples; both show similar results. For handling altered functional constraints (type I functional divergence) in the large gene family with many member genes (clusters), which appears to be a normal case in postgenomics, the subtree likelihood provides a solution that is computationally feasible and robust against the uncertainty of the phylogeny. The cost of this feasibility is the approximation when frequencies of amino acids are very skewed. The potential bias and correction are discussed.

Keywords

This publication has 29 references indexed in Scilit:

The role of cyclooxygenases in inflammation, cancer, and development
Oncogene, 1999
Coevolving protein residues: maximum likelihood identification and relationship to structure 1 1Edited by G. Von Heijne
Journal of Molecular Biology, 1999
Vertebrate evolution by interspecific hybridisation – are we polyploid?
FEBS Letters, 1997
Modeling residue usage in aligned protein sequences via maximum likelihood
Molecular Biology and Evolution, 1996
An Evolutionary Trace Method Defines Binding Surfaces Common to Protein Families
Journal of Molecular Biology, 1996
A method to predict functional residues in proteins
Nature Structural & Molecular Biology, 1995
Evolution of the Vertebrate Genome as Reflected in Paralogous Chromosomal Regions in Man and the House Mouse
Genomics, 1993
The rapid generation of mutation data matrices from protein sequences
Bioinformatics, 1992
Evolutionary trees from DNA sequences: A maximum likelihood approach
Journal of Molecular Evolution, 1981
Fitting Discrete Probability Distributions to Evolutionary Events
Science, 1971