A Maximum Likelihood Method for Detecting Directional Evolution in Protein Sequences and Its Application to Influenza A Virus
Open Access
- 29 May 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 25 (9) , 1809-1824
- https://doi.org/10.1093/molbev/msn123
Abstract
We develop a model-based phylogenetic maximum likelihood test for evidence of preferential substitution toward a given residue at individual positions of a protein alignment—directional evolution of protein sequences (DEPS). DEPS can identify both the target residue and sites evolving toward it, help detect selective sweeps and frequency-dependent selection—scenarios that confound most existing tests for selection, and achieve good power and accuracy on simulated data. We applied DEPS to alignments representing different genomic regions of influenza A virus (IAV), sampled from avian hosts (H5N1 serotype) and human hosts (H3N2 serotype), and identified multiple directionally evolving sites in 5/8 genomic segments of H5N1 and H3N2 IAV. We propose a simple descriptive classification of directionally evolving sites into 5 groups based on the temporal distribution of residue frequencies and document known functional correlates, such as immune escape or host adaptation.Keywords
This publication has 69 references indexed in Scilit:
- The Global Circulation of Seasonal Influenza A (H3N2) VirusesScience, 2008
- The Influenza Virus Resource at the National Center for Biotechnology InformationJournal of Virology, 2008
- Epitope Mapping of the Hemagglutinin Molecule of a Highly Pathogenic H5N1 Influenza Virus by Using Monoclonal AntibodiesJournal of Virology, 2007
- Simultaneous amino acid substitutions at antigenic sites drive influenza A hemagglutinin evolutionProceedings of the National Academy of Sciences, 2007
- The evolution of epidemic influenzaNature Reviews Genetics, 2007
- Mapping the Antigenic and Genetic Evolution of Influenza VirusScience, 2004
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992
- Adaptive protein evolution at the Adh locus in DrosophilaNature, 1991
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- Structure of the haemagglutinin membrane glycoprotein of influenza virus at 3 Å resolutionNature, 1981