Bias in Phylogenetic Reconstruction of Vertebrate Rhodopsin Sequences
Open Access
- 1 August 2000
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 17 (8) , 1220-1231
- https://doi.org/10.1093/oxfordjournals.molbev.a026405
Abstract
Two spurious nodes were found in phylogenetic analyses of vertebrate rhodopsin sequences in comparison with well-established vertebrate relationships. These spurious reconstructions were well supported in bootstrap analyses and occurred independently of the method of phylogenetic analysis used (parsimony, distance, or likelihood). Use of this data set of vertebrate rhodopsin sequences allowed us to exploit established vertebrate relationships, as well as the considerable amount known about the molecular evolution of this gene, in order to identify important factors contributing to the spurious reconstructions. Simulation studies using parametric bootstrapping indicate that it is unlikely that the spurious nodes in the parsimony analyses are due to long branches or other topological effects. Rather, they appear to be due to base compositional bias at third positions, codon bias, and convergent evolution at nucleotide positions encoding the hydrophobic residues isoleucine, leucine, and valine. LogDet distance methods, as well as maximum-likelihood methods which allow for nonstationary changes in base composition, reduce but do not entirely eliminate support for the spurious resolutions. Inclusion of five additional rhodopsin sequences in the phylogenetic analyses largely corrected one of the spurious reconstructions while leaving the other unaffected. The additional sequences not only were more proximal to the corrected node, but were also found to have intermediate levels of base composition and codon bias as compared with neighboring sequences on the tree. This study shows that the spurious reconstructions can be corrected either by excluding third positions, as well as those encoding the amino acids Ile, Val, and Leu (which may not be ideal, as these sites can contain useful phylogenetic signal for other parts of the tree), or by the addition of sequences that reduce problems associated with convergent evolution.Keywords
This publication has 45 references indexed in Scilit:
- Molecular phylogenies become functionalTrends in Ecology & Evolution, 1999
- Base Compositional Bias and Phylogenetic Analyses: A Test of the “Flying DNA” HypothesisMolecular Phylogenetics and Evolution, 1998
- Control of rhodopsin activity in visionEye, 1998
- Opsin Phylogeny and Evolution: A Model for Blue Shifts in Wavelength RegulationMolecular Phylogenetics and Evolution, 1995
- Rhodopsin: structure, function, and geneticsBiochemistry, 1992
- The ‘effective number of codons’ used in a geneGene, 1990
- Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in hominoideaJournal of Molecular Evolution, 1989
- Ancient origin of lactalbumin from lysozyme: Analysis of DNA and amino acid sequencesJournal of Molecular Evolution, 1988
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequencesJournal of Molecular Evolution, 1980