The Apicomplexan Whole-Genome Phylogeny: An Analysis of Incongruence among Gene Trees
Open Access
- 5 August 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 25 (12) , 2689-2698
- https://doi.org/10.1093/molbev/msn213
Abstract
The protistan phylum Apicomplexa contains many important pathogens and is the subject of intense genome sequencing efforts. Based upon the genome sequences from seven apicomplexan species and a ciliate outgroup, we identified 268 single-copy genes suitable for phylogenetic inference. Both concatenation and consensus approaches inferred the same species tree topology. This topology is consistent with most prior conceptions of apicomplexan evolution based upon ultrastructural and developmental characters, that is, the piroplasm genera Theileria and Babesia form the sister group to the Plasmodium species, the coccidian genera Eimeria and Toxoplasma are monophyletic and are the sister group to the Plasmodium species and piroplasm genera, and Cryptosporidium forms the sister group to the above mentioned with the ciliate Tetrahymena as the outgroup. The level of incongruence among gene trees appears to be high at first glance; only 19% of the genes support the species tree, and a total of 48 different gene-tree topologies are observed. Detailed investigations suggest that the low signal-to-noise ratio in many genes may be the main source of incongruence. The probability of being consistent with the species tree increases as a function of the minimum bootstrap support observed at tree nodes for a given gene tree. Moreover, gene sequences that generate high bootstrap support are robust to the changes in alignment parameters or phylogenetic method used. However, caution should be taken in that some genes can infer a “wrong” tree with strong support because of paralogy, model violations, or other causes. The importance of examining multiple, unlinked genes that possess a strong phylogenetic signal cannot be overstated.Keywords
This publication has 73 references indexed in Scilit:
- An Improved General Amino Acid Replacement MatrixMolecular Biology and Evolution, 2008
- ToxoDB: an integrated Toxoplasma gondii database resourceNucleic Acids Research, 2007
- PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignmentsNucleic Acids Research, 2006
- Lateral gene transfer in eukaryotesCellular and Molecular Life Sciences, 2005
- The genome of Cryptosporidium hominisNature, 2004
- OrthoMCL: Identification of Ortholog Groups for Eukaryotic GenomesGenome Research, 2003
- Genome sequence of the human malaria parasite Plasmodium falciparumNature, 2002
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990