The human phylome
Open Access
- 13 June 2007
- journal article
- Published by Springer Nature in Genome Biology
- Vol. 8 (6), R109
- https://doi.org/10.1186/gb-2007-8-6-r109
Abstract
Background: Phylogenomic analyses serve to establish evolutionary relationships among organisms and their genes. A phylome, the complete collection of all gene phylogenies in a genome, constitutes a valuable source of information, but its use in large genomes remains a technical challenge. The use of phylomes also requires the development of new methods that help us to interpret them.

Results: We reconstruct here the human phylome, which includes the evolutionary relationships of all human proteins and their homologs among 39 fully sequenced eukaryotes. Phylogenetic techniques used include alignment trimming, branch length optimization, evolutionary model testing, and maximum likelihood and Bayesian methods. Although differences with alternative topologies are minor, most of the trees support the Coelomata and Unikont hypotheses as well as the grouping of primates with Laurasiatheria to the exclusion of rodents. We assess the extent of gene duplication events and their relationship with the functional roles of the protein families involved. We find support for at least one, and probably two, rounds of whole genome duplication before the vertebrate radiation. Using a novel algorithm that is independent of a species phylogeny, we derive orthology and paralogy relationships of human proteins among eukaryotic genomes.

Conclusion: Topological variations among phylogenies for different genes are to be expected, highlighting the danger of gene-sampling effects in phylogenomic analyses. Several links can be established between the functions of gene families duplicated at certain phylogenetic splits and major evolutionary transitions in those lineages. The pipeline implemented here can be easily adapted for use in other organisms.
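The abstract does not describe how orthology and paralogy are derived without a species phylogeny. As an illustration only, the sketch below uses a species-overlap rule on a rooted binary gene tree: an internal node is treated as a duplication when its two child subtrees share at least one species, and as a speciation otherwise. Whether this matches the authors' algorithm is an assumption. The `Node` class, the `classify_pairs` function, and the `SPECIES_gene` leaf-naming convention are all hypothetical and introduced here for the example.

```python
from dataclasses import dataclass, field
from itertools import product
from typing import Dict, FrozenSet, List, Optional


@dataclass
class Node:
    """Hypothetical minimal gene-tree node: leaves carry a name, internal nodes two children."""
    name: Optional[str] = None
    children: List["Node"] = field(default_factory=list)


def species_of(leaf_name: str) -> str:
    # Assumed naming convention: "<species code>_<gene id>", e.g. "Hsa_A".
    return leaf_name.split("_", 1)[0]


def classify_pairs(root: Node) -> Dict[FrozenSet[str], str]:
    """Label every pair of leaves as 'ortholog' or 'paralog' from the gene tree alone."""
    relations: Dict[FrozenSet[str], str] = {}

    def visit(node: Node) -> List[str]:
        if not node.children:
            return [node.name]
        left = visit(node.children[0])
        right = visit(node.children[1])
        # Species present on both sides of the split imply a duplication node.
        shared = {species_of(x) for x in left} & {species_of(x) for x in right}
        kind = "paralog" if shared else "ortholog"
        # Each leaf pair is separated at exactly one node: its last common ancestor.
        for a, b in product(left, right):
            relations[frozenset((a, b))] = kind
        return left + right

    visit(root)
    return relations


# Toy tree: (((Hsa_A, Mmu_A), (Hsa_B, Mmu_B)), Dme_X)
leaf = lambda n: Node(name=n)
tree = Node(children=[
    Node(children=[
        Node(children=[leaf("Hsa_A"), leaf("Mmu_A")]),
        Node(children=[leaf("Hsa_B"), leaf("Mmu_B")]),
    ]),
    leaf("Dme_X"),
])
rel = classify_pairs(tree)
```

In this toy tree the node joining the A and B subtrees contains human and mouse on both sides, so it is read as a duplication and every A/B pair is a paralog, while the human and mouse copies within each subtree, and every gene paired with the fly outgroup, are orthologs. The rule needs only the species labels on the leaves, which is what makes it independent of any reference species tree.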