Probabilistic Cross-Species Inference of Orthologous Genomic Regions Created by Whole-Genome Duplication in Yeast
- 1 July 2008
- journal article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 179 (3) , 1681-1692
- https://doi.org/10.1534/genetics.107.074450
Abstract
Identification of orthologous genes across species becomes challenging in the presence of a whole-genome duplication (WGD). We present a probabilistic method for identifying orthologs that considers all possible orthology/paralogy assignments for a set of genomes with a shared WGD (here five yeast species). This approach allows us to estimate how confident we can be in the orthology assignments in each genomic region. Two inferences produced by this model are indicative of purifying selection acting to prevent duplicate gene loss. First, our model suggests that there are significant differences (up to a factor of seven) in duplicate gene half-life. Second, we observe differences between the genes that the model infers to have been lost soon after WGD and those lost more recently. Gene losses soon after WGD appear uncorrelated with gene expression level and knockout fitness defect. However, later losses are biased toward genes whose paralogs have high expression and large knockout fitness defects, as well as showing biases toward certain functional groups such as ribosomal proteins. We suggest that while duplicate copies of some genes may be lost neutrally after WGD, another set of genes may be initially preserved in duplicate by natural selection for reasons including dosage.
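The idea of weighing every possible orthology/paralogy assignment and reporting a confidence for each can be illustrated with a small sketch. This is not the model from the article: the species names, the `toy_log_score` function, and the evidence values are all hypothetical placeholders, and the toy scores each species independently, whereas a real model would presumably couple them through the phylogeny and gene-loss process. It assumes each species carries two duplicated tracks in a region, so an assignment is a choice, per species, of whether to keep or swap the track labels relative to a reference species.

```python
from itertools import product
from math import exp

# Placeholder labels for five genomes sharing a WGD (not the real species names).
SPECIES = ["sp1", "sp2", "sp3", "sp4", "sp5"]

def enumerate_assignments(n_species):
    """Yield every keep(0)/swap(1) labeling of the two duplicated tracks.

    The first species is fixed to 0 as the reference, which removes the
    global relabeling symmetry and leaves 2**(n_species - 1) assignments.
    """
    for rest in product((0, 1), repeat=n_species - 1):
        yield (0,) + rest

def toy_log_score(assignment, evidence):
    """Hypothetical log-score: evidence[i] = (log-score if kept, log-score if swapped)."""
    return sum(evidence[i][choice] for i, choice in enumerate(assignment))

def posterior(log_scores):
    """Normalize log-scores into probabilities that sum to 1."""
    top = max(log_scores.values())  # subtract the maximum for numerical stability
    weights = {a: exp(s - top) for a, s in log_scores.items()}
    total = sum(weights.values())
    return {a: w / total for a, w in weights.items()}

# Made-up evidence for one genomic region, one (keep, swap) pair per species.
evidence = [(0.0, 0.0), (-0.1, -2.0), (-3.0, -0.2), (-0.5, -1.5), (-0.3, -0.4)]

scores = {a: toy_log_score(a, evidence) for a in enumerate_assignments(len(SPECIES))}
post = posterior(scores)
best = max(post, key=post.get)
print(dict(zip(SPECIES, best)), round(post[best], 3))
```

The probability attached to the best assignment plays the role of a per-region confidence: when the evidence for a species is nearly balanced, as for `sp5` here, the probability mass is split between competing assignments and the confidence drops.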