Evolutionary simulations to detect functional lineage-specific genes
Open Access
- 9 June 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 22 (15) , 1815-1822
- https://doi.org/10.1093/bioinformatics/btl280
Abstract
Motivation: Supporting the functionality of recent duplicate gene copies is usually difficult, owing to high sequence similarity between duplicate counterparts and shallow phylogenies, which hamper both the statistical and experimental inference. Results: We developed an integrated evolutionary approach to identify functional duplicate gene copies and other lineage-specific genes. By repeatedly simulating neutral evolution, our method estimates the probability that an ORF was selectively conserved and is therefore likely to represent a bona fide coding region. In parallel, our method tests whether the accumulation of non-synonymous substitutions reveals signatures of selective constraint. We show that our approach has high power to identify functional lineage-specific genes using simulated and real data. For example, a coding region of average length (∼1400 bp), restricted to hominoids, can be predicted to be functional in ∼94–100% of cases. Notably, the method may support functionality for instances where classical selection tests based on the ratio of non-synonymous to synonymous substitutions fail to reveal signatures of selection. Our method is available as an automated tool, ReEVOLVER, which will also be useful to systematically detect functional lineage-specific genes of closely related species on a large scale. Availability: ReEVOLVER is available at . Contact:Henrik.Kaessmann@unil.ch Supplementary Data: Supplementary Data are available at Bioinformatics online.Keywords
This publication has 40 references indexed in Scilit:
- Emergence of Young Human Genes after a Burst of Retroposition in PrimatesPLoS Biology, 2005
- A genome-wide comparison of recent chimpanzee and human segmental duplicationsNature, 2005
- Bayes Empirical Bayes Inference of Amino Acid Sites Under Positive SelectionMolecular Biology and Evolution, 2005
- Birth and adaptive evolution of a hominoid gene that supports high neurotransmitter fluxNature Genetics, 2004
- eShadow: A Tool for Comparing Closely Related SequencesGenome Research, 2004
- Comparative genomics at the vertebrate extremesNature Reviews Genetics, 2004
- The origin of new genes: glimpses from the young and oldNature Reviews Genetics, 2003
- Segmental duplications and the evolution of the primate genomeNature Reviews Genetics, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequencesJournal of Molecular Evolution, 1980