GenomeHistory: a software tool and its application to fully sequenced genomes
Open Access
- 1 August 2002
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 30 (15) , 3378-3386
- https://doi.org/10.1093/nar/gkf449
Abstract
We present a publicly available software tool (http://www.unm.edu/~compbio/software/GenomeHistory) that identifies all pairs of duplicate genes in a genome and then determines the degree of synonymous and non‐synonymous divergence between each duplicate pair. Using this tool, we analyze the relations between (i) gene function and the propensity of a gene to duplicate and (ii) the number of genes in a gene family and the family’s rate of sequence evolution. We do so for the complete genomes of four eukaryotes (fission and budding yeast, fruit fly and nematode) and one prokaryote (Escherichia coli). For some classes of genes we observe a strong relationship between gene function and a gene’s propensity to undergo duplication. Most notably, ribosomal genes and transcription factors appear less likely to undergo gene duplication than other genes. In both fission and budding yeast, we see a strong positive correlation between the selective constraint on a gene and the size of the gene family of which this gene is a member. In contrast, a weakly negative such correlation is seen in multicellular eukaryotes.
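To make the distinction between synonymous and non-synonymous divergence concrete, the following minimal sketch classifies codon differences between two coding sequences that are assumed to be already aligned in frame. It is not the GenomeHistory implementation and does not estimate substitution rates; it only counts differing codons under the standard genetic code. The function name `classify_codon_differences` and the example sequences are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_codon_differences(seq_a, seq_b):
    """Count codon positions that differ between two in-frame aligned
    coding sequences, split into synonymous and non-synonymous changes.

    Returns (synonymous, non_synonymous). Codons containing gaps or
    ambiguous bases are skipped. This is a raw count, not a rate estimate.
    """
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 != 0:
        raise ValueError("sequences must be aligned, equal length, and in frame")

    synonymous = 0
    non_synonymous = 0
    for i in range(0, len(seq_a), 3):
        codon_a = seq_a[i:i + 3].upper()
        codon_b = seq_b[i:i + 3].upper()
        if codon_a == codon_b:
            continue
        if codon_a not in CODON_TABLE or codon_b not in CODON_TABLE:
            continue  # gap or ambiguous base: not classified
        if CODON_TABLE[codon_a] == CODON_TABLE[codon_b]:
            synonymous += 1      # same amino acid, different codon
        else:
            non_synonymous += 1  # amino acid changed
    return synonymous, non_synonymous


# Hypothetical pair: CTT -> CTC keeps leucine, AAA -> AGA changes lysine to arginine.
result = classify_codon_differences("ATGCTTAAA", "ATGCTCAGA")
```

A real analysis would normalize such counts by the number of synonymous and non-synonymous sites and correct for multiple substitutions at the same site; the sketch leaves both steps out.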