The Adaptive Evolution Database (TAED)
Open Access
- 24 July 2001
- journal article
- Published by Springer Nature in Genome Biology
Abstract
The Master Catalog is a collection of evolutionary families, including multiple sequence alignments, phylogenetic trees and reconstructed ancestral sequences, for all protein-sequence modules encoded by genes in GenBank. It can therefore support large-scale genomic surveys, one of which we present here: The Adaptive Evolution Database (TAED). In TAED, potential examples of positive adaptation are identified by high values of the normalized ratio of nonsynonymous to synonymous nucleotide substitution rates (KA/KS values) on branches of an evolutionary tree between nodes representing reconstructed ancestral sequences. Evolutionary trees and reconstructed ancestral sequences were extracted from the Master Catalog for every subtree containing proteins from the Chordata only or the Embryophyta only, and branches with high KA/KS values were identified. These branches represent candidate episodes in the history of a protein family when the protein may have undergone positive selection, in which the mutant form conferred greater fitness than the ancestral form. Such episodes are frequently associated with a change in function. An unexpectedly large number of families (between 10% and 20% of those examined) were found to have at least one branch with a KA/KS value above arbitrarily chosen cut-offs (1 and 0.6). Most of these survived a robustness test and were collected into TAED. TAED is a raw resource for bioinformaticists interested in data mining and for experimental evolutionists seeking candidate examples of adaptive evolution for further experimental study. It can be expanded to include other evolutionary information (for example, changes in gene regulation or splicing) placed in a phylogenetic perspective.
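The branch-screening step described in the abstract can be illustrated with a minimal Python sketch. It is not the TAED pipeline: it assumes that KA and KS have already been estimated for each branch between reconstructed ancestral nodes, and the `Branch` record, the function names and the example values are all hypothetical. Only the two cut-offs (1 and 0.6) come from the abstract.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class Branch:
    """One tree branch between two nodes (hypothetical record layout)."""
    family: str   # identifier of the protein family (assumed naming)
    parent: str   # ancestral node label
    child: str    # descendant node label
    ka: float     # nonsynonymous substitution rate, assumed precomputed
    ks: float     # synonymous substitution rate, assumed precomputed


def ka_ks_ratio(branch: Branch) -> Optional[float]:
    """Return KA/KS, or None when KS is zero and the ratio is undefined."""
    if branch.ks <= 0:
        return None
    return branch.ka / branch.ks


def flag_branches(branches: List[Branch], cutoff: float) -> List[Branch]:
    """Keep branches whose KA/KS ratio is strictly above the cut-off."""
    flagged = []
    for b in branches:
        ratio = ka_ks_ratio(b)
        if ratio is not None and ratio > cutoff:
            flagged.append(b)
    return flagged


def families_with_hits(branches: List[Branch], cutoff: float) -> Set[str]:
    """Families with at least one branch above the cut-off."""
    return {b.family for b in flag_branches(branches, cutoff)}


# Made-up example values, for illustration only.
example = [
    Branch("famA", "n1", "n2", ka=0.030, ks=0.020),  # ratio about 1.5
    Branch("famA", "n1", "n3", ka=0.010, ks=0.050),  # ratio about 0.2
    Branch("famB", "n4", "n5", ka=0.014, ks=0.020),  # ratio about 0.7
    Branch("famC", "n6", "n7", ka=0.005, ks=0.0),    # undefined, skipped
]

strict = families_with_hits(example, cutoff=1.0)
relaxed = families_with_hits(example, cutoff=0.6)
```

With these made-up values, the stricter cut-off flags only `famA`, while the relaxed one also flags `famB`. Branches with no synonymous change are skipped rather than treated as infinitely high ratios, which is a design choice of this sketch and not something the abstract specifies.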