Molecular phylogeny of the kelch-repeat superfamily reveals an expansion of BTB/kelch proteins in animals
Open Access
- 17 September 2003
- journal article
- review article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 4 (1) , 42
- https://doi.org/10.1186/1471-2105-4-42
Abstract
The kelch motif is an ancient and evolutionarily-widespread sequence motif of 44–56 amino acids in length. It occurs as five to seven repeats that form a β-propeller tertiary structure. Over 28 kelch-repeat proteins have been sequenced and functionally characterised from diverse organisms spanning from viruses, plants and fungi to mammals and it is evident from expressed sequence tag, domain and genome databases that many additional hypothetical proteins contain kelch-repeats. In general, kelch-repeat β-propellers are involved in protein-protein interactions, however the modest sequence identity between kelch motifs, the diversity of domain architectures, and the partial information on this protein family in any single species, all present difficulties to developing a coherent view of the kelch-repeat domain and the kelch-repeat protein superfamily. To understand the complexity of this superfamily of proteins, we have analysed by bioinformatics the complement of kelch-repeat proteins encoded in the human genome and have made comparisons to the kelch-repeat proteins encoded in other sequenced genomes. We identified 71 kelch-repeat proteins encoded in the human genome, whereas 5 or 8 members were identified in yeasts and around 18 in C. elegans, D. melanogaster and A. gambiae. Multiple domain architectures were identified in each organism, including previously unrecognised forms. The vast majority of kelch-repeat domains are predicted to form six-bladed β-propellers. The most prevalent domain architecture in the metazoan animal genomes studied was the BTB/kelch domain organisation and we uncovered 3 subgroups of human BTB/kelch proteins. Sequence analysis of the kelch-repeat domains of the most robustly-related subgroups identified differences in β-propeller organisation that could provide direction for experimental study of protein-binding characteristics. The kelch-repeat superfamily constitutes a distinct and evolutionarily-widespread family of β-propeller domain-containing proteins. Expansion of the family during the evolution of multicellular animals is mainly accounted for by a major expansion of the BTB/kelch domain architecture. BTB/kelch proteins constitute 72 % of the kelch-repeat superfamily of H. sapiens and form three subgroups, one of which appears the most-conserved during evolution. Distinctions in propeller blade organisation between subgroups 1 and 2 were identified that could provide new direction for biochemical and functional studies of novel kelch-repeat proteins.Keywords
This publication has 109 references indexed in Scilit:
- The genome sequence of the filamentous fungus Neurospora crassaNature, 2003
- Genome sequence of the human malaria parasite Plasmodium falciparumNature, 2002
- The genome sequence of Schizosaccharomyces pombeNature, 2002
- Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculiNature, 2001
- The Genome Sequence of Drosophila melanogasterScience, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Life with 6000 GenesScience, 1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Structural principles for the propeller assembly of β‐sheets: The preference for seven‐fold symmetryProteins-Structure Function and Bioinformatics, 1992
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990