Signatures of Domain Shuffling in the Human Genome
Open Access
- 1 November 2002
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 12 (11) , 1642-1650
- https://doi.org/10.1101/gr.520702
Abstract
To elucidate the role of exon shuffling in shaping the complexity of the human genome/proteome, we have systematically analyzed intron phase distributions in the coding sequence of human protein domains. We found that introns at the boundaries of domains show high excess of symmetrical phase combinations (i.e., 0–0, 1–1, and 2–2), whereas nonboundary introns show no excess symmetry. This suggests that exon shuffling has primarily involved rearrangement of structural and functional domains as a whole. Furthermore, we found that domains flanked by phase 1 introns have dramatically expanded in the human genome due to domain shuffling and that 1–1 symmetrical domains and domain families are nonrandomly distributed with respect to their age. The predominance and extracellular location of 1–1 symmetrical domains among domains specific to metazoans suggests that they are associated with the rise of multicellularity. On the other hand, 0–0 symmetrical domains tend to be over-represented among ancient protein domains that are shared between the eukaryotic and prokaryotic kingdoms, which is compatible with the suggestion of primordial domain shuffling in the progenote. To see whether the human data reflect general genomic patterns of metazoans, similar analyses were done for the nematodeCaenorhabditis elegans. Although the C. elegans data generally concur with the human patterns, we identified fewer intron-bounded domains in this organism, consistent with the lower complexity of C. elegans genes. [The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: Z. Gu and R. Stevens.]Keywords
This publication has 44 references indexed in Scilit:
- Protein Repeats: Structures, Functions, and EvolutionJournal of Structural Biology, 2001
- The Sequence of the Human GenomeScience, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- Genome evolution and the evolution of exon-shuffling — a reviewGene, 1999
- A comparison of sequence and structure protein domain families as a basis for structural genomics.Bioinformatics, 1999
- Introns and reading frames: correlation between splicing sites and their codon positionsMolecular Biology and Evolution, 1996
- A dominant mutation in the Ikaros gene leads to rapid development of leukemia and lymphomaCell, 1995
- Polycystic kidney disease: The complete structure of the PKD1 gene and its proteinCell, 1995
- Introns as relict retrotransposons: Implications for the evolutionary origin of eukaryotic mRNA splicing mechanismsJournal of Theoretical Biology, 1986
- Selfish DNA: the ultimate parasiteNature, 1980