Genome cartography through domain annotation
Open Access
- 3 July 2001
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The evolutionary history of eukaryotic proteins involves rapid sequence divergence, addition and deletion of domains, and fusion and fission of genes. Although the protein repertoires of distantly related species differ greatly, their domain repertoires do not. To account for the great diversity of domain contexts and an unexpected paucity of ortholog conservation, we must categorize the coding regions of completely sequenced genomes into domain families, as well as protein families.This publication has 16 references indexed in Scilit:
- Mouse genomics: Making sense of the sequenceCurrent Biology, 2001
- The Sequence of the Human GenomeScience, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- TIGRFAMs: a protein family resource for the functional identification of proteinsNucleic Acids Research, 2001
- The InterPro database, an integrated documentation resource for protein families, domains and functional sitesNucleic Acids Research, 2001
- The Pfam Protein Families DatabaseNucleic Acids Research, 2000
- SMART, a simple modular architecture research tool: Identification of signaling domainsProceedings of the National Academy of Sciences, 1998
- branchless Encodes a Drosophila FGF Homolog That Controls Tracheal Cell Migration and the Pattern of BranchingCell, 1996
- Whole-Genome Random Sequencing and Assembly of Haemophilus influenzae RdScience, 1995
- Three-dimensional structure of human basic fibroblast growth factor, a structural homolog of interleukin 1 beta.Proceedings of the National Academy of Sciences, 1991