Gene3D: Structural Assignment for Whole Genes and Genomes Using the CATH Domain Structure Database
Open Access
- 1 March 2002
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 12 (3) , 503-514
- https://doi.org/10.1101/gr.213802
Abstract
We present a novel web-based resource,Gene3D, of precalculated structural assignments to gene sequences and whole genomes. This resource assigns structural domains from the CATH database to whole genes and links these to their curated functional and structural annotations within the CATH domain structure database, the functional Dictionary of Homologous Superfamilies (DHS) and PDBsum. CurrentlyGene3D provides annotation for 36 complete genomes (two eukaryotes, six archaea, and 28 bacteria). On average, between 30% and 40% of the genes of a given genome can be structurally annotated. Matches to structural domains are found using the profile-based method (PSI-BLAST). and a novel protocol, DRange, is used to resolve conflicts in matches involving different homologous superfamilies.Keywords
This publication has 37 references indexed in Scilit:
- Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structureJournal of Molecular Biology, 2001
- Predicting transmembrane protein topology with a hidden markov model: application to complete genomes11Edited by F. CohenJournal of Molecular Biology, 2001
- The CATH Dictionary of Homologous Superfamilies (DHS): a consensus approach for identifying distant structural homologuesProtein Engineering, Design and Selection, 2000
- The Protein Data BankNucleic Acids Research, 2000
- Benchmarking PSI-BLAST in genome annotation 1 1Edited by G. von HeijneJournal of Molecular Biology, 1999
- GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequencesJournal of Molecular Biology, 1999
- Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methodsJournal of Molecular Biology, 1998
- Genome‐wide analysis of integral membrane proteins from eubacterial, archaean, and eukaryotic organismsProtein Science, 1998
- Intermediate sequences increase the detection of homology between sequencesJournal of Molecular Biology, 1997
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997