CDD: a Conserved Domain Database for protein classification
Top Cited Papers
Open Access
- 17 December 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 33 (Database ) , D192-D196
- https://doi.org/10.1093/nar/gki069
Abstract
The Conserved Domain Database (CDD) is the protein classification component of NCBI's Entrez query and retrieval system. CDD is linked to other Entrez databases such as Proteins, Taxonomy and PubMed(R), and can be accessed at http://www.ncbi.nlm.nih. gov/entrez/query.fcgi?db=cdd. CD-Search, which is available at http://www.ncbi.nim.nih.gov/Structure/ cdd/wrpsb.cgi, is a fast, interactive tool to identify conserved domains in new protein sequences. CD-Search results for protein sequences in Entrez are pre-computed to provide links between proteins and domain models, and computational annotation visible upon request. Protein-protein queries submitted to NCBI's BLAST search service at http:// www.ncbi.nim.nih.gov/BLAST are scanned for the presence of conserved domains by default. While CDD started out as essentially a mirror of publicly available domain alignment collections, such as SMART, Pfam and COG, we have continued an effort to update, and in some cases replace these models with domain hierarchies curated at the NCBI. Here, we report on the progress of the curation effort and associated improvements in the functionality of the CDD information retrieval system.Keywords
This publication has 10 references indexed in Scilit:
- CD-Search: protein domain annotations on the flyNucleic Acids Research, 2004
- The last CTD repeat of the mammalian RNA polymerase II large subunit is important for its stabilityNucleic Acids Research, 2004
- SMART 4.0: towards genomic data integrationNucleic Acids Research, 2004
- Database resources of the National Center for Biotechnology Information: updateNucleic Acids Research, 2004
- The Pfam protein families databaseNucleic Acids Research, 2004
- The COG database: an updated version includes eukaryotesBMC Bioinformatics, 2003
- CDART: Protein Homology by Domain ArchitectureGenome Research, 2002
- Comparison of sequence and structure alignments for protein domainsProteins-Structure Function and Bioinformatics, 2002
- CDD: a database of conserved domain alignments with links to domain three-dimensional structureNucleic Acids Research, 2002
- Cn3D: sequence and structure views for EntrezTrends in Biochemical Sciences, 2000