Improvements to CluSTr: the database of SWISS-PROT+TrEMBL protein clusters
- 1 January 2003
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (1) , 388-389
- https://doi.org/10.1093/nar/gkg035
Abstract
The CluSTr database (http://www.ebi.ac.uk/clustr/) offers an automatic classification of SWISS-PROT+TrEMBL proteins into groups of related proteins. The clustering is based on analysis of all pairwise sequence comparisons between proteins using the Smith-Waterman algorithm. The analysis, carried out at different levels of protein similarity, yields a hierarchical organization of clusters. Information about the domain content of the clustered proteins is provided via the InterPro resource. The newly introduced InterPro 'condensed graphical view' simplifies the visual analysis of the represented domain architectures. Integrated applications allow users to visualize and edit multiple alignments and build sequence divergence trees. Links to the relevant structural data in the Protein Data Bank (PDB) and Homology-derived Secondary Structure of Proteins (HSSP) are also provided.
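To illustrate how clustering at several similarity levels produces a hierarchy, here is a minimal sketch. It is not the CluSTr implementation: the single-linkage rule, the function names (`cluster_at_threshold`, `hierarchy`), the protein identifiers, the scores and the thresholds are all assumptions made for illustration. The abstract says only that pairwise Smith-Waterman comparisons are analysed at different similarity levels.

```python
from collections import defaultdict


def cluster_at_threshold(proteins, scores, threshold):
    """Group proteins by single linkage (an assumed rule, not confirmed by the source).

    Two proteins are linked when their pairwise similarity score is at least
    `threshold`; clusters are the connected components of those links.
    """
    parent = {p: p for p in proteins}

    def find(p):
        # Union-find lookup with path halving.
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    for (a, b), score in scores.items():
        if score >= threshold:
            parent[find(a)] = find(b)

    groups = defaultdict(set)
    for p in proteins:
        groups[find(p)].add(p)
    return sorted(sorted(g) for g in groups.values())


def hierarchy(proteins, scores, thresholds):
    """Cluster at each threshold; stricter levels split the looser clusters."""
    return {t: cluster_at_threshold(proteins, scores, t) for t in sorted(thresholds)}


# Hypothetical identifiers and similarity scores, for illustration only.
proteins = ["P1", "P2", "P3", "P4"]
scores = {("P1", "P2"): 40.0, ("P2", "P3"): 12.0, ("P3", "P4"): 5.0}
levels = hierarchy(proteins, scores, [8.0, 20.0])

for level, clusters in levels.items():
    print(level, clusters)
```

With these made-up scores, the looser level merges P1, P2 and P3 into one cluster, while the stricter level keeps only P1 and P2 together. Every cluster at a stricter level is nested inside a cluster at a looser level, which is what makes the organization hierarchical.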