ProtoMap: automatic classification of protein sequences and hierarchy of protein families
- 1 January 2000
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 28 (1) , 49-55
- https://doi.org/10.1093/nar/28.1.49
Abstract
The ProtoMap site offers an exhaustive classification of all proteins in the SWISS-PROT database, into groups of related proteins. The classification is based on analysis of all pairwise similarities among protein sequences, The analysis makes essential use of transitivity to identify homologies among proteins. Within each group of the classification, every two members are either directly or transitively related. However, transitivity is applied restrictively in order to prevent unrelated proteins from clustering together, The classification is done at different levels of confidence, and yields a hierarchical organization of all:proteins. The resulting classification splits the protein space into well-defined groups of proteins, which are closely correlated with natural biological families and superfamilies. Many clusters contain protein sequences that are not classified by other databases. The hierarchical organization suggested by our analysis may help in detecting finer subfamilies in families of known proteins. In addition it brings forth interesting relationships between protein families, upon which local maps for the neighborhood of protein families can be sketched. The ProtoMap web server can be accessed at http://www.protomap.cs.huji.ac.il.Keywords
This publication has 20 references indexed in Scilit:
- ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisonsNucleic Acids Research, 2000
- Increased coverage of protein families with the Blocks Database serversNucleic Acids Research, 2000
- ProtoMap: automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space.1999
- PRINTS prepares for the new millenniumNucleic Acids Research, 1999
- A Genomic Perspective on Protein FamiliesScience, 1997
- Superfamily classification in PIR-international protein sequence databasePublished by Elsevier ,1996
- Amino acid substitution matrices from protein blocks.Proceedings of the National Academy of Sciences, 1992
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Improved tools for biological sequence comparison.Proceedings of the National Academy of Sciences, 1988
- Comparison of biosequencesAdvances in Applied Mathematics, 1981