A ‘periodic table’ for protein structures
Open Access
- 1 April 2002
- journal article
- letter
- Published by Springer Nature in Nature
- Vol. 416 (6881), 657-660
- https://doi.org/10.1038/416657a
Abstract
Current structural genomics programs aim systematically to determine the structures of all proteins coded in both human and other genomes, providing a complete picture of the number and variety of protein structures that exist. In the past, estimates have been made on the basis of the incomplete sample of structures currently known. These estimates have varied greatly (between 1,000 and 10,000; see for example refs 1 and 2), partly because of limited sample size but also owing to the difficulties of distinguishing one structure from another. This distinction is usually topological, based on the fold of the protein; however, in strict topological terms (neglecting to consider intra-chain cross-links), protein chains are open strings and hence are all identical. To avoid this trivial result, topologies are determined by considering secondary links in the form of intra-chain hydrogen bonds (secondary structure) and tertiary links formed by the packing of secondary structures. However, small additions to or loss of structure can make large changes to these perceived topologies and such subjective solutions are neither robust nor amenable to automation. Here I formalize both secondary and tertiary links to allow the rigorous and automatic definition of protein topology.
This publication has 19 references indexed in Scilit:
- Structure Comparison and Structure Patterns. Journal of Computational Biology, 2000
- HOMSTRAD: A database of protein structure alignments for homologous families. Protein Science, 1998
- CATH – a hierarchic classification of protein domain structures. Published by Elsevier, 1997
- Dali/FSSP classification of three-dimensional protein folds. Nucleic Acids Research, 1997
- Protein superfamilies and domain superfolds. Nature, 1994
- One thousand families for the molecular biologist. Nature, 1992
- The classification and origins of protein folding patterns. Annual Review of Biochemistry, 1990
- Why do globular proteins fit the limited set of folding patterns? Progress in Biophysics and Molecular Biology, 1987
- Analysis and prediction of the packing of α-helices against a β-sheet in the tertiary structure of globular proteins. Journal of Molecular Biology, 1982
- Analysis and prediction of protein β-sheet structures by a combinatorial approach. Nature, 1980