The ups and downs of protein topology; rapid comparison of protein structure

Open Access

1 December 2000

journal article
research article
Published by Oxford University Press (OUP) in Protein Engineering, Design and Selection

Vol. 13 (12) , 829-837
https://doi.org/10.1093/protein/13.12.829

Abstract

Protein topology can be described at different levels. At the most fundamental level, it is a sequence of secondary structure elements (a `primary topology string'). Searching predicted primary topology strings against a library of strings from known protein structures is the basis of some protein fold recognition methods. Here a method known as TOPSCAN is presented for rapid comparison of protein structures. Rather than a simple two-letter alphabet (encoding strand and helix), more complex alphabets are used encoding direction, proximity, accessibility and length of secondary elements and loops in addition to secondary structure. Comparisons are made between the structural information content of primary topology strings and encodings which contain additional information (`secondary topology strings'). The algorithm is extremely fast, with a scan of a large domain against a library of more than 2000 secondary structure strings completing in ∼30 s. Analysis of protein fold similarity using TOPSCAN at primary and secondary topology levels is presented.

Keywords

PROTEIN STRUCTURE

This publication has 29 references indexed in Scilit:

The Protein Data Bank
Nucleic Acids Research, 2000
Motif-based searching in TOPS protein topology databases.
Bioinformatics, 1999
The crystal structure of plant acetohydroxy acid isomeroreductase complexed with NADPH, two magnesium ions and a herbicidal transition state analog determined at 1.65Aresolution
The EMBO Journal, 1997
The structure of elongation factor G in complex with GDP: conformational flexibility and nucleotide exchange
Structure, 1996
Knowledge‐based protein secondary structure assignment
Proteins-Structure Function and Bioinformatics, 1995
The double cubic lattice method: Efficient approaches to numerical integration of surface area and volume and to dot surface contouring of molecular assemblies
Journal of Computational Chemistry, 1995
Improved strategy in analytic surface calculation for molecular systems: Handling of singularities and computational efficiency
Journal of Computational Chemistry, 1993
A new approach to protein fold recognition
Nature, 1992
Structure of ferricytochrome c′ from Rhodospirillum molischianum at 1.67 Å resolution
Journal of Molecular Biology, 1985
Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features
Biopolymers, 1983