Integrated graphical analysis of protein sequence features predicted from sequence composition

21 September 2001

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 45 (3) , 262-273
https://doi.org/10.1002/prot.1146

Abstract

Several protein sequence analysis algorithms are based on properties of amino acid composition and repetitiveness. These include methods for prediction of secondary structure elements, coiled‐coils, transmembrane segments or signal peptides, and for assignment of low‐complexity, nonglobular, or intrinsically unstructured regions. The quality of such analyses can be greatly enhanced by graphical software tools that present predicted sequence features together in context and allow judgment to be focused simultaneously on several different types of supporting information. For these purposes, we describe the SFINX package, which allows many different sets of segmental or continuous‐curve sequence feature data, generated by individual external programs, to be viewed in combination alongside a sequence dot‐plot or a multiple alignment of database matches. The implementation is currently based on extensions to the graphical viewers Dotter and Blixem and scripts that convert data from external programs to a simple generic data definition format called SFS. We describe applications in which dot‐plots and flanking database matches provide valuable contextual information for analyses based on compositional and repetitive sequence features. The system is also useful for comparing results from algorithms run with a range of parameters to determine appropriate values for defaults or cutoffs for large‐scale genomic analyses. Proteins 2001;45:262–273.

Keywords

This publication has 27 references indexed in Scilit:

Statistics of local complexity in amino acid sequences and sequence databases
Published by Elsevier ,2001
Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm
Journal of Molecular Biology, 1999
The Biopolymer Markup Language.
Bioinformatics, 1999
Principles governing amino acid composition of integral membrane proteins: application to topology prediction 1 1Edited by J. Thornton
Journal of Molecular Biology, 1998
A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis
Gene, 1995
bioTk: Componentry for genome informatics graphical user interfaces
Gene, 1995
Sequences with ‘unusual’ amino acid compositions
Current Opinion in Structural Biology, 1994
A Model Recognition Approach to the Prediction of All-Helical Membrane Protein Structure and Topology
Biochemistry, 1994
Predicting Coiled Coils from Protein Sequences
Science, 1991
A simple method for displaying the hydropathic character of a protein
Journal of Molecular Biology, 1982