Improving the performance of DomainParser for structural domain partition using neural network
Open Access
- 1 February 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (3) , 944-952
- https://doi.org/10.1093/nar/gkg189
Abstract
Structural domains are considered as the basic units of protein folding, evolution, function and design. Automatic decomposition of protein structures into structural domains, though after many years of investigation, remains a challenging and unsolved problem. Manual inspection still plays a key role in domain decomposition of a protein structure. We have previously developed a computer program, DomainParser, using network flow algorithms. The algorithm partitions a protein structure into domains accurately when the number of domains to be partitioned is known. However the performance drops when this number is unclear (the overall performance is 74.5% over a set of 1317 protein chains). Through utilization of various types of structural information including hydrophobic moment profile, we have developed an effective method for assessing the most probable number of domains a structure may have. The core of this method is a neural network, which is trained to discriminate correctly partitioned domains from incorrectly partitioned domains. When compared with the manual decomposition results given in the SCOP database, our new algorithm achieves higher decomposition accuracy (81.9%) on the same data set.Keywords
This publication has 22 references indexed in Scilit:
- SCOP database in 2002: refinements accommodate structural genomicsNucleic Acids Research, 2002
- Hydrophobic moments of protein structures: Spatially profiling the distributionProceedings of the National Academy of Sciences, 2001
- Protein domain decomposition using a graph-theoretic approachBioinformatics, 2000
- Domain size distributions can predict domain boundariesBioinformatics, 2000
- Enhanced genome annotation using structural profiles in the program 3D-PSSM 1 1Edited by J. ThorntonJournal of Molecular Biology, 2000
- Identification of structural domains in proteins by a graph heuristic.1999
- The helical hydrophobic moment: a measure of the amphiphilicity of a helixNature, 1982
- The Anatomy and Taxonomy of Protein StructurePublished by Elsevier ,1981
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977
- Nucleation, Rapid Folding, and Globular Intrachain Regions in ProteinsProceedings of the National Academy of Sciences, 1973