Improved general regression network for protein domain boundary prediction
Open Access
- 13 February 2008
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 9 (S1) , S12
- https://doi.org/10.1186/1471-2105-9-s1-s12
Abstract
Protein domains present some of the most useful information that can be used to understand protein structure and functions. Recent research on protein domain boundary prediction has been mainly based on widely known machine learning techniques, such as Artificial Neural Networks and Support Vector Machines. In this study, we propose a new machine learning model (IGRN) that can achieve accurate and reliable classification, with significantly reduced computations. The IGRN was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence.Keywords
This publication has 38 references indexed in Scilit:
- Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker indexBMC Bioinformatics, 2006
- Partitioning Protein Structures into Domains: Why Is it so Difficult?Journal of Molecular Biology, 2006
- DOMpro: Protein Domain Prediction Using Profiles, Secondary Structure, Relative Solvent Accessibility, and Recursive Neural NetworksData Mining and Knowledge Discovery, 2006
- Armadillo: Domain Boundary Prediction by Amino Acid CompositionJournal of Molecular Biology, 2005
- Bidirectional segmented-memory recurrent neural network for protein secondary structure predictionSoft Computing, 2005
- Interdomain Contact Regions and Angles Between Adjacent Short Consensus Repeat DomainsJournal of Molecular Biology, 2004
- The Protein Data BankNucleic Acids Research, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- One thousand families for the molecular biologistNature, 1992
- Nucleation, Rapid Folding, and Globular Intrachain Regions in ProteinsProceedings of the National Academy of Sciences, 1973