POEM: Parameter Optimization Using Ensemble Methods: Application to Target Specific Scoring Functions
- 1 September 2005
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 45 (5), 1291-1302
- https://doi.org/10.1021/ci050036g
Abstract
In computational biology, processes such as docking, binding, and folding are often described by simplified, empirical models. These models are fitted to physical properties of the process by adjustable parameters. An appropriate choice of these parameters is crucial for the quality of the models. Locating the best choices for the parameters is often a difficult task, depending on the complexity of the model. We describe a new method and program, POEM (Parameter Optimization using Ensemble Methods), for this task. In POEM we combine the DOE (Design Of Experiment) procedure with ensembles of different regression methods. We apply the method to the optimization of target-specific scoring functions in molecular docking. The method consists of an iterative procedure that alternates evaluation and prediction steps. During each cycle of optimization we fit an approximate function to a defined loss function landscape and improve the quality of this fit from cycle to cycle by constantly augmenting our data set. As test applications we fitted the FlexX and Screenscore scoring functions to the kinase and ATPase protein classes. The results are promising: starting from random parameters, we are able to locate parameter sets which show superior performance compared to the original values. The POEM approach converges quickly and the approximated loss function landscapes are smooth, thus making the approach a suitable method for optimizations on rugged landscapes.
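The iterative procedure summarized in the abstract (evaluate the loss at chosen parameter sets, fit an ensemble of regressors to the data collected so far, use the averaged prediction to pick the next parameter set, then repeat) can be illustrated with a minimal sketch. This is not the POEM program. The toy loss function, the random initial design standing in for a DOE plan, the three regression models, and all names and settings below are assumptions made for illustration only.

```python
import numpy as np


def toy_loss(x):
    """Hypothetical rugged loss: a quadratic bowl plus small ripples."""
    return float(np.sum((x - 0.3) ** 2) + 0.05 * np.sum(np.sin(25.0 * x)))


def linear_features(X):
    """Constant and linear terms for each parameter."""
    return np.hstack([np.ones((len(X), 1)), X])


def quadratic_features(X):
    """Constant, linear, and squared terms for each parameter."""
    return np.hstack([np.ones((len(X), 1)), X, X ** 2])


def fit_ridge(features, X, y, lam=1e-3):
    """Ridge regression on a feature map; returns a prediction function."""
    F = features(X)
    w = np.linalg.solve(F.T @ F + lam * np.eye(F.shape[1]), F.T @ y)
    return lambda Z: features(Z) @ w


def fit_knn(X, y, k=3):
    """k-nearest-neighbour regression; returns a prediction function."""
    def predict(Z):
        dist = np.linalg.norm(Z[:, None, :] - X[None, :, :], axis=2)
        nearest = np.argsort(dist, axis=1)[:, :k]
        return y[nearest].mean(axis=1)
    return predict


def optimize(loss, dim=2, n_init=12, n_cycles=8, n_candidates=2000, seed=0):
    """Alternate evaluation and prediction steps on a growing data set."""
    rng = np.random.default_rng(seed)
    # Initial design: random points here, as a stand-in for a DOE plan.
    X = rng.uniform(-1.0, 1.0, size=(n_init, dim))
    y = np.array([loss(x) for x in X])
    for _ in range(n_cycles):
        # Prediction step: fit an ensemble of different regressors.
        models = [
            fit_ridge(linear_features, X, y),
            fit_ridge(quadratic_features, X, y),
            fit_knn(X, y),
        ]
        candidates = rng.uniform(-1.0, 1.0, size=(n_candidates, dim))
        predicted = np.mean([m(candidates) for m in models], axis=0)
        proposal = candidates[np.argmin(predicted)]
        # Evaluation step: compute the true loss and augment the data set.
        X = np.vstack([X, proposal])
        y = np.append(y, loss(proposal))
    best = int(np.argmin(y))
    return X[best], y[best], y[:n_init].min()


if __name__ == "__main__":
    best_x, best_y, init_best = optimize(toy_loss)
    print("best parameters:", best_x)
    print("best loss:", best_y, "(best of initial design:", init_best, ")")
```

Averaging the ensemble predictions gives a smoother surrogate than any single rippled evaluation, which is one plausible reading of why such an approach might suit rugged landscapes. In a docking setting the toy loss would be replaced by a measure of scoring-function performance on the target class.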
This publication has 16 references indexed in Scilit:
- Target-biased scoring approaches and expert systems in structure-based virtual screening. Current Opinion in Chemical Biology, 2004
- Virtual Screening for Kinase Targets. Current Medicinal Chemistry, 2004
- Protein Flexibility in Ligand Docking and Virtual Screening to Protein Kinases. Journal of Molecular Biology, 2004
- ATPases as drug targets: learning from their structure. Nature Reviews Drug Discovery, 2002
- Knowledge-based scoring function to predict protein-ligand interactions. Journal of Molecular Biology, 2000
- A General and Fast Scoring Function for Protein−Ligand Interactions: A Simplified Potential Approach. Journal of Medicinal Chemistry, 1999
- All-Atom Empirical Potential for Molecular Modeling and Dynamics Studies of Proteins. The Journal of Physical Chemistry B, 1998
- Optimal ensemble averaging of neural networks. Network: Computation in Neural Systems, 1997
- A Fast Flexible Docking Method using an Incremental Construction Algorithm. Journal of Molecular Biology, 1996
- Neural network ensembles. Published by Institute of Electrical and Electronics Engineers (IEEE), 1990