Use of Recursion Forests in the Sequential Screening Process: Consensus Selection by Multiple Recursion Trees

1 May 2003

journal article
research article
Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences

Vol. 43 (3) , 941-948
https://doi.org/10.1021/ci034023j

Abstract

The application of Cheminformatics to High-Throughput Screening (HTS) data requires the use of robust modeling methods. Robust models must be able to accommodate false positive and false negative data yet retain good explanatory and predictive power. Recursive Partitioning has been shown to accommodate false positive and false negative data in the model-building phase but suffers from a high false positive rate in the prediction phase, especially with sparse data sets such as HTS data. Here, we introduce Consensus Selection as a procedure to decrease the false positive rate of Recursive Partitioning-based models. Consensus Selection by Multiple Recursion Trees can increase the hit rate of a High-Throughput Screen by more than 30-fold while significantly reducing the false positive rate relative to single Recursion Tree models.
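
The idea of consensus selection across several trees can be illustrated with a minimal sketch. The abstract does not state how the individual trees are combined, so the vote threshold (`min_votes`), the function names, and the toy compound IDs below are all assumptions made for illustration, not the authors' procedure.

```python
from typing import Dict, List, Set


def consensus_select(tree_predictions: List[Set[str]], min_votes: int) -> Set[str]:
    """Return compound IDs predicted active by at least `min_votes` trees.

    Each element of `tree_predictions` is the set of compounds that one
    (hypothetical) recursion tree predicts to be active.
    """
    votes: Dict[str, int] = {}
    for predicted_active in tree_predictions:
        for compound_id in predicted_active:
            votes[compound_id] = votes.get(compound_id, 0) + 1
    return {cid for cid, n in votes.items() if n >= min_votes}


def hit_rate(selected: Set[str], true_actives: Set[str]) -> float:
    """Fraction of selected compounds that are truly active."""
    if not selected:
        return 0.0
    return len(selected & true_actives) / len(selected)


# Toy data (invented): three trees, each predicting a set of actives.
tree_predictions = [
    {"c1", "c2", "c3", "c7"},
    {"c1", "c2", "c5"},
    {"c1", "c3", "c8"},
]
true_actives = {"c1", "c2"}

single_tree = tree_predictions[0]
consensus = consensus_select(tree_predictions, min_votes=2)

print("single-tree hit rate:", hit_rate(single_tree, true_actives))
print("consensus hit rate:  ", hit_rate(consensus, true_actives))
print("single-tree false positives:", len(single_tree - true_actives))
print("consensus false positives:  ", len(consensus - true_actives))
```

In this toy case, requiring agreement between at least two trees drops compounds that only one tree flags, which is one plausible way a consensus step could lower the false positive rate relative to a single tree.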
