Use of Recursion Forests in the Sequential Screening Process: Consensus Selection by Multiple Recursion Trees
- 1 May 2003
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences
- Vol. 43 (3) , 941-948
- https://doi.org/10.1021/ci034023j
Abstract
The application of Cheminformatics to High-Throughput Screening (HTS) data requires the use of robust modeling methods. Robust models must be able to accommodate false positive and false negative data yet retain good explanatory and predictive power. Recursive Partitioning has been shown to accommodate false positive and false negative data in the model building phase but suffers from a high false positive rate in the prediction phase, especially with sparse data sets such as HTS data. Here, we introduce Consensus Selection as a procedure to decrease the false positive rate of Recursive Partitioning-based models. Consensus Selection by Multiple Recursion Trees can increase the hit rate of a High-Throughput Screen in excess of 30-fold while significantly reducing the false positive rate relative to single Recursion Tree models.
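The general idea of consensus selection across several trees can be illustrated with a short Python sketch. It is not the authors' implementation: the abstract does not specify how the trees are grown or how votes are combined, so the Gini-based splitting, bootstrap resampling, random descriptor subsets, vote threshold, and all function names below are assumptions chosen for illustration, and the data are synthetic.

```python
import random

def gini(labels):
    """Gini impurity of a list of 0/1 activity labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def build_tree(X, y, features, depth=0, max_depth=3):
    """Recursive partitioning on binary descriptors (assumed 0/1 values)."""
    leaf = {"leaf": True, "active_rate": sum(y) / len(y) if y else 0.0}
    if depth >= max_depth or len(set(y)) <= 1:
        return leaf
    best = None
    for f in features:
        absent = [i for i, row in enumerate(X) if row[f] == 0]
        present = [i for i, row in enumerate(X) if row[f] == 1]
        if not absent or not present:
            continue  # this descriptor does not split the node
        score = (len(absent) * gini([y[i] for i in absent])
                 + len(present) * gini([y[i] for i in present])) / len(y)
        if best is None or score < best[0]:
            best = (score, f, absent, present)
    if best is None:
        return leaf
    _, f, absent, present = best
    return {
        "leaf": False,
        "feature": f,
        "absent": build_tree([X[i] for i in absent], [y[i] for i in absent],
                             features, depth + 1, max_depth),
        "present": build_tree([X[i] for i in present], [y[i] for i in present],
                              features, depth + 1, max_depth),
    }

def predicts_active(tree, row, cutoff=0.5):
    """Follow the splits to a leaf; call the compound active if the leaf is enriched."""
    while not tree["leaf"]:
        tree = tree["present"] if row[tree["feature"]] == 1 else tree["absent"]
    return tree["active_rate"] >= cutoff

def grow_forest(X, y, n_trees=15, n_features=3, seed=0):
    """Grow several trees, each on a bootstrap sample and a random descriptor subset."""
    rng = random.Random(seed)
    all_features = list(range(len(X[0])))
    forest = []
    for _ in range(n_trees):
        rows = [rng.randrange(len(X)) for _ in X]  # sample with replacement
        feats = rng.sample(all_features, min(n_features, len(all_features)))
        forest.append(build_tree([X[i] for i in rows], [y[i] for i in rows], feats))
    return forest

def consensus_select(forest, X, min_votes):
    """Return indices of compounds predicted active by at least min_votes trees."""
    selected = []
    for i, row in enumerate(X):
        votes = sum(predicts_active(tree, row) for tree in forest)
        if votes >= min_votes:
            selected.append(i)
    return selected

# Synthetic example: 200 compounds, 6 binary descriptors, noisy activity labels.
rng = random.Random(1)
X = [[rng.randint(0, 1) for _ in range(6)] for _ in range(200)]
y = [int(r[0] and r[1]) ^ int(rng.random() < 0.05) for r in X]

forest = grow_forest(X, y)
single_tree_picks = consensus_select(forest[:1], X, min_votes=1)
consensus_picks = consensus_select(forest, X, min_votes=10)
print(len(single_tree_picks), len(consensus_picks))
```

Raising `min_votes` can only shrink the selected set, so a stricter consensus trades fewer selections for fewer false positives. How that trade-off plays out on real HTS data depends on the descriptors and tree settings, which this sketch does not attempt to reproduce.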