Prediction of rodent carcinogenicity bioassays from molecular structure using inductive logic programming.
Open Access
- 1 October 1996
- journal article
- Published by Environmental Health Perspectives in Environmental Health Perspectives
- Vol. 104 (suppl 5) , 1031-1040
- https://doi.org/10.1289/ehp.96104s51031
Abstract
The machine learning program Progol was applied to the problem of forming the structure-activity relationship (SAR) for a set of compounds tested for carcinogenicity in rodent bioassays by the U.S. National Toxicology Program (NTP). Progol is the first inductive logic programming (ILP) algorithm to use a fully relational method for describing chemical structure in SARs, based on using atoms and their bond connectivities. Progol is well suited to forming SARs for carcinogenicity as it is designed to produce easily understandable rules (structural alerts) for sets of noncongeneric compounds. The Progol SAR method was tested by prediction of a set of compounds that have been widely predicted by other SAR methods (the compounds used in the NTP's first round of carcinogenesis predictions). For these compounds no method (human or machine) was significantly more accurate than Progol. Progol was the most accurate method that did not use data from biological tests on rodents (however, the difference in accuracy is not significant). The Progol predictions were based solely on chemical structure and the results of tests for Salmonella mutagenicity. Using the full NTP database, the prediction accuracy of Progol was estimated to be 63% (+/- 3%) using 5-fold cross validation. A set of structural alerts for carcinogenesis was automatically generated and the chemical rationale for them investigated- these structural alerts are statistically independent of the Salmonella mutagenicity. Carcinogenicity is predicted for the compounds used in the NTP's second round of carcinogenesis predictions. The results for prediction of carcinogenesis, taken together with the previous successful applications of predicting mutagenicity in nitroaromatic compounds, and inhibition of angiogenesis by suramin analogues, show that Progol has a role to play in understanding the SARs of cancer-related compounds.Keywords
This publication has 29 references indexed in Scilit:
- Application of SAR methods to non-congeneric data bases assocated with carcinogenicity and mutagenicity: Issues and approachsMutation Research - Fundamental and Molecular Mechanisms of Mutagenesis, 1994
- Two million rodent carcinogens? The role of SAR and QSAR in their detectionMutation Research - Fundamental and Molecular Mechanisms of Mutagenesis, 1994
- The influence of chemical structure on the extent and sites of carcinogenesis for 522 rodent carcinogens and 55 different human carcinogen exposuresMutation Research - Fundamental and Molecular Mechanisms of Mutagenesis, 1993
- Prospective ke screening of potential carcinogens being tested in rodent bioassays by the US National Toxicology ProgramMutagenesis, 1992
- Long-term chemical carcinogenesis experiments for identifying potential human cancer hazards: collective database of the National Cancer Institute and National Toxicology Program (1976-1991).Environmental Health Perspectives, 1991
- QSAR prediction of rodent carcinogenicity for a set of chemicals currently bioassayed by the US National Toxicology ProgramMutagenesis, 1991
- On the rodent bioassays currently being conducted on 44 chemicals: a RASH analysis to predict test results from the National Toxicology ProgramMutagenesis, 1991
- A prospective toxicity evaluation (COMPACT) on 40 chemicals currently being tested by the National Toxicology ProgramMutagenesis, 1990
- Prediction of the carcinogenicity in rodents of chemicals currently being tested by the US National Toxicology Program: structure-activity correlationsMutagenesis, 1990
- Prediction of probability of carcinogenicity for a set of ongoing NTP bioassaysMutagenesis, 1990