Inhibition and substrate recognition – a computational approach applied to HIV protease

Abstract
We have developed a computational approach in which an inhibitor's strength is determined from its interaction energy with a limited set of amino acid residues of the inhibited protein. We applied this method to HIV protease. The method uses a consensus structure built from X-ray crystallographic data. All inhibitors are docked into the consensus structure. Given that not every ligand–protein interaction causes inhibition, we implemented a genetic algorithm to determine the relevant set of residues. The algorithm optimizes the q2 between the sum of interaction energies and the observed inhibition constants. The best possible predictive model resulting has a q2 of 0.63. External validation by examining the predictivity for compounds not used in derivation of the model leads to a prediction accuracy between 0.9 and 1.5 log10 unit. Out of 198 residues in the whole protein, the best internally predictive model defines a subset of 20 residues and the best externally predictive model one of 9 residues. These residues are distributed over the subsites of the enzyme. This approach provides insight in which interactions are important for inhibiting HIV protease and it allows for quantitative prediction of inhibitor strength.