Ligand- and Structure-Based Virtual Screening 157
of this ensemble approach is the ability to search a wider conformational space that
includes backbone and sidechain variation. The on-the-fly approaches may include
protein sidechain rotamers [45,60], sidechains and user-defined loops [49,61], or
induced fit using protein structure prediction [62]. The latter approach, while accurate,
is not currently fast enough for virtual screening.
After calculating a ligand pose, the last step in docking is to evaluate it with some
sort of scoring function. The original scoring function from UCSF DOCK was a
shape-based contact score. Later, force-field-based functions were introduced. These
functions took the Lennard–Jones and electrostatic parameters from force fields such
as AMBER [63] or CHARMM [64,65]. To save computational time by turning an
O(N
2
) calculation to O(N), a grid was precalculated for the sum of the receptor
potential at each point in space. To generate this grid, a geometric mean approxi-
mation (A
ij
=
√
A
ii
√
A
jj
) was made to the van der Waals portion of the force field
[66]. Eventually, solvation and entropy terms were added to many force-field-based
scoring functions, typically using DELPHI [67] or ZAP [68] for solvation [69,70].As
many factors known to be important in the free energy of ligand binding are missing
in force-field scores, many programs chose instead to derive an empirical scoring
using several intuitive parameters such as hydrophobicity, solvation, metal-binding,
or the number of number rotatable bonds, along with van der Waals and electrostatic
energy terms. Empirical functions were derived from a least-squares fit of the param-
eters to ligand–protein systems with known crystal structures and known binding
energies [42,45,52,71–73]. Again these scoring functions are usually calculated on
a grid for computational speed. The advantage of starting with a force-field-based
method is that it is applicable to a wide range of ligands. The empirical scoring
schemes work well when the ligands and receptors resemble the training set. Another
approach was to use a knowledge-based function, derived from a statistical analysis
of ligand atom/protein atom contact frequencies and distances in a database of crystal
structures [74–76]. As each of these scoring approaches were shown to work well
in different cases, many programs started to create “consensus” functions combining
several different scoring schemes, which were shown to be more predictive than any
single scoring scheme [77]. After primary scoring, several approaches “rescore” top
hits. For example, the PostDOCK filter [78] was derived from a supervised machine
learning study on protein/ligand structures in the Protein Data Bank [79], and it was
shown to improve enrichment by as much as 19-fold. Short molecular dynamics runs
using implicit waters, implemented as MM-PBSA (for Poisson–Boltzmann solva-
tion) or MM-GBSA (Generalized Born), were also shown to improve enrichment
rates [80,81].
Many other factors to improve the quality of structure-based affinity predictions
have been addressed including waters, metals, and protonation states of the receptor
protein (see Refs. [82,83] for a detailed review). Additional screens have been devel-
oped to identify lead compounds that not only showstrong binding affinity to the target
receptor, but also have good pharmacological properties. Lipinski’s Rule of Five [84],
which uses a set of property heuristics such as molecular weight, hydrogen bonds, and
so on that match the range in the majority of known orally absorbed drugs, has become
a standard for prescreening ligands prior to docking for “drug-like” properties. Filters
have been developed to remove compounds known to be promiscuous binders (i.e.,