Beruflich Dokumente
Kultur Dokumente
Lead generation:
Natural ligand / Screening
Biological Testing
Pre-Clinical Studies
Finding lead compound
A lead compound is a small molecule that serves as the
starting point for an optimization involving many small
molecules that are closely related in structure to the lead
compound
Many organizations maintain databases of chemical
compounds
Some of these are publically accessible others are
proprietary
Databases contain an extremely large number of compounds
(ACS data bases contains 10 million compounds)
3D databases have information about chemical and
geometrical features
Hydrogen bond donors
Hydrogen bond acceptors
Positive Charge Centers
Aromatic ring centers
Hydrophobic centers
Finding lead compound
There are two approaches to this
problem
A computer program AutoDock (or
similar version Affinity (accelrys)) can
be used to search a database by
generating fit between molecule
and the receptor
Alternatively one can search 3D
pharmacophore
Structure based drug design
Drug design and development
Structure based drug design exploits the
3D structure of the target or a
pharmacophore
Find a molecule which would be expected to
interact with the receptor. (Searching a data
base)
Design entirely a new molecule from
SCRATCH (de novo drug/ligand design)
In this context bioinformatics and
chemoinformatics play a crucial role
Structure-based Drug Design (SBDD)
Structure Analysis
Biological Testing
and Compound Design
If promising
rational drug design
X-Ray structural
determination of native
protein
X-Ray structural
determination of
inhibitor complex
Drug
Structure Based Drug Design have the potential to shave off years
and millions of dollars
Working at the intersection
Structural Biology
Biochemistry
Medicinal Chemistry
Toxicology
Pharmacology
Biophysical Chemistry
Natural Products Chemistry
Chemical Ecology
Information Technology
Molecular docking-definition
It is a process by which two molecules
are put together in 3 Dimension
Termination criteria:
accepted moves
rejected moves
cycles
Genetic function algorithm
Start with a random population (50-200)
Perform Crossover (Sex, two parents ->
2 children) and Mutation (Cosmic rays,
one individual gives 1 mutant child)
Compute fitness of each individual
Proportional Selection & Elitism
New Generation begins if total energy
evals or maximum generations reached
Search parameters
Population size
Crossover rate
Mutation rate
Local search
energy evals
Termination criteria
energy evals
generations
Dimensionality of molecular
docking
Degrees of Freedom (DOF)
Position or Translation
(x,y,z) = 3
Orientation or Quaternion
(qx, qy, qz, qw) = 4
Rotatable Bonds or Torsions
(tor1, tor2, torn) = n
Total DOF, or Dimensionality,
D=3+4+n
Docking score
DGbinding = DGvdW + DGelec + DGhbond +
DGdesolv + DGtors
DGvdW
12-6 Lennard-Jones potential
DGelec
Coulombic with Solmajer-dielectric
DGhbond
12-10 Potential with Goodford Directionality
DGdesolv
Stouten Pairwise Atomic Solvation Parameters
DGtors
Number of rotatable bonds
Molecular mechanics: theory
Considering the simple
harmonic approximation, the
potential energy of molecules
is given by
V= VBond+ VAngle + VTorsion + Vvdw
+ Velec+ Vop
VBond = 1/2Kr (rij-r0)2
Where Kr is the stretching
force constant
VAngle =1/2K (ijk-0)2
Where K is the bending force
constant
VTorsion =V/2 (1+ Cos n(+0))
Where V is the barrier to
rotation, is torsional angle
Molecular mechanics: Theory
Lennard-Jones type of 6-12 potential is
used to describe non-bonded and weak
interaction
Vvdw= (Aij/rij12-Bij/rij6)
Simple Columbic potential is used to
describe electrostatic interaction
Velec=(qiqj/rij)
Out of plane bending/deformation is
described by the following expression
Vop= 0.5 Kop 2
The forcefield
The purpose of a forcefield is to describe the
potential energy surface of entire classes of
molecules with reasonable accuracy
In a sense, the forcefield extrapolates from the
empirical data of the small set of models used
to parameterize it, a larger set of related
models
Some forcefields aim for high accuracy for a
limited set of elements, thus enabling good
predictions of many molecular properties
Others aim for the broadest possible coverage
of the periodic table, with necessarily lower
accuracy
Components of a forcefield
The forcefield contains all the necessary elements
for calculations of energy and force:
A list of forcefield types
A list of partial charges
Forcefield-typing rules
Functional forms for the components of the
energy expression
Parameters for the function terms
For some forcefields, rules for generating
parameters that have not been explicitly defined
For some forcefields, a way of assigning
functional forms and parameters
The energy expression
Valence interactions
The energy of valence interactions is generally
accounted for by diagonal terms:
bond stretching (bond)
valence angle bending (angle)
dihedral angle torsion (torsion)
inversion, also called out-of-plane interactions
(oop) terms, which are part of nearly all
forcefields for covalent systems
A Urey-Bradley (UB) term may be used to
account for interactions between atom pairs
involved in 1-3 configurations (i.e., atoms bound
to a common atom)
Evalence=Ebond + Eangle + Etorsion + Eoop + EUB
Non-bond interactions
Vi = Sk(energies of
interactions between i
and all other residues k
located within a cutoff
distance of Rc from i)
Classical molecular dynamics
Constituent molecules obey
classical laws of motion
In MD simulation, we have to
solve Newton's equation of
motion
Force calculation is the time
consuming part of the
simulation
MD simulation can be
performed in various ensembles
NVT, NPT and NVE are the
ensembles widely used in the
MD simulations
Both quantum and classical
potentials can be used to
perform MD simulation
Calculation of interaction energy
MM total energy can be used to get
interaction energy of the ligands with
biomolecules
In order to compute the interaction
energy, calculations have to be
performed for the biomolecule,
ligands and the biomolecule-ligand
adduct using the same force field
Eint= Ecomplex - {Ebiomolecule+Eligand}
Integration of equation of motion
and time step
A key parameter in the integration algorithm is
the integration time step
The time step is related to molecular vibration
The main limitation imposed by the highest-
frequency motion
The vibrational period must be split into at
least 8-10 segments for models to satisfy the
Verlet algorithm that the velocities and
accelerations are constant over time step used
In most organic models, the highest vibrational
frequency is that of C-H stretching, whose
period is of the order of 10-14 s (10fs). Therefore
integration step should be 0.5-1 fs
Stages and duration in MD
simulation
Dynamics simulations are usually carried out in two
stages, equilibration and data collection
The purpose of the equilibration is to prepare the
system so that it comes to the most probable
configuration consistent with the target temperature
and pressure
For large system, the equilibration takes long time
because of the vast conformational space it has to
search
The best way to judge whether a model has
equilibrated is to plot various thermodynamic
quantities such as energy, temperature, pressure
versus time
When equilibrated, the system fluctuate around their
average
Durations of some real
molecular events
Event Approximate duration
Water reorientation 4 ps
3.5-6.5 3.5-6.5
Aspargine of
T.Helix and
gallic acid
Aspartic acid of
T.Helix and
catechin
Lysine of T.Helix
and
epigallocateching
allate
Binding energies different complexes
between polyphenols and triple helix
Binding Energy (Kcal/mol)
Binding Sites in Epigallocatechi
triple helix Gallic acid Pentagalloyl
Catechin (Cat) ngallate
(Gal) glucose (PGG)
(EGCG)
9th residue Ser
of C-chain (2) 16.5 22.5 35.2 56.6