Beruflich Dokumente
Kultur Dokumente
3D-QSAR publications
250
229
203
194 195
200 186
176 180
168
158
150
123
106
99
100
76
60 58
50
29
20
1 2 4
0
88
89
90
91
92
93
94
95
96
97
98
99
00
01
02
03
04
05
06
07
19
19
19
19
19
19
19
19
19
19
19
19
20
20
20
20
20
20
20
20
1
3D-QSAR Methods in Drug Design
• Introduction
• Theoretical Background - 3D-QSAR
– Training Set
– Ligand Alignment
– Molecular Field Calculation
– Internal Validation – Crossvalidation
– External Prediction – Interpretation
• Case Studies
• Conclusions and Recommendations
Introduction
Pharmacophores
Data base Similarity Structure-based
filtering analysis QSAR
design
2-4 years
Target Biological test HTS HTS hits Chemistry Target structure Drug
selection development confirmed start determined candidate
2
Intention – QSAR
Statistical Tools
Biological Data Molecular Properties
Ki MEP
IC50 MLP
MIC Volume
Permeation log P
… …
biol. activity = const. + (c1 P1) + (c2 P2) + (c3 P3) + ...
3
QSAR
PLS Analysis
• PLS analysis:
Belongs to the family of PCA (principal component
analysis) techniques and is used as standard method
within 3D-QSAR
4
Crossvalidated PLS analysis
Leave-one-out Leave-several-out
Original Groups of
Table crossvalidation
SDEP
Derivation
of a model Differences
Compounds Predicted
excluded Activity
Measured
Prediction of excluded Activity
compounds
∑(y predicted
experiment − y) 2
0.50
0.00 = No Model!
5
3D-QSAR Methods
• Different techniques
– Molecular-shape analysis
– Hypothetical Active Site Lattice
(HASL)
– Comparative Molecular Field
Analysis (CoMFA, CoMSiA,
GRID/GOLPE …)
– ALMOND, GRIND
– QUASAR
– AFMoC
CoMFA Approach
Compound Number Biol. Activity Steric Electrostatic Steric Electrostatic Steric Electrostatic Steric Electrostatic
Interaction Interaction Interaction Interaction Interaction Interaction Interaction Interaction ...
S001 E001 S002 E002 S003 E003 S004 S004
1 1.07
2 0.09
3 0.66
4 1.42
5 -0.62
6 0.64
7 -0.46
6
3D-QSAR
Selection
Training Set
Interpretation
Ligand Alignment Graphical Representation
Calculation Statistical
Molecular Fields Analyses
QSAR - Setup
7
Selection Training Set
CN CN
CN N
N N N
O N N
R Y X
Y X O
R R O
CN Set 3 : 18 molecules
N N Set 2 : 7 molecules
O O
O
R
X R'
Set 1 : 16 molecules
R
O O O Y R
Set 6 : 22 molecules
N
OR´
N N N
O X O
R R'
Y
O
R Set 5 :12 molecules
Set 4 : 3 molecules
O
Set 7 : 21 molecules
8
Selection Training Set
10
Sets 4 und 7:
9 not enough active (7)
or inactive (4)
8
compounds
pIC 50
7
Sets 1, 2, 3 und 5:
6 Poor distribution of
biol. activities
5
4 Set 6:
3 Broad range and
0 1 2 3 4 5 6 7 8 relatively well
distributed biol.
Sets activities
The model using only the steric field shows the best statistical
results (q2, LOO cross validation)
9
Selection Training Set
10
Selection Training Set
• • Set 4
pred.
•
„Beware of q2!“
q2 0.85
•• •
••• •
expt.
S
11
3D-QSAR
Selection
Training Set
Interpretation
Ligand Alignment Graphical Representation
Calculation Statistical
Molecular Fields Analyses
Ligand Alignment
N
HO N
HO
N
S N
O H
N
H
How to align?
N
N
F
H
N N N
N
N N
O N N H
H
12
Ligand Alignment
3D-QSAR
Selection
Training Set
Interpretation
Ligand Alignment Graphical Representation
Calculation Statistical
Molecular Fields Analyses
13
Calculation Molecular Fields
Energy, kcal/mol
0.4
1.5
0.2
1 0
-0.2 0 1 2 3 4
0.5 -0.4
0 -0.6
0 1 2 3 4 -0.8
-0.5 -1
Nonbonded Internuclear Distance Internuclear Distance
3D-QSAR
Selection
Training Set
Interpretation
Ligand Alignment Graphical Representation
Calculation Statistical
Molecular Fields Analyses
14
Statistical Parameters
• Crossvalidation:
q 2
= 1−
∑(y obs − y pred ) 2
– Crossvalidated Correlation Coeff., q2. ∑(y obs − y) 2
– Optimal Number of Components.
ypred predicted value
– SDEP (Standard Deviation of Error yobs observed value
Prediction) y mean
( y pred − y obs ) 2
• Final PLS Model:
– Correlation Coefficient, r2.
SDEP = ∑ N
– Standard Deviation, s.
– F values. SDEP standard dev. of error prediction
yobs predicted value
ypred observed value
N number of ligands
r 2 = 1−
∑(y calc − yobs ) 2
∑(y obs − y) 2
0,8 1,2
1
0,6
0,8
r^2 final
Q^2
0,4 0,6
0,2 0,4
0,2
0
0
0 2 4 6 8 10 12
0 2 4 6 8 10 12
Number of components Number of components
LOO
15
3D-QSAR
Selection
Training Set
Interpretation
Ligand Alignment Graphical Representation
Calculation Statistical
Molecular Fields Analyses
Graphical Representation
16
Case Study 1
Ligand-based 3D-QSAR
5-HT1F Receptor Agonists
N N
HO HO N
N
S N
N
O N N N N
H H H
N N
H F F
HO N H H
N N N N
S
O O N O N N
N N
H H H
N
N N
H
H H N
N O
O
O N
O N N N
H H H
17
Alignment Procedure – 5-HT1F Agonists
LY306528 (R)
Template molecule 1
Template molecule 2 Ligand Alignment
Template molecule 3
18
Statistical Results
LOO Crossvalidation
Predicted pKi
q2LOO = 0.94
SDEP = 0.25
n = 21
Experimental pKi
q2 SDEP
19
Case Study 2
Receptor-based 3D-QSAR
Acetylcholinesterase (AChE)
Inhibitors
AChE Inhibitors
Biological data:
NH2 (CH2)n R
N N - IC50 values fromaAChE
aTorp. Californica
- 42 Inhibitors
X Y CH2
- pIC50 3.1 - 7.6
N N
- competitive inhibitors
H
R N
N N
20
Docking Validation
AutoDock X-ray
3D-QSAR - Setup
Analyses
Analysesof
ofknown
knownX-ray
X-raystructures
structures
Docking
Docking
AutoDock
AutoDock
Validation
Validation
GRID
GRID InteractionFields
Interaction Fields
Receptor-based
Receptor-basedAlignment
Alignment
3D-QSAR
3D-QSARAnalysis
Analysis
21
Receptor-based Alignment
Blue – hydrophilic
Brown - hydrophobic
Donepezil
Aminopyridazine
22
3D-QSAR Model
GRID/GOLPE - www.moldiscovery.com
pIC50 calculated
pIC50 experimental
3D-QSAR Model
GRID/GOLPE - www.moldiscovery.com
Cross-validation:
Leave-50%-out
pIC50 predicted
pIC50 experimental
23
Graphical Representation
GRID/GOLPE
PLS Felder Wasser Sonde
N N
N N
N N N N
H H
N N
N N
N N N N
H H
N N N
H
N N O
N N N N
H H
24
AFMoC
Conclusions
25
Recommendations for CoMFA Analyses
Book to read
Novel textbook
Student Edition
26