Beruflich Dokumente
Kultur Dokumente
Gunnar Rtsch
1
CCC Berlin, December 27, 2004
Roadmap
Motivating Examples
Some Background
Boosting & SVMs
Applications
2
CCC Berlin, December 27, 2004
From:
manfred@cse.ucsc.edu
Subject: ML Positions in Santa Cruz
Date: 4. December 2004 06:00:37 MEZ
LOTTERY COORDINATOR,
INTERNATIONAL PROMOTIONS/PRIZE AWARD DEPARTMENT.
SMARTBALL LOTTERY, UK.
DEAR WINNER,
WINNER OF HIGH STAKES DRAWS
Congratulations to you as we bring to your notice, the
results of the the end of year, HIGH STAKES DRAWS of
SMARTBALL LOTTERY UNITED KINGDOM. We are happy to inform you
that you have emerged a winner under the HIGH STAKES DRAWS
SECOND CATEGORY,which is part of our promotional draws. The
draws were held on15th DECEMBER 2004 and results are being
officially announced today. Participants were selected
through a computer ballot system drawn from 30,000
names/email addresses of individuals and companies from
Africa, America, Asia, Australia,Europe, Middle East, and
Oceania as part of our International Promotions Program.
3
CCC Berlin, December 27, 2004
Chemist
F
F
Inactives
F
N N
HO
Cl
HO
HO
N N
HO
N
N
H
O
O
N
S
O
Cl
OH
OH HN
OH
HO
OH
N
Cl
Actives
Gunnar Rtsch
4
CCC Berlin, December 27, 2004
Chemist
F
F
Inactives
F
N N
HO
Cl
HO
HO
N N
HO
N
N
H
O
O
N
S
O
Cl
OH
OH HN
OH
HO
OH
N
Cl
Actives
former CombiChem
technology
Gunnar Rtsch
5
CCC Berlin, December 27, 2004
Learning
F
F
Inactives
N N
HO
Cl
HO
HO
N N
HO
N
N
H
O
O
N
S
O
Cl
OH
OH HN
OH
HO
OH
N
Cl
Machine
Actives
former CombiChem
technology
Gunnar Rtsch
6
CCC Berlin, December 27, 2004
Gunnar Rtsch
7
CCC Berlin, December 27, 2004
8
CCC Berlin, December 27, 2004
Problem Formulation
Natural
+1
Plastic
-1
Natural
+1
Plastic
-1
The World:
Data
Unknown Target Function
Unknown Distribution
Objective
Problem:
Gunnar Rtsch
is unknown
9
CCC Berlin, December 27, 2004
Problem Formulation
Gunnar Rtsch
10
CCC Berlin, December 27, 2004
Gunnar Rtsch
11
CCC Berlin, December 27, 2004
Gunnar Rtsch
12
CCC Berlin, December 27, 2004
Gunnar Rtsch
13
CCC Berlin, December 27, 2004
Gunnar Rtsch
14
CCC Berlin, December 27, 2004
Gunnar Rtsch
15
CCC Berlin, December 27, 2004
Gunnar Rtsch
16
CCC Berlin, December 27, 2004
Gunnar Rtsch
17
CCC Berlin, December 27, 2004
Gunnar Rtsch
18
CCC Berlin, December 27, 2004
Gunnar Rtsch
19
CCC Berlin, December 27, 2004
Gunnar Rtsch
20
CCC Berlin, December 27, 2004
Gunnar Rtsch
21
CCC Berlin, December 27, 2004
Gunnar Rtsch
22
CCC Berlin, December 27, 2004
Boosting: decision
Gunnar Rtsch
23
CCC Berlin, December 27, 2004
AdaBoost Algorithm
Gunnar Rtsch
24
CCC Berlin, December 27, 2004
AdaBoost algorithm
Combination of
Decision stumps/trees
Neural networks
Heuristic rules
Further reading
http://www.boosting.org
http://www.mlss.cc
Gunnar Rtsch
25
CCC Berlin, December 27, 2004
property 2
Linear Separation
property 1
Gunnar Rtsch
26
CCC Berlin, December 27, 2004
Linear Separation
property 2
property 1
Gunnar Rtsch
27
CCC Berlin, December 27, 2004
property 2
property 2
property 1
m
ar
gi
n
property 1
28
CCC Berlin, December 27, 2004
(with
Use
)
for prediction
m
ar
gi
n
Solution:
Linear combination of examples
many s are zero
Support Vector Machines
Demo
Gunnar Rtsch
29
CCC Berlin, December 27, 2004
Kernel Trick
Linear in
input space
Gunnar Rtsch
Non-linear in Linear in
input space
feature space
30
CCC Berlin, December 27, 2004
Gunnar Rtsch
31
CCC Berlin, December 27, 2004
Gunnar Rtsch
32
CCC Berlin, December 27, 2004
Gunnar Rtsch
33
CCC Berlin, December 27, 2004
Gunnar Rtsch
34
CCC Berlin, December 27, 2004
Gunnar Rtsch
35
CCC Berlin, December 27, 2004
Gunnar Rtsch
36
CCC Berlin, December 27, 2004
Gunnar Rtsch
37
CCC Berlin, December 27, 2004
Many Applications
Gunnar Rtsch
38
CCC Berlin, December 27, 2004
MNIST Benchmark
39
CCC Berlin, December 27, 2004
Gunnar Rtsch
40
CCC Berlin, December 27, 2004
Face Detection
1.
Classifier
face
non-face
2. Search
2 ( l 1)
600
450
0
.
7
l =1
Gunnar Rtsch
525,820 patches
41
CCC Berlin, December 27, 2004
42
43
44
45
46
47
Neurophysics Group
Dept. of Neurology
Klinikum Benjamin
Franklin
Freie Universitt
Berlin, Germany
Gunnar Rtsch
Benjamin Blankertz
Klaus-Robert Mller
48
CCC Berlin, December 27, 2004
Gunnar Rtsch
49
CCC Berlin, December 27, 2004
50
CCC Berlin, December 27, 2004
51
CCC Berlin, December 27, 2004
Gunnar Rtsch
52
CCC Berlin, December 27, 2004
15
10
5
0
-5
-10
-15
LEFT
hand
(ch. C4)
15
10
5
0
-5
-10
-15
[V]
RIGHT
hand
(ch. C3)
15
10
5
0
-5
-10
-15
0 [ms]
15
10
5
0
-5
-10
-15
-500 -400 -300 -200 -100
Gunnar Rtsch
0 [ms]
0 [ms]
0 [ms]
53
BCI Setup
ACQUISITION modes:
- few single electrodes
- 32-128 channel electrode caps
- subdural macroelectrodes
- intracortical multi-single-units
EEG parameters:
- slow cortical potentials
- /_ amplitude modulations
- Bereitschafts-/motor-potential
TASK alternatives:
- feedback control
- imagined movements
- movement (preparation)
- mental state diversity
Gunnar Rtsch
54
CCC Berlin, December 27, 2004
Gunnar Rtsch
55
CCC Berlin, December 27, 2004
Gunnar Rtsch
56
CCC Berlin, December 27, 2004
GMD.SCAI
Gunnar Rtsch
Alexander Zien
Thomas Lengauer
GMD.FIRST
Institute for
Computer Architecture
and Software Technology
Gunnar Rtsch
Sebastian Mika
Bernhard Schlkopf
Klaus-Robert Mller
57
A (1,0,0,0,0)
C (0,1,0,0,0)
G (0,0,1,0,0)
T (0,0,0,1,0)
N (0,0,0,0,1)
(...,0,1,0,0,0,0,...)
58
CCC Berlin, December 27, 2004
Gunnar Rtsch
59
CCC Berlin, December 27, 2004
Learning
F
F
Inactives
N N
HO
Cl
HO
HO
N N
HO
N
N
H
O
O
N
S
O
Cl
OH
OH HN
OH
HO
OH
N
Cl
Machine
Actives
former CombiChem
technology
Gunnar Rtsch
60
CCC Berlin, December 27, 2004
few
Gunnar Rtsch
actives
more
inactives
plenty
untested
61
CCC Berlin, December 27, 2004
Shape/Feature Descriptor
bit =
Shape
Feature type
Feature location
bit number
254230
Shape/Feature Signature
Shape i
~105 bits
Shape j
62
CCC Berlin, December 27, 2004
Total number of
active examples
selected
after each batch
On Thrombin
dataset
Largest Selection Strategy
Gunnar Rtsch
63
CCC Berlin, December 27, 2004
Concluding Remarks
Computational Challenges
Algorithms can work with 100.000s of examples
(need
operations
Usually model parameters to be tuned
(cross-validation computationally expensive)
Need computer clusters and
Job scheduling systems (pbs, gridengine)
Often use MATLAB
(to be replaced by python: help!)
64
CCC Berlin, December 27, 2004
Gunnar Rtsch
65
CCC Berlin, December 27, 2004