
CMSC498L

Introduction to Deep Learning


Abhinav Shrivastava
Labels and Losses
Credit: blog
What about these?

source
What about images?
Input: 100x100x3 pixels
Output: Cat

Input: 100x100x3 pixels
Output: Dog

Credit: Krähenbühl, source: cat, dog

How can we make machines learn this?
Input: 100x100x3 pixels
Output: Cat

Input: 100x100x3 pixels
Output: Dog

Credit: Krähenbühl, source: cat, dog

How can we make machines learn this?
Input Data x^(i): x^(1), x^(2), x^(3), x^(4)

Credit: Krähenbühl, source: google image search


How can we make machines learn this?
Input Data x^(i)    Output Label y^(i)

x^(1)   Cat     y^(1)
x^(2)   Dog     y^(2)
x^(3)   Sheep   y^(3)
x^(4)   Pony    y^(4)

Credit: Krähenbühl, source: google image search


How can we make machines learn this?
Input Data x^(i)    Model f(x, θ)    Output Label y^(i)

x^(1)   Cat     y^(1)
x^(2)   Dog     y^(2)
x^(3)   Sheep   y^(3)
x^(4)   Pony    y^(4)

Credit: Krähenbühl, source: google image search


Model: Deep Neural Network

f(x, θ), where θ are the Model Parameters or Weights

Input → ReLU → ReLU → ReLU → ReLU → ReLU → Output
(Layers of computation)

Credit: Krähenbühl
Training Model Weights
Input Data x^(i)    Model f(x, θ)    Output Label y^(i)

x^(1)   Cat     y^(1)
x^(2)   Dog     y^(2)
x^(3)   Sheep   y^(3)
x^(4)   Pony    y^(4)

Credit: Krähenbühl, source: google image search


Training Model Weights

Data x^(i) → f(x, θ) → Output ŷ^(i): Dog    Label y^(i): Cat

Loss: provide signal to improve

Credit: Krähenbühl
Loss Functions

Data x^(i) → f(x, θ) → Output ŷ^(i): Dog    Label y^(i): Cat

Loss: provide signal to improve
• Zero for correct label: ŷ^(i) == y^(i)
• >0 for incorrect label: ŷ^(i) != y^(i)
• Monotonically increasing
• Important for gradient-based optimization

Credit: Krähenbühl
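These three properties can be checked on a toy loss. A minimal sketch (my own example, not from the deck) using squared error on a scalar:

```python
# Hypothetical illustration: squared error satisfies the three loss
# properties above (zero when prediction == label, positive otherwise,
# and growing monotonically as the prediction moves away from the label).
def squared_error(y_hat, y):
    return (y_hat - y) ** 2

assert squared_error(3.0, 3.0) == 0.0                      # zero for correct output
assert squared_error(2.0, 3.0) > 0.0                       # positive for wrong output
assert squared_error(0.0, 3.0) > squared_error(2.0, 3.0)   # increases with error
```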
Labels and Losses
Labels

Credit: source
Types of Labels
data

Credit: Krähenbühl, Source: Pac-Man fans


Types of Labels
(data column: Pac-Man sprite images)

dangerous  class    finer-class   points   actions
good       pac-man  pac-man         0.0    -
good       bonus    dot bonus       1.0    +points
bad        bad guy  clyde          -1.5    die
bad        bad guy  blinky         -1.9    die
good       bonus    cherry        100.0    +points, all disabled
good       bonus    strawberry    105.0    +points, all disabled
good       bonus    apple         300.2    +points, inky disabled
bad        bad guy  inky          -10.5    die
Credit: Krähenbühl, Source: Pac-Man fans
Types of Labels
continuous / discrete

continuous → regression
  examples: weight, age, location, velocity, distance, score, price

discrete → how many classes?
  2  → binary classification
       examples: dog vs. cat, danger, alive, good vs. bad, class vs. not-class
  >2 → multi-class classification
       examples: semantic classes, actions, etc.
Types of Labels
*structured, rewards, etc.

Image → Word
Sentence → Parse tree
2D/3D Pose Estimation

Regression
data → f(x, θ) → points

  0.0
  1.0
 -1.5
 -1.9
100.0
105.0
300.2
-10.5
Credit: Krähenbühl, Source: Pac-Man fans
Regression

Data x^(i) → f(x, θ) → Output ŷ^(i)    Label y^(i)

• Continuous label y^(i)
• Continuous network output ŷ^(i)

Credit: Krähenbühl
Regression Loss

Data x^(i) → f(x, θ) → Output ŷ^(i)    Label y^(i)    Loss

• Continuous label y^(i)
• Continuous network output ŷ^(i)
• Loss
  • L1 / MAE: |f(x^(i), θ) − y^(i)|
  • L2 / MSE / Least Squares: (f(x^(i), θ) − y^(i))^2

Credit: Krähenbühl
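The two regression losses can be sketched directly from the formulas above (a minimal single-example version, my own code rather than the course's):

```python
# L1 / MAE and L2 / MSE for one example, given the model output y_hat
# = f(x; theta) and the continuous label y.
def l1_loss(y_hat, y):
    return abs(y_hat - y)       # |f(x; theta) - y|

def l2_loss(y_hat, y):
    return (y_hat - y) ** 2     # (f(x; theta) - y)^2

# e.g. predicting 1.5 points when the label is 1.0
assert l1_loss(1.5, 1.0) == 0.5
assert l2_loss(1.5, 1.0) == 0.25
```

L2 penalizes large errors more heavily than L1, which is why outliers pull a least-squares fit harder.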
Multi-class Classification
data → f(x, θ) → multi-class label

pac-man
dot bonus
clyde
blinky
cherry
strawberry
apple
inky
Credit: Krähenbühl, Source: Pac-Man fans
Multi-class Classification
• Discrete label y^(i)
• Continuous network output ŷ^(i)

pac-man
blinky
cherry
strawberry
…

Credit: Krähenbühl
Multi-class Classification via Regression
• Discrete label y^(i)
• Continuous network output ŷ^(i)

f(x, θ) → 1. blinky

0. pac-man
1. blinky
2. cherry
3. strawberry

Credit: Krähenbühl
Multi-class Classification via Regression + 1-hot
• Discrete label y^(i)
• Continuous network output ŷ^(i)

f(x, θ) →
[1 0 0 0] pac-man
[0 1 0 0] blinky
[0 0 1 0] cherry
[0 0 0 1] strawberry

Credit: Krähenbühl
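The 1-hot encoding above is easy to sketch (my own minimal version, using the slide's four classes):

```python
def one_hot(index, num_classes):
    """Return a one-hot vector with a 1 at position `index`."""
    v = [0.0] * num_classes
    v[index] = 1.0
    return v

classes = ["pac-man", "blinky", "cherry", "strawberry"]
assert one_hot(classes.index("blinky"), len(classes)) == [0.0, 1.0, 0.0, 0.0]
```

Unlike regressing to a class index, the 1-hot target imposes no artificial ordering between classes.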
Multi-class Classification
• Discrete label y^(i)
• Regress to class probability P(c = y^(i))
• Continuous network output ŷ^(i)
• One continuous output per class, e.g. P(c = blinky)

f(x, θ) →
P(pac-man) = 0.5
P(blinky) = 0
P(cherry) = 0.5
P(strawberry) = 0

Credit: Krähenbühl
Multi-class Classification – Softmax
• Probability: positive, sums to 1

f(x, θ) → z → exp → norm → P

z =  10.18 → P(pac-man)
z =  12.91 → P(blinky)
z = -12.38 → P(cherry)
z =  18.19 → P(strawberry)

Credit: Krähenbühl
Multi-class Classification – Softmax
• Probability: positive, sums to 1

P(i) = exp(z_i) / Σ_j exp(z_j)

Credit: Krähenbühl
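A minimal sketch of the softmax formula, applied to the z values from the slide (my own code; the max-subtraction is the standard numerical-stability trick and does not change the result):

```python
import math

def softmax(z):
    # P(i) = exp(z_i) / sum_j exp(z_j); subtract max(z) first to avoid
    # overflow in exp for large scores.
    m = max(z)
    exps = [math.exp(zi - m) for zi in z]
    s = sum(exps)
    return [e / s for e in exps]

z = [10.18, 12.91, -12.38, 18.19]   # scores from the slide
p = softmax(z)
assert all(pi > 0 for pi in p)       # positive
assert abs(sum(p) - 1.0) < 1e-9      # sums to 1
assert max(p) == p[3]                # strawberry has the largest score
```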
Multi-class Classification – Softmax Loss
• Maximum likelihood
• Minimize negative log-probability (NLL)
• Cross entropy
• Softmax-loss: −log P(y)
• log is numerically stable

Credit: Krähenbühl; refer to blog
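The softmax loss is just the negative log-probability of the correct class. A minimal sketch (my own code, operating on already-normalized probabilities):

```python
import math

def nll_loss(probs, label_index):
    # -log P(y): small when the model is confident and correct,
    # large when the correct class gets low probability.
    return -math.log(probs[label_index])

# confident and correct -> smaller loss than confident and wrong
assert nll_loss([0.9, 0.05, 0.05], 0) < nll_loss([0.1, 0.8, 0.1], 0)
assert abs(nll_loss([1.0, 0.0, 0.0], 0)) < 1e-12   # zero loss at P(y) = 1
```

In practice libraries fuse the softmax and the log into one numerically stable log-sum-exp step rather than composing them naively.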



Binary Classification
data → f(x, θ) → binary label

good
good
bad
bad
good
good
good
bad
Credit: Krähenbühl, Source: Pac-Man fans
Binary Classification
• Discrete label y^(i)
• Regress to class probability P(c = y^(i))
• Continuous network output ŷ^(i)
• Single output P(y = class)
  • P(bad) = 1 − P(good)
  • Positive: class 1
  • Negative: class 0

Credit: Krähenbühl
Binary Classification – Sigmoid
• Probability: positive, sums to 1

P(y = 0) = 1 / (1 + exp(z))
P(y = 1) = 1 / (1 + exp(−z))

Credit: Krähenbühl
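A minimal sketch of the sigmoid above (my own code):

```python
import math

def sigmoid(z):
    # P(y = 1) = 1 / (1 + exp(-z))
    return 1.0 / (1.0 + math.exp(-z))

assert abs(sigmoid(0.0) - 0.5) < 1e-12                  # undecided at z = 0
assert abs(sigmoid(3.0) + sigmoid(-3.0) - 1.0) < 1e-12  # P(y=1) + P(y=0) = 1
```

The second assertion is exactly the "sums to 1" property: P(y = 0) is the sigmoid of −z.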
Binary Classification – Loss
• Maximum likelihood
• Minimize negative log-probability (NLL)
• Cross entropy
• Softmax-loss: −log P(y)
• log is numerically stable

Credit: Krähenbühl
Sigmoid is a special case of Softmax
• How?
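One way to see it (my own sketch, not the deck's answer): a 2-class softmax over scores [z, 0] is exactly the sigmoid of z, since exp(z) / (exp(z) + exp(0)) = 1 / (1 + exp(−z)).

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def softmax2(z0, z1):
    # 2-class softmax over the pair of scores (z0, z1)
    e0, e1 = math.exp(z0), math.exp(z1)
    return e0 / (e0 + e1), e1 / (e0 + e1)

# fixing the second class's score at 0 recovers the sigmoid
for z in [-2.0, 0.0, 0.7, 5.0]:
    p1, _ = softmax2(z, 0.0)
    assert abs(p1 - sigmoid(z)) < 1e-12
```

Equivalently: softmax only depends on score differences, so a binary classifier needs just one score z, and the sigmoid is the 2-class softmax with the other score pinned to 0.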
Other types of losses
Classification (max-margin)
SVM Loss

Credit: Krähenbühl, image credit
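The deck only names the SVM (max-margin) loss; here is a minimal sketch of the binary hinge loss max(0, 1 − y·s) with labels in {−1, +1} (my own illustration, not the course's code):

```python
def hinge_loss(score, y):
    # Binary max-margin loss, y in {-1, +1}: zero once the score has
    # the right sign with margin >= 1, linear in the violation otherwise.
    return max(0.0, 1.0 - y * score)

assert hinge_loss(2.0, +1) == 0.0    # correct with margin -> no loss
assert hinge_loss(0.5, +1) == 0.5    # correct but inside the margin
assert hinge_loss(-1.0, +1) == 2.0   # wrong side -> large loss
```

Unlike the softmax loss, the hinge loss is exactly zero for confidently correct examples, so they stop contributing gradient entirely.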


Embeddings

Credit: Krähenbühl, image credit1, credit2


Embeddings
Embedding Learning

After training
Embedding Learning: Triplet Loss

‖a − p‖²    ‖a − n‖²

(a: anchor, p: positive example, n: negative example)
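The slide shows only the two squared distances; the standard triplet loss combines them as max(0, ‖a − p‖² − ‖a − n‖² + margin). A minimal sketch (my own code; the squared-Euclidean form and the margin value 0.2 are my assumptions):

```python
def triplet_loss(anchor, positive, negative, margin=0.2):
    # Pull the anchor-positive distance down, push the anchor-negative
    # distance up until it exceeds the positive distance by `margin`.
    d_ap = sum((a - p) ** 2 for a, p in zip(anchor, positive))
    d_an = sum((a - n) ** 2 for a, n in zip(anchor, negative))
    return max(0.0, d_ap - d_an + margin)

a, p, n = [0.0, 0.0], [0.1, 0.0], [1.0, 1.0]
assert triplet_loss(a, p, n) == 0.0   # positive already much closer: no loss
assert triplet_loss(a, n, p) > 0.0    # violated triplet: positive loss
```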
