
Deep Deterministic Policy Gradient (DDPG)
Actor-Critic Approach
• The actor function specifies the action a to take given the current state of the environment s
• The critic value function provides a signal (the TD error) that criticizes the actions made by the actor (a minimal sketch of both networks follows)
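As a concrete picture of the two functions, here is a minimal sketch of an actor and a critic network, assuming a PyTorch setup with a continuous action space scaled to [-1, 1]; the class names, layer sizes, and dimensions (state_dim, action_dim) are illustrative, not taken from the slides.

import torch
import torch.nn as nn

class Actor(nn.Module):
    """Deterministic actor: maps a state s to an action a = mu(s | theta_mu)."""
    def __init__(self, state_dim, action_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 256), nn.ReLU(),
            nn.Linear(256, 256), nn.ReLU(),
            nn.Linear(256, action_dim), nn.Tanh(),  # actions bounded to [-1, 1]
        )

    def forward(self, state):
        return self.net(state)

class Critic(nn.Module):
    """Critic: maps a (state, action) pair to a scalar Q-value Q(s, a | theta_Q)."""
    def __init__(self, state_dim, action_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim, 256), nn.ReLU(),
            nn.Linear(256, 256), nn.ReLU(),
            nn.Linear(256, 1),
        )

    def forward(self, state, action):
        return self.net(torch.cat([state, action], dim=-1))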
Deterministic vs Stochastic Policy
• Stochastic policy: $\pi_\theta(a \mid s) = \mathbb{P}[a \mid s; \theta]$
• Deterministic policy: $a = \mu_\theta(s)$
• Computing the stochastic policy gradient requires more samples, as it integrates over both the state and the action space. The deterministic gradient is preferable, since it integrates over the state space only.
• In DQN, the action was selected as $a_t = \arg\max_a Q(s_t, a \mid \theta^Q)$.
• The equation above is not practical for a continuous action space. Using a deterministic policy allows us to select actions as $a_t = \mu(s_t \mid \theta^\mu)$ (see the sketch below).
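To make the contrast concrete, here is a small sketch of action selection in the two cases, reusing the Actor sketch above; q_net is assumed to be a DQN-style network mapping a state to one Q-value per discrete action, and both helper names are illustrative.

import torch

# Discrete case (DQN-style): enumerate Q(s, a) over a finite action set and take the argmax.
def select_action_discrete(q_net, state):
    with torch.no_grad():
        q_values = q_net(state)           # shape: (num_actions,)
    return int(q_values.argmax().item())  # argmax_a Q(s, a)

# Continuous case (DDPG): the argmax over an infinite action set is intractable,
# so the deterministic actor outputs the action directly: a = mu(s | theta_mu).
def select_action_continuous(actor, state):
    with torch.no_grad():
        return actor(state)               # one forward pass, no maximization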
Deterministic vs Stochastic Policy
• However, a deterministic policy might not explore the full state and action space. To overcome this, we add a noise process $\mathcal{N}$ to the actor's output: $a_t = \mu(s_t \mid \theta^\mu) + \mathcal{N}_t$ (a sketch follows this list).
• Recall that we used the ε-greedy approach in DQN to ensure exploration.
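A minimal sketch of the noisy action selection, assuming simple Gaussian exploration noise; the noise process is not specified on this slide (the original DDPG paper used an Ornstein-Uhlenbeck process), and the function name, noise scale, and action bounds are illustrative.

import torch

def noisy_action(actor, state, noise_std=0.1, low=-1.0, high=1.0):
    """Exploration: a_t = mu(s_t | theta_mu) + N_t, with N_t Gaussian here."""
    with torch.no_grad():
        action = actor(state)
    noise = noise_std * torch.randn_like(action)  # N_t
    return (action + noise).clamp(low, high)      # keep the action within valid bounds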
Deterministic Policy Gradient
• We want to maximize the rewards (Q-values) received over the sampled mini-batch. The gradient is given as:
$\nabla_{\theta^\mu} J \approx \mathbb{E}_{s_t \sim \rho^\beta}\left[ \nabla_{\theta^\mu} Q(s, a \mid \theta^Q) \big|_{s=s_t,\, a=\mu(s_t \mid \theta^\mu)} \right]$
where J is the expected return measured from the start distribution.
• Applying the chain rule:
$\nabla_{\theta^\mu} J \approx \mathbb{E}_{s_t \sim \rho^\beta}\left[ \nabla_a Q(s, a \mid \theta^Q) \big|_{s=s_t,\, a=\mu(s_t)} \, \nabla_{\theta^\mu} \mu(s \mid \theta^\mu) \big|_{s=s_t} \right]$
• Silver et al. (2014) proved that this is the policy gradient, i.e. we will obtain the maximum expected reward as long as we update the model parameters following the gradient formula above (a sketch of the resulting actor update follows).
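In an autograd framework the chain rule above does not have to be coded by hand: feeding the actor's output into the critic and maximizing the result yields exactly the product of the two gradients. A minimal sketch, assuming the Actor/Critic sketches above and a user-supplied optimizer (e.g. torch.optim.Adam over the actor's parameters); the argument names are illustrative.

import torch

def actor_update(actor, critic, actor_optimizer, state_batch):
    """One deterministic-policy-gradient step: maximize Q(s, mu(s)) over the mini-batch.
    Autograd applies grad_a Q(s, a) * grad_theta_mu mu(s), i.e. the chain rule above."""
    actor_loss = -critic(state_batch, actor(state_batch)).mean()  # ascend Q <=> descend -Q
    actor_optimizer.zero_grad()
    actor_loss.backward()  # gradients flow through the critic into the actor parameters
    actor_optimizer.step()
    return actor_loss.item()

# e.g. actor_optimizer = torch.optim.Adam(actor.parameters(), lr=1e-4)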
DDPG Algorithm
[DDPG algorithm pseudocode, annotated as follows]
• Use target networks to do off-policy updates
• Action selected using the deterministic actor, as explained before
• Experience replay
• Policy gradient update for the actor
• Soft target updates with $\tau \ll 1$: slow updates, but highly stable
(a sketch of one full update step follows)
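A minimal sketch of one full DDPG update step tying these annotations together: experience replay, target networks for the critic's off-policy target, the policy gradient step for the actor, and slow soft target updates with a rate τ ≪ 1. It assumes the Actor/Critic sketches above and a replay buffer whose sample() returns tensor batches shaped (batch, ·); all names and hyperparameter values are illustrative.

import copy
import torch
import torch.nn.functional as F

def make_targets(actor, critic):
    """Target networks start as copies of the online networks."""
    return copy.deepcopy(actor), copy.deepcopy(critic)

def soft_update(target_net, online_net, tau=0.005):
    """theta_target <- tau * theta + (1 - tau) * theta_target, with tau << 1."""
    for t_param, param in zip(target_net.parameters(), online_net.parameters()):
        t_param.data.mul_(1.0 - tau).add_(tau * param.data)

def ddpg_update(actor, critic, actor_target, critic_target,
                actor_optimizer, critic_optimizer, replay_buffer,
                batch_size=64, gamma=0.99, tau=0.005):
    # Experience replay: sample a mini-batch of stored transitions.
    state, action, reward, next_state, done = replay_buffer.sample(batch_size)

    # Off-policy critic target computed with the target networks:
    # y = r + gamma * Q'(s', mu'(s') | theta_Q') for non-terminal transitions.
    with torch.no_grad():
        next_action = actor_target(next_state)
        target_q = reward + gamma * (1.0 - done) * critic_target(next_state, next_action)

    # Critic update: minimize the TD error against the target.
    critic_loss = F.mse_loss(critic(state, action), target_q)
    critic_optimizer.zero_grad()
    critic_loss.backward()
    critic_optimizer.step()

    # Actor update: deterministic policy gradient (as in actor_update above).
    actor_loss = -critic(state, actor(state)).mean()
    actor_optimizer.zero_grad()
    actor_loss.backward()
    actor_optimizer.step()

    # Slow target updates (tau << 1): stable, slowly moving targets.
    soft_update(actor_target, actor, tau)
    soft_update(critic_target, critic, tau)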
