Open in app
Jonathan Hui
17.2K Followers
About

Sign in

17.2K Followers
About
Open in app
Hi Jonathan,
1

praneeth sv

Jonathan Hui

Jonathan Hui

Dec 24, 2018·1 min read

The natural policy gradient is:

Image for post
Image for post

We use the conjugate gradient to compute the value for:

Image for post
Image for post

which is unrelated with trust region.

Written by

Jonathan Hui

Deep Learning

More from Jonathan Hui

Deep Learning

More From Medium

TensorFlow RNN models

Jonathan Hui

TensorFlow Eager Execution v.s. Graph (@tf.function)

Jonathan Hui

TensorFlow & Keras

Jonathan Hui

F is a second-order derivative.

Jonathan Hui

TensorFlow Custom training, Transfer learning & Custom layers

Jonathan Hui

TensorFlow Keras Preprocessing Layers & Dataset Performance

Jonathan Hui

TensorFlow Automatic Differentiation (AutoDiff)

Jonathan Hui

TensorFlow 2 Series

Jonathan Hui

About

Help

Legal

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store