Eligibility Traces in TensorFlow

Question

According to Sutton's book, Reinforcement Learning: An Introduction, the update equation for the network weights is:

θ_{t+1} = θ_t + α δ_t e_t

where e_t is the eligibility trace. This looks like a gradient descent update with an extra factor e_t.
Can this eligibility trace be incorporated into tf.train.GradientDescentOptimizer in TensorFlow?
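To make the update in the question concrete, here is a minimal pure-Python sketch of one TD(lambda) step with a linear value function. It is an illustration under assumptions, not code from the question: the function name td_lambda_step, the accumulating-trace recursion e_t = gamma * lambda * e_{t-1} + gradient, and the parameter values are all chosen here for the example.

```python
def td_lambda_step(theta, trace, features, next_features, reward,
                   alpha=0.1, gamma=0.9, lam=0.8):
    """One TD(lambda) update for a linear value function v(s) = theta . x(s).

    Hypothetical helper for illustration; names and defaults are assumptions.
    """
    value = sum(w * x for w, x in zip(theta, features))
    next_value = sum(w * x for w, x in zip(theta, next_features))
    # TD error: delta_t = r + gamma * v(s') - v(s)
    delta = reward + gamma * next_value - value
    # Accumulating trace: e_t = gamma * lambda * e_{t-1} + grad_theta v(s).
    # For a linear v, the gradient is simply the feature vector.
    new_trace = [gamma * lam * e + x for e, x in zip(trace, features)]
    # Weight update: theta_{t+1} = theta_t + alpha * delta_t * e_t
    new_theta = [w + alpha * delta * e for w, e in zip(theta, new_trace)]
    return new_theta, new_trace, delta


theta, trace = [0.0, 0.0], [0.0, 0.0]
theta, trace, delta = td_lambda_step(
    theta, trace, features=[1.0, 0.0], next_features=[0.0, 1.0], reward=1.0)
print(theta, trace, delta)
```

The point of the sketch is that the trace replaces the plain gradient in the update, so the weights move along a decaying sum of past gradients rather than only the current one.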

reinforcement-learning tensorflow gradient-descent

nikpod 06 June 17 at 3:59

1 answer

Allen Lavoie · Accepted Answer · 2017-06-07T18:05:11+0000

Here's a simple example that uses tf.contrib.layers.scale_gradient for element-wise gradient multiplication. In the forward pass it is just an identity op; in the backward pass it multiplies the gradients by its second argument.

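import tensorflow as tf  # TensorFlow 1.x (uses tf.contrib and tf.Session)

with tf.Graph().as_default():
  some_value = tf.constant([0.,0.,0.])
  # Forward: identity on some_value. Backward: gradient scaled element-wise.
  scaled = tf.contrib.layers.scale_gradient(some_value, [0.1, 0.2, 0.3])
  # Without the scaling, d(sum(scaled))/d(some_value) would be all ones.
  (some_value_gradient,) = tf.gradients(tf.reduce_sum(scaled), some_value)
  with tf.Session():
    print(scaled.eval())               # forward value, unchanged
    print(some_value_gradient.eval())  # scaled gradient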

Prints:

[ 0.  0.  0.]
[ 0.1         0.2         0.30000001]
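For readers without a TensorFlow 1.x install, the identity-forward, scaled-backward behaviour described above can be mimicked in plain Python. This is only a rough sketch of the semantics: the scale_gradient function below is a hypothetical stand-in written for this example, not the TensorFlow implementation, and the backward closure is an assumption about how one might model the gradient pass by hand.

```python
def scale_gradient(values, scales):
    """Hypothetical pure-Python stand-in, not the TensorFlow op."""
    def backward(upstream):
        # Backward pass: multiply each incoming gradient by its scale.
        return [g * s for g, s in zip(upstream, scales)]
    # Forward pass: identity, the values are returned unchanged.
    return list(values), backward


scaled, backward = scale_gradient([0.0, 0.0, 0.0], [0.1, 0.2, 0.3])
# The gradient of sum(scaled) with respect to each element is 1.0,
# so the upstream gradient fed into the backward pass is all ones.
grads = backward([1.0, 1.0, 1.0])
print(scaled)
print(grads)
```

Here the per-element scales play the role of the second argument in the TensorFlow example: they leave the forward value alone and only reshape the gradient.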
