View Source Rein.Agents.DDPG (rein v0.1.0)

Deep Deterministic Policy Gradient implementation.

This assumes that the Actor network will output {nil, num_actions} actions, and that the Critic network accepts the "actions" input with the same shape.

Actions are deemed to be in a continuous space of type :f32.