Mehdi Mounsif

1 Questions 1 Answers 0 Followers

Questions from Mehdi Mounsif

When training an RL actor-critic agent, what are the key values to monitor?

I'm training an agent to accomplish a reaching task. The agent controls a multi-joint robotic arm and has to reach for a target. So far, I've had some success with vanilla policy gradient but, to...

13 February 2018 7,467 0 View