1 Questions 1 Answers 0 Followers
Questions related from Mehdi Mounsif
I'm training an agent to accomplish a reaching task. The agent controls a multi-joint robotic arm and has to reach for a target. So far, I've had some success with vanilla policy gradient but, to...
13 February 2018 7,386 0 View