Related questions from Ali Amini Bagh
Hello everybody. The reward is necessary to tell the machine (agent) which state-action pairs are good and which are bad. Please help me to understand the behavior of the discount factor or...
06 January 2020 7,915 1 View
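For readers wondering how the discount factor mentioned in the question behaves, here is a minimal sketch in Python. It assumes the standard discounted return, the sum of gamma**t * r_t over time steps t. The function name `discounted_return` and the example reward sequence are made up for illustration and are not taken from the question.

```python
def discounted_return(rewards, gamma):
    """Return sum(gamma**t * r_t) for a finite reward sequence.

    Assumption: rewards[t] is the reward received at time step t,
    and gamma is the discount factor in [0, 1].
    """
    total = 0.0
    # Work backwards so each step is: G_t = r_t + gamma * G_{t+1}
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


if __name__ == "__main__":
    # Hypothetical episode: a single reward of 1.0 arrives at the fourth step.
    rewards = [0.0, 0.0, 0.0, 1.0]
    for gamma in (0.0, 0.5, 0.9, 1.0):
        print(f"gamma={gamma}: return={discounted_return(rewards, gamma)}")
```

With gamma close to 0 the agent effectively sees only the immediate reward, so the delayed reward in this example contributes almost nothing. With gamma close to 1 the delayed reward counts nearly in full, which is why larger values make the agent more far-sighted.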