1 Questions 1 Answers 0 Followers
Questions related from Soumia Mehimeh
Dynamics in reinforcement learning, that are represented by the transition function in an MDP, are meant to modelize the probability of reaching (or deriving from) the desired state. From what I...
21 March 2022 1,142 3 View