Questions related from A.s. Gowri
Assume there are 80 states, each with 2 possible actions, and a predefined reward function. The problem is about research allocation.
31 October 2020
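
One way to read the setup in the question (80 states, 2 actions per state, a predefined reward function) is as a small finite Markov decision process, which is small enough to solve exactly with value iteration. The sketch below is only an illustration under that reading: the question gives no transition probabilities, reward values, or discount factor, so `P`, `R`, and `gamma` here are hypothetical random placeholders, and `value_iteration` is a name introduced for this example.

```python
import numpy as np

def value_iteration(P, R, gamma=0.95, tol=1e-8, max_iter=10_000):
    """Solve a finite MDP by value iteration.

    P: array of shape (A, S, S), P[a, s, t] = probability of moving s -> t under action a.
    R: array of shape (S, A), R[s, a] = immediate reward for taking action a in state s.
    Returns the state values V (shape (S,)) and a greedy policy (shape (S,)).
    """
    V = np.zeros(R.shape[0])
    for _ in range(max_iter):
        # Q[s, a] = R[s, a] + gamma * sum_t P[a, s, t] * V[t]
        Q = R + gamma * np.einsum("ast,t->sa", P, V)
        V_new = Q.max(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    # Recompute Q for the final V and act greedily with respect to it.
    Q = R + gamma * np.einsum("ast,t->sa", P, V)
    return V, Q.argmax(axis=1)

# Hypothetical problem data: the question does not supply any of these numbers.
N_STATES, N_ACTIONS = 80, 2
rng = np.random.default_rng(0)
P = rng.random((N_ACTIONS, N_STATES, N_STATES))
P /= P.sum(axis=2, keepdims=True)        # make each row a probability distribution
R = rng.random((N_STATES, N_ACTIONS))    # stand-in for the predefined reward function

V, policy = value_iteration(P, R)
print(V.shape, policy.shape)
```

With the real transition model and reward function substituted for the random placeholders, `policy[s]` would give the chosen action (0 or 1) in state `s`. If the transition probabilities are not known, a model-free method such as Q-learning would be the usual alternative; that is an assumption about the problem, not something stated in the question.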