Gavin Rens

3 Questions 3 Answers 0 Followers

Questions from Gavin Rens

What research has been done on learning non-Markovian reward functions?

Recently, some work has been done on planning and learning in Non-Markovian Decision Processes, that is, decision-making with temporally extended rewards. In these settings, a particular reward is...

04 April 2019 8,722 2 View

What is the state-of-the-art in Online POMDP planning?

Which online (approximate) POMDP planning algorithm is generally the most effective these days?

08 August 2016 1,226 0 View

What is the definition of the POMDP state value function when the reward function is defined as R: S x A x S -> Reals?

When the reward function is defined as R(a,s), the value function is defined as max_{a in A} (rho(a,b) + ...), where A is the set of actions, b is the current belief state and rho(a,b) is the...

05 May 2016 3,071 3 View