Gavin Rens

4 Questions 4 Answers 0 Followers

Questions related from Gavin Rens

What research has been done on learning non-Markovian reward functions?

Recently, some work has been done planning and learning in Non-Markovian Decision Processes, that is, decision-making with temporally extended rewards. In these settings, a particular reward is...

07 April 2019 5,089 2 View

What is the definition of the POMDP state value function when the reward function is defined as R:S x A x S->Reals?

When the reward function is defined as R(a,s), the value function is defined as max_{a in A} (rho(a,b) + ...), where A is the set of actions, b is the current belief state and rho(a,b) is the...

06 May 2016 9,764 3 View

Can anyone recommend free software which is easy to use, for computing Minimum Cross-Entropy?

I'm working with discrete distributions, and i have only one constraint (over the atomic events) on the posterior distribution. In particular, i have a prior distribution P over four atomic events...

26 January 2016 6,961 2 View

Is there work which combines POMDP planning and the BDI architecture?

Do you think it is a good or bad idea to use a partially observable Markov decision process (POMDP) planner instead of a plan library in the belief-desire-intention (BDI) architecture? The...

19 March 2014 8,854 1 View