For an MDP (S, A, r, p), I am looking for an example in which the transition probabilities do not depend on the action, that is:

p(s'|s,a) = p(s'|s) for all s, s', a
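
One way to picture this property is a toy problem in which the state evolves on its own and the action only affects the reward. The sketch below is a made-up illustration, not taken from any reference: the state names, action names, probabilities, and rewards are all assumptions chosen for readability. The state is the weather, which changes regardless of what the agent does, and the action is whether to carry an umbrella.

```python
# Hypothetical toy MDP: the weather (state) ignores the agent's action.
STATES = ["sunny", "rainy"]
ACTIONS = ["take_umbrella", "leave_umbrella"]

# Assumed weather dynamics p(s'|s); each row sums to 1.
WEATHER = {
    "sunny": {"sunny": 0.8, "rainy": 0.2},
    "rainy": {"sunny": 0.4, "rainy": 0.6},
}

# Assumed rewards r(s, a): carrying the umbrella has a small cost,
# getting caught in the rain without it has a large one.
REWARD = {
    ("sunny", "take_umbrella"): -1.0,
    ("sunny", "leave_umbrella"): 0.0,
    ("rainy", "take_umbrella"): -1.0,
    ("rainy", "leave_umbrella"): -5.0,
}


def p(s_next, s, a):
    """Transition probability p(s'|s,a); the action argument is unused."""
    return WEATHER[s][s_next]


def is_action_independent():
    """Check p(s'|s,a) = p(s'|s) for all s, s', a."""
    return all(
        p(s_next, s, a) == WEATHER[s][s_next]
        for s in STATES
        for s_next in STATES
        for a in ACTIONS
    )


def greedy_policy():
    """Pick, in each state, the action with the largest immediate reward."""
    return {s: max(ACTIONS, key=lambda a: REWARD[(s, a)]) for s in STATES}


if __name__ == "__main__":
    print(is_action_independent())
    print(greedy_policy())
```

Because the next-state distribution is the same for every action, the expected future value in Q(s, a) is identical across actions, so with a reward of the form r(s, a) the greedy choice on immediate reward is already optimal. That is why the sketch needs no value iteration.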
