I'm new in reinforcement learning and I don't know the difference between value iteration and policy iteration methods!

I am also very confused about categories of methods in reinforcement learning. Some studies classified reinforcement learning methods in two groups: model-based and model-free. But, some other studies classified reinforcement learning methods as: value iteration and policy iteration.

I were wondering if anybody help me to know the relation between these classification, as well.

More Negin Malekian's questions See All
Similar questions and discussions