Hello Everyone,
I am recently working on the topic " Deep Reinforcement Learning in production scheduling".
Most recently I am working on the simulation of the environment, the state and action engineering process.
The uprising question for me here is: how are uncertainty and constraints taken into account in a RL project?
In stochastic optimization there is a probability distribution of several factors which are taken into account. Also the solution space is restricted with constraints.
How are these factors considered in model-free RL? Is the uncertainty included in the simulation of the environment? Are the state or action space restricted through constraints?
Best regards,
Christoph