I am using reinforcement learning to control my system. The problem that I have is that two control signals are dependent to each other and their value should satisfy a constraint something like this :
u1-u2>b1
u1-u2< b2
How can I apply this constraint on the output of agent in the reinforcement learning?