I want to improve the performance of the agent. I think the critical point of improving the agent is to normalize the observation and action. However, I don't know how to normalize them when the range of them are infinite. If anyone knows how to solve the problem or any tricks to improve the agents, you are very welcome to release any idea! Thank you so much!