12 February 2022

I want to improve the performance of my agent. I think the key to improving it is to normalize the observations and actions. However, I don't know how to normalize them when their ranges are unbounded. If anyone knows how to solve this problem, or has any other tricks for improving the agent, you are very welcome to share your ideas. Thank you so much!
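
To make the question concrete, here is a minimal sketch of one common way to handle quantities with no known bounds: standardize observations with a running mean and variance, then clip the result, and squash raw action outputs with `tanh` into a working range. Everything here is an assumption for illustration, not a specific library's API: the names `RunningNormalizer` and `squash_action`, the clip value of 5, and the action limits `low` and `high` are all made up.

```python
import numpy as np


class RunningNormalizer:
    """Standardize inputs using a running mean and variance (hypothetical helper)."""

    def __init__(self, shape, clip=5.0, eps=1e-8):
        self.mean = np.zeros(shape, dtype=np.float64)
        self.var = np.ones(shape, dtype=np.float64)
        self.count = 0
        self.clip = clip  # assumed clip range, in standard deviations
        self.eps = eps    # avoids division by zero

    def update(self, batch):
        """Merge a batch of observations into the running statistics."""
        batch = np.asarray(batch, dtype=np.float64).reshape(-1, *self.mean.shape)
        b_mean, b_var, b_n = batch.mean(axis=0), batch.var(axis=0), batch.shape[0]
        delta = b_mean - self.mean
        total = self.count + b_n
        # Parallel (batched) mean/variance merge.
        m2 = self.var * self.count + b_var * b_n + delta**2 * self.count * b_n / total
        self.mean = self.mean + delta * b_n / total
        self.var = m2 / total
        self.count = total

    def normalize(self, x):
        """Return x shifted and scaled to roughly zero mean and unit variance, then clipped."""
        z = (np.asarray(x, dtype=np.float64) - self.mean) / np.sqrt(self.var + self.eps)
        return np.clip(z, -self.clip, self.clip)


def squash_action(raw, low, high):
    """Map an unbounded raw action into [low, high] using tanh."""
    raw = np.asarray(raw, dtype=np.float64)
    return low + 0.5 * (np.tanh(raw) + 1.0) * (high - low)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    normalizer = RunningNormalizer(shape=(3,))
    observations = rng.normal(loc=100.0, scale=50.0, size=(1000, 3))
    normalizer.update(observations)
    print(normalizer.normalize(observations[0]))
    print(squash_action([-10.0, 0.0, 10.0], low=-2.0, high=2.0))
```

If the true action range is unbounded, `low` and `high` would have to be a working range chosen by hand, so this only sidesteps that part of the problem.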
