20 October 2022

In reinforcement-learning-based optimal control, why do authors choose even-order polynomials as the basis functions of the critic neural network in simulations?

For example, the basis function is given as $\phi(x) = [x_1^2,\ x_1 x_2,\ x_2^2,\ x_1^4,\ x_1^2 x_2^2,\ x_2^4]^\top$, with $x = [x_1,\ x_2]^\top$.
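
To make the example concrete, here is a minimal Python sketch (not taken from any particular paper) that evaluates this basis together with a critic of the commonly used linear-in-weights form $\hat{V}(x) = W^\top \phi(x)$. The function names, the weight vector `w`, and the test state `x` are hypothetical and chosen only for illustration. The sketch also checks two properties of this particular basis: $\phi(0) = 0$ and $\phi(-x) = \phi(x)$.

```python
import numpy as np


def phi(x):
    """Even-order polynomial basis from the example, for x = [x1, x2]."""
    x1, x2 = x
    return np.array([
        x1**2, x1 * x2, x2**2,          # total degree 2
        x1**4, x1**2 * x2**2, x2**4,    # total degree 4
    ])


def critic_value(w, x):
    """Linear-in-weights critic V_hat(x) = w^T phi(x) (assumed form)."""
    return float(w @ phi(x))


# Hypothetical weights and state, for illustration only.
w = np.array([1.0, 0.5, 1.0, 0.1, 0.2, 0.1])
x = np.array([0.5, -1.0])

print("V_hat(x)            =", critic_value(w, x))
print("phi(0) is all zeros :", bool(np.all(phi(np.zeros(2)) == 0.0)))
print("phi(-x) equals phi(x):", bool(np.allclose(phi(-x), phi(x))))
```

Every term in this basis has even total degree, so any critic built from it is symmetric about the origin and is zero at $x = 0$, regardless of the weights.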
