In the field of reinforcement learning based optimal control, why the authors choose the multiple polynomials with even orders as critic neural network basis functions for simulation?
For example: basis function is given as $\phi (x)$=[x1^2, x1x2, x2^2, x1^4, x1^2x2^2, x2^4 ]', x=[x1, x2]'