How does Reinforcement Learning (RL) relate to Metaheuristic Search (MS) techniques like Genetic Algorithm, Particle Swarm Optimization, etc. and to Generative Adversarial Learning used in GANs? I feel that MS is similar to RL in the sense that it creates the next population based on feedback about the suitability of candidates from the present population based on function evaluations. Similarly, the generator and discriminator networks in GANs train based on feedback from the each other. What are your views? What are some other methods apart from MS and GANs that are similar to RL but not consider to be RL exactly?