In any deep learning procedure in AI problems we effectively find the weights by some kind of a minimization process. This process can reach a local minimum and thus the weights can not be the best we could find. How is that avoided?

Similar questions and discussions