We often say that a neural network overshoots the minimum, and fails to converge, when the learning rate is too high. How can the upper bound on the learning rate be determined?
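
To make the question concrete, here is a minimal sketch of overshooting in the simplest setting I can think of, not a result for neural networks in general. It assumes a quadratic loss f(w) = 0.5 * w^T H w, for which plain gradient descent is known to converge only when the learning rate is below 2 / lambda_max(H), where lambda_max is the largest eigenvalue of the Hessian. The matrix `H`, the helper `run_gd`, and the step count are illustrative choices of mine.

```python
import numpy as np

def run_gd(hessian, lr, steps=200):
    """Plain gradient descent on f(w) = 0.5 * w^T H w, starting from all ones.

    Returns the norm of the final iterate (the minimum is at w = 0).
    """
    w = np.ones(hessian.shape[0])
    for _ in range(steps):
        w = w - lr * (hessian @ w)  # the gradient of f at w is H w
    return np.linalg.norm(w)

# Illustrative Hessian with curvatures 1 and 10 (an assumption, not real data).
H = np.diag([1.0, 10.0])

# For this quadratic, the stability bound is 2 / (largest eigenvalue of H).
lam_max = np.linalg.eigvalsh(H).max()
lr_bound = 2.0 / lam_max

below = run_gd(H, 0.9 * lr_bound)  # slightly under the bound: iterates shrink
above = run_gd(H, 1.1 * lr_bound)  # slightly over the bound: iterates blow up

print(f"bound = {lr_bound:.3f}, below -> {below:.3e}, above -> {above:.3e}")
```

For a real network the loss is not quadratic and the Hessian changes during training, so this bound is at best a local estimate. Is there a principled way to measure it in that case?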
