can any one simplified benefits of mini-batch gradient descent over stochastic gradient descent ?

Thanks

Similar questions and discussions