What parameters or factors are affected if I checkpoint and retrain the model from the same point instead of training altogether. Is there some kind of tradeoff between the two methods?

e.g. In place of running 40 epochs straight, I run it 4 times with 10 epochs each.

The parameters I was focusing on were :

1. Accuracy ( model performance)

2. Time for training

any others I should consider?

More Ashay Singh's questions See All
Similar questions and discussions