How could we estimate optimal learning rate for machine learning model, if we realized from a gap occurs between validation and trainining graph. When epoch number increased, validation accuracy remained similar although model accuracy increased slightly (we concluded a local maximum exists)? We used a little learning rate, such as 0.000001, almost to skip the local maksimum. Could it be result of using number of layers in 3D-CNN model even we applied dropout. Do you have any idea or suggestion? All ideas are welcome and tanks to all.
Best regards,