In varying the number of neurons to determine the optimum number of neurons for an ANN model I obtained the attached graph. Why does the MSE increase, decrease and then increase when varying the number of neurons to optimize an ANN?

Similar questions and discussions