I am trying to train the CIFAR10 architecture on the CIFAR10 dataset in MatConvNet without a GPU, but my objective function is not decreasing to 1,665. Which parameters should I tune, given these graphs?
These options were used (Momentum=0.9; BatchSize=256; NumEpochs=45; Continue=true; …); the rest were left at their defaults.