I am running an MLP to estimate an output give 3 variables and 1600 observations. I resample the data into 10 bags and in each bag, I performed 10-fold cross validation. However, when I average the results of the 10 bags, the outcome is worse than without doing bootstrap aggregating. So I am curious whether bootstrap aggregating is an useful method for improving my predictor or not.