Can you explain it or recommend some papers?
(1) For classification problem, we can use recall rate, precision rate and F1 score, AUC, balanced accuracy(although I don't know what is mean/ max balanced accuracy) in same time to compare different classifiers.
but I haven't see any paper to evaluate different numerical prediction models (forgive my ignorance).
(2) I only know r-squared, MAE, MAPE and RMSE could be used to assess models, but how to combine them to evaluate in a certain case, should add any other metrics?
(3) Could you help explain max/mean balanced accuracy, mean/max recall, mean/max accuracy, mean/max AUC, mean/max F1 if you are free. Why use mean or max, aren't these values specific values? Why are they divided into mean and max?