Does anyone know how much data from the whole data set should be selected for model selection? Particularly in case of imbalanced data, how should we select a portion of data for model selection?

Similar questions and discussions