For example, when dividing a large sample into a training and a test set, how do I best choose the bins for an age stratification? Should bins reflect years, decades, or should they be based on quantiles of the age distribution? What is your experience with different choices?