Generally, in a classification task, when the class distribution is unbalance, most classifiers are bias towards the majority class.

Similarly, for Regression, regressors (especially linear regression) usually work better when the target variable is close to a normal distribution (i.e. a bell curve).

My question is: Do the above two general statements stand when a decision tree or random forest based classifier/regressor is used? Is the distribution able to affect/bias the calculation of information gain?

Similar questions and discussions