I am currently working with a large data set in which most classes have an imbalanced distribution. I plan to use the SVM and KNN algorithms for classification. Which data preprocessing methods are most suitable for large, imbalanced data sets?
