I am currently working with a large data set in which most classes are imbalanced. I plan to use SVM and KNN for classification. Which data preprocessing methods are most suitable for large, imbalanced data sets?