I have data set which contains 14 categorical nominal data. However, the dataset is imbalanced some categories have more sample than others. Data were one-hot encoded and I tried SMOTE-NC. I'm not sure if this technique is relevant to categorical data only because is K-nearest based.

Could someone suggest me a technique or solution for categorical nominal dataset only?

Similar questions and discussions