I have some questions about clustering a data set using a SOM (self-organizing map). What are the prerequisites for the data?

I have mixed data types (categorical, binary, continuous).

I transformed the categorical variables into dummies, then merged everything into one data set. I did not apply a min-max scaler because I have many dummy variables.

I preferred to use PCA instead. I applied PCA to reduce the number of features (keeping the most important components), then used the resulting axes as input to my SOM (SOMPY).
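
A minimal sketch of the preprocessing described above, using made-up toy data and NumPy only. The column names, the sample size, and the choice of two components are assumptions for illustration, and the SOM training step itself is left out; the PCA scores are simply what would be handed to the SOM.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Hypothetical toy columns standing in for the real mixed-type data.
category = rng.integers(0, 3, size=n)          # categorical with 3 levels
binary = (rng.random(n) < 0.95).astype(float)  # binary, roughly 95% ones
continuous = rng.normal(50.0, 10.0, size=n)    # continuous, large scale

# Dummy (one-hot) encoding of the categorical column.
dummies = np.eye(3)[category]

# Merge everything into one matrix, without scaling (as in the question).
X = np.column_stack([dummies, binary, continuous])


def pca(X, n_components):
    """PCA via SVD on mean-centered data.

    Returns the projected scores, the explained variance ratios,
    and the component loadings for the first n_components axes.
    """
    Xc = X - X.mean(axis=0)
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained = S**2 / np.sum(S**2)
    scores = Xc @ Vt[:n_components].T
    return scores, explained[:n_components], Vt[:n_components]


scores, ratio, components = pca(X, n_components=2)

# `scores` (n x 2) is what would be passed to the SOM as input.
print("explained variance ratio:", np.round(ratio, 3))
print("loadings of first axis:", np.round(components[0], 3))
```

In this toy setup the unscaled continuous column dominates the first axis, which is worth checking on the real data before feeding the axes to the SOM.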

Is this a good approach?

If a binary variable has, for example, 95% ones and 5% zeros (or vice versa), is this a problem for clustering?
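
For reference, the variance of a 0/1 variable with a fraction p of ones is p(1 - p), so a 95/5 split carries much less variance than a 50/50 split. A small sketch of that arithmetic (the function name is only illustrative):

```python
def bernoulli_variance(p):
    """Variance of a 0/1 variable whose fraction of ones is p."""
    return p * (1.0 - p)


# Compare a balanced binary variable with the 95/5 and 5/95 cases.
for p in (0.5, 0.95, 0.05):
    print(p, round(bernoulli_variance(p), 4))
```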
