I have a dataset and trying to apply soft clustering, preferably Multivariate Gaussian Mixture model, but I have following doubts :

1. Does Multivariate GMM assume the underlying data to follow multivariate normality ? I guess, even if the individual components are Gaussian, their mixture can still be non-gaussian and thus violating this condition. Is this so ?

2. If multivariate normality is indeed required, what are the other ways to attain probabilistic clustering. Will be really helpful, if someone could refer a python or r based implementation.

More Ashutosh Karna's questions See All
Similar questions and discussions