I have a large dataset (more than 200K samples with more than 30 features). I have tried several clustering models: K-means, Gaussian Mixture, and agglomerative hierarchical clustering. All of them give essentially the same result. With subsets of fewer than 25K samples the models still produce varied clusters, but with more than 30K samples 99% of the data ends up in a single cluster. The number of clusters is 6, which was the best value according to the elbow method.
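
To make the imbalance concrete, here is a minimal sketch of how the share of samples per cluster could be measured. The helper names `cluster_shares` and `dominant_share` and the example labels are illustrative assumptions, not output from my data. In practice the labels would come from a fitted model, for example scikit-learn's `KMeans(n_clusters=6).fit_predict(X)`.

```python
from collections import Counter


def cluster_shares(labels):
    """Return {cluster_label: fraction of samples}, largest cluster first."""
    labels = list(labels)
    if not labels:
        raise ValueError("labels must not be empty")
    counts = Counter(labels)
    total = len(labels)
    # most_common() sorts by descending count; dicts keep insertion order.
    return {label: count / total for label, count in counts.most_common()}


def dominant_share(labels):
    """Fraction of samples that fall in the single largest cluster."""
    return next(iter(cluster_shares(labels).values()))


# Hypothetical labels mimicking the behaviour described above:
# 6 clusters, with cluster 0 holding 99% of 30,000 samples.
example_labels = (
    [0] * 29700 + [1] * 100 + [2] * 80 + [3] * 60 + [4] * 40 + [5] * 20
)

shares = cluster_shares(example_labels)
for label, share in shares.items():
    print(f"cluster {label}: {share:.2%}")
print(f"largest cluster holds {dominant_share(example_labels):.2%} of samples")
```

Running the same check on subsets of different sizes (for example 25K and 30K samples) would show at which size the largest cluster starts to dominate.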
