I have 6 measures per case, with 10 cases and I want to run a non hierarchical cluster analysis such as k-means (preceded by an analysis to define the number of clusters k that minimizes within-group sum of squares) on the 60 values, but somewhat forcing cases to stick together, so that for a single case I don't get 3 values going to one cluster and 3 values to another...

In some cases I get a majority (4 or 5 out of 6) of values clustering together, but I don't know if it is even ethical to assume that this case belongs to cluster x because the majority of values from that case went to cluster x...

So in other words, I wish to cluster cases, while taking into account their inherent variation, represented by the 6 values.

Is there a way of doing this? Or a different method?

Similar questions and discussions