We are using a dataset of student marks across different undergraduate and graduate years. We want to group the students into competency profiles using K-means. Two questions arise: how to choose "K", and how to give each cluster a meaningful name that reflects what it represents. Any suggestions?
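To make the two questions concrete, here is a minimal sketch of one common approach, not a definitive recipe: pick "K" by comparing the mean silhouette score over a few candidate values, then name each cluster after the subject where its centroid stands out most from the overall average. Everything specific here is an assumption for illustration: the subject names, the toy marks, the candidate range 2 to 5, the farthest-first initialization, and the 5-point naming threshold. It uses only the Python standard library.

```python
from math import dist

# Hypothetical subjects and toy marks (one tuple per student); not real data.
SUBJECTS = ("math", "programming", "writing")
MARKS = [
    (90, 85, 55), (88, 92, 60), (93, 88, 50), (85, 90, 58),
    (55, 50, 90), (60, 58, 88), (52, 55, 93), (58, 60, 85),
    (70, 72, 71), (68, 70, 73), (73, 69, 70), (71, 74, 68),
]

def farthest_first(points, k):
    """Deterministic init: start at the first point, then repeatedly add
    the point that is farthest from all centroids chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(max(points, key=lambda p: min(dist(p, c) for c in centroids)))
    return centroids

def kmeans(points, k, iters=100):
    """Plain Lloyd's algorithm; returns (centroids, labels)."""
    centroids = farthest_first(points, k)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: dist(p, centroids[c])) for p in points]
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            # Keep the old centroid if a cluster happens to be empty.
            updated.append(tuple(sum(col) / len(members) for col in zip(*members))
                           if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return centroids, labels

def silhouette(points, labels):
    """Mean silhouette score; singleton clusters contribute 0."""
    scores = []
    for i, p in enumerate(points):
        own = [dist(p, q) for j, q in enumerate(points) if j != i and labels[j] == labels[i]]
        if not own:
            scores.append(0.0)
            continue
        a = sum(own) / len(own)  # mean distance to own cluster
        b = min(                 # mean distance to the nearest other cluster
            sum(dist(p, q) for q, lab in zip(points, labels) if lab == c) / labels.count(c)
            for c in set(labels) if c != labels[i]
        )
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)

def name_cluster(centroid, overall_mean, threshold=5.0):
    """Name a cluster after the subject with the largest positive deviation
    from the overall mean, or 'balanced' if no subject stands out."""
    deviations = [c - m for c, m in zip(centroid, overall_mean)]
    top = max(range(len(deviations)), key=lambda i: deviations[i])
    if deviations[top] < threshold:
        return "balanced"
    return f"strong in {SUBJECTS[top]}"

# Question 1: choose K as the candidate with the highest mean silhouette.
scores = {k: silhouette(MARKS, kmeans(MARKS, k)[1]) for k in range(2, 6)}
best_k = max(scores, key=scores.get)

# Question 2: name each cluster from its centroid.
centroids, labels = kmeans(MARKS, best_k)
overall_mean = tuple(sum(col) / len(MARKS) for col in zip(*MARKS))
profile_names = [name_cluster(c, overall_mean) for c in centroids]
print(best_k, profile_names)
```

On real marks it would probably make sense to standardize each subject first, to cross-check the silhouette choice against an elbow plot of within-cluster variance, and to have someone who knows the curriculum review the automatic names before treating them as competency profiles.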
