To clarify: in your opinion, which attributes of users' messages (in this case, terms or topics) matter most when building a model that clusters users into types, where each type is a set of people who are similar in some way? In other words, if I represent a text message as an n-dimensional vector, how can I select only the most important terms in the message to use as features?
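To make the question concrete, here is a minimal sketch of the kind of representation I have in mind. It assumes TF-IDF as the importance score, which is only one possible choice and not necessarily the right one for clustering users. The function names (`tokenize`, `top_terms`, `vectorize`) and the naive whitespace tokenizer are hypothetical and used for illustration only.

```python
import math
from collections import Counter

PUNCTUATION = ".,!?;:\"'()"

def tokenize(message):
    # Naive tokenizer (an assumption): lowercase, split on whitespace,
    # strip surrounding punctuation, drop empty tokens.
    tokens = (word.strip(PUNCTUATION).lower() for word in message.split())
    return [t for t in tokens if t]

def top_terms(messages, n):
    # Rank terms by their TF-IDF score summed over all messages and
    # return the n highest-scoring terms as the feature vocabulary.
    docs = [tokenize(m) for m in messages]
    num_docs = len(docs)
    # Document frequency: number of messages containing each term.
    doc_freq = Counter(term for doc in docs for term in set(doc))
    scores = Counter()
    for doc in docs:
        term_counts = Counter(doc)
        for term, count in term_counts.items():
            idf = math.log(num_docs / doc_freq[term])
            scores[term] += (count / len(doc)) * idf
    # Sort by descending score; break ties alphabetically so the
    # vocabulary order is deterministic.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:n]

def vectorize(message, vocabulary):
    # Represent a message as an n-dimensional vector of term counts,
    # one dimension per vocabulary term.
    counts = Counter(tokenize(message))
    return [counts[term] for term in vocabulary]

messages = ["the cat sat", "the dog sat", "the bird flew"]
vocabulary = top_terms(messages, 4)
vectors = [vectorize(m, vocabulary) for m in messages]
```

With this scoring, a term that appears in every message (here "the") gets an IDF of zero and falls to the bottom of the ranking, so it is left out of a small vocabulary. Whether this notion of "important" suits grouping users is exactly what I am unsure about.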
Thank you for your help.