I have a huge database made up of thousands of sentences that can be divided into two groups depending on the nature of the verb within. I have about 20 parameters that define in a binary way these sentences (i. e presence or absence of a subordinated conjunction) and I am looking for a method able to split this db in segments according to an estimate of the best predictors among the 20 parameters. I've already used chi2, what would you suggest me ?