With decision trees its easy to overfit the data, especially in the case where there are lots of variables. However, if you want to grab a subset of variables that are more meaningful and explain most of the data, it would be nice if some metrics were available. 

Any suggestions, papers, software, comments,... will be much appreciated.

More Ron S Mahabir's questions See All
Similar questions and discussions