I have a dataset with a total of 103 cases. There are three distinct values for my dependent variable each of which having different size, i.e. 43, 42, and 18. I am using SPSS to classify my data using a Decision Tree classifier. My aim is to create spilling rules whereby I can predict the future events. However, I am not sure about the values I should input for minimum number of cases for each of the parent and child nodes. Decreasing the respective values lead to an increase in the number of terminal nodes and the depth level and of course the accuracy of classification, which is a desired aim for me. Anyhow, I am not sure as to whether there is a limit, kind of threshold that lowering the values below that makes the result meaningless. 

More Mahyar Masoudi's questions See All
Similar questions and discussions