Dear all colleagues,

I run small research about binary classification with Decision Tree. I have about 9000 data consist of about 68 features. I use sklearn for my library for this research since I use Python as programming language.

I use proportion for train and test with this sequence: 7:3,5:5 and 1:9. However, i found that each proportion has high accuracy for the test result (greater 90%) which lead me to suspicious about over-fitting. What is the best practice to investigate the indication of over-fitting of my model?

Similar questions and discussions