You need to test how well the classifiers generalise, e.g. with k-fold cross-validation or leave-one-out to estimate the error on unseen data, and McNemar's test to check whether the difference between two classifiers is statistically significant. The book Introduction to Machine Learning by Ethem Alpaydin gives a helpful introduction to the topic (chapter: Design and Analysis of Machine Learning Experiments).
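As a rough illustration of the tests mentioned above, here is a minimal standard-library sketch of k-fold cross-validation, leave-one-out (the special case where the number of folds equals the number of samples) and McNemar's test with continuity correction. The helper names (`k_fold_indices`, `out_of_fold_predictions`, `mcnemar`), the two toy classifiers and the synthetic data are all made up for this example and are not taken from the book; in practice you would probably use a library such as scikit-learn.

```python
import math
import random


def k_fold_indices(n, k, seed=0):
    """Shuffle the indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def out_of_fold_predictions(fit, X, y, k, seed=0):
    """Predict every sample with a model trained on the other folds.

    Setting k == len(X) gives leave-one-out.
    """
    preds = [None] * len(X)
    for fold in k_fold_indices(len(X), k, seed):
        held_out = set(fold)
        train = [i for i in range(len(X)) if i not in held_out]
        predict = fit([X[i] for i in train], [y[i] for i in train])
        for i in fold:
            preds[i] = predict(X[i])
    return preds


def mcnemar(y, pred_a, pred_b):
    """McNemar's test with continuity correction (chi-square, 1 d.o.f.)."""
    # b: A right and B wrong; c: A wrong and B right.
    b = sum(pa == t and pb != t for t, pa, pb in zip(y, pred_a, pred_b))
    c = sum(pa != t and pb == t for t, pa, pb in zip(y, pred_a, pred_b))
    if b + c == 0:
        return 0.0, 1.0  # the classifiers never disagree
    stat = (abs(b - c) - 1) ** 2 / (b + c)
    # Upper tail of the chi-square distribution with one degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Two toy classifiers (hypothetical, for illustration only).
def fit_majority(X, y):
    majority = max(sorted(set(y)), key=y.count)
    return lambda x: majority


def fit_nearest_centroid(X, y):
    centroids = {}
    for label in sorted(set(y)):
        pts = [x for x, t in zip(X, y) if t == label]
        centroids[label] = [sum(col) / len(pts) for col in zip(*pts)]

    def predict(x):
        def sq_dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, centroids[c]))
        return min(centroids, key=sq_dist)

    return predict


def accuracy(y, preds):
    return sum(t == p for t, p in zip(y, preds)) / len(y)


# Synthetic two-class data: two Gaussian blobs in 2D (assumed, not real data).
rng = random.Random(42)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(30)]
X += [[rng.gauss(3, 1), rng.gauss(3, 1)] for _ in range(30)]
y = [0] * 30 + [1] * 30

pred_nc = out_of_fold_predictions(fit_nearest_centroid, X, y, k=5)
pred_maj = out_of_fold_predictions(fit_majority, X, y, k=5)
pred_loo = out_of_fold_predictions(fit_nearest_centroid, X, y, k=len(X))

print("5-fold accuracy, nearest centroid:", accuracy(y, pred_nc))
print("5-fold accuracy, majority class:  ", accuracy(y, pred_maj))
print("leave-one-out accuracy, nearest centroid:", accuracy(y, pred_loo))
stat, p_value = mcnemar(y, pred_nc, pred_maj)
print("McNemar statistic:", stat, "p-value:", p_value)
```

Both classifiers are evaluated on the same folds (same seed), so their predictions are paired, which is what McNemar's test requires. A small p-value suggests that the two classifiers really differ in error rate on this data rather than by chance.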