I have a low accuracy(50%) on my image classification when using supplied test data but high accuracy (84%) when using 10-CV on my 900 images (observation data). From my reading, this happen due to overfitting. FYI, I am using LIBSVM with RBF kernel. I have done several things to overcome overfitting which includes:
a) increase data set & features (before this i got only 400 images)
b) Normalised the data
c) use a different images in test sets that have the same features in training
However, it still gives a low accuracy. Therefore in order to find the best accuracy I need to do some grid search to find the suitable setting for gamma and cost that hopefully will increase my accuracy.
After doing some grid search in Weka, I still got poor result in my accuracy as shown below. Am I wrongly set the grid search parameter?
=== Classifier model (full training set) ===
weka.classifiers.meta.GridSearch:
Classifier: weka.classifiers.functions.LibSVM -S 0 -K 2 -D 3 -G 1.0E10 -R 0.0 -N 0.5 -M 40.0 -C 1.0E18 -E 0.001 -P 0.1 -model "C:\\Program Files\\Weka-3-8" -seed 1
X property: Cost
Y property: gamma
Evaluation: Correlation coefficient
Coordinates: [18.0, 10.0]
Values: 1.0E18 (X coordinate), 1.0E10 (Y coordinate)
LibSVM wrapper, original code by Yasser EL-Manzalawy (= WLSVM)
Time taken to build model: 229.71 seconds
=== Stratified cross-validation ===
=== Summary ===
Correctly Classified Instances 422 49.7642 %
Incorrectly Classified Instances 426 50.2358 %
Kappa statistic -0.0047
Mean absolute error 0.5024
Root mean squared error 0.7088
Relative absolute error 100.4705 %
Root relative squared error 141.7527 %
Total Number of Instances 848
=== Detailed Accuracy By Class ===
TP Rate FP Rate Precision Recall F-Measure MCC ROC Area PRC Area Class
0.198 0.203 0.494 0.198 0.283 -0.006 0.498 0.499 DISENGAGED
0.797 0.802 0.499 0.797 0.613 -0.006 0.498 0.499 ENGAGED
Weighted Avg. 0.498 0.502 0.496 0.498 0.448 -0.006 0.498 0.499
=== Confusion Matrix ===
a b