I have a data set with 91 Obervsation with 700 features. I have reduced the dimension of data using PCA? then I split the data set into 70/30. After that applied kfold =10 cross-validations on training data set and accuracy over each fold is varied a lot.

this is accuracy over each fold?

fold1 = 0.71

fold2 = 0.83

fold3 = 0.50

fold4 0.29,

fold5 = 0.67 ,

fold6 = 0.33

fold7 = 0.14

fold8 = 1.00

fold9 = 0.33

fold10 = 0.14

is it correct to have such variation? if not then how can I solve that problem?

how can I improve my accuracy over each fold at least it should not vary much?

what will be the possible reason for that? why it vary a lot?

I am looking for an answer to these questions

Similar questions and discussions