My research involves speech/chatter discrimination. I have a dataset of audio files comprising 2 classes (speech, chatter): 300 audio files in the speech class and 300 in the chatter class (600 audio files in total). I know that I need to perform 10-fold cross-validation to evaluate the classifier. I would like confirmation that the process I plan to follow, described below, is right.

1. Divide the dataset into 10 subsets (60 audio files per subset: 30 speech, 30 chatter).

3. Train on 9 of the subsets (540 files) and test on the remaining one (60 files), then repeat so that each of the 10 subsets is used as the test set exactly once.
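To make the splitting concrete, here is a minimal sketch of what I have in mind, using only the Python standard library. The function name `stratified_folds`, the fixed seed, and the label list (300 speech entries followed by 300 chatter entries) are my own placeholders for illustration; the classifier training and scoring are left as a comment because that part depends on the features and model.

```python
import random

def stratified_folds(labels, n_folds=10, seed=0):
    """Return n_folds lists of indices, each holding an equal share of every class."""
    rng = random.Random(seed)
    folds = [[] for _ in range(n_folds)]
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        # Deal the shuffled indices round-robin: 300 / 10 = 30 files per class per fold.
        for pos, i in enumerate(idx):
            folds[pos % n_folds].append(i)
    return folds

# Placeholder labels: one entry per audio file (assumed order, for illustration only).
labels = ["speech"] * 300 + ["chatter"] * 300
folds = stratified_folds(labels)

for k, test_idx in enumerate(folds):
    # Training set = every fold except fold k.
    train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
    # Train the classifier on train_idx and score it on test_idx here,
    # then average the 10 scores at the end.
    print(k, len(train_idx), len(test_idx))
```

Each pass should report 540 training files and 60 test files, and every file ends up in the test set exactly once across the 10 passes.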

Is this correct?
