I am working on an audio dataset. The dataset contains train and test audio files. The train audio files contain a crowd environment where only humans are murmuring and chattering. The test data contains these human chattering voices along with gunshots, passing cars, and accidents.
My question is: which features are good for extracting information from this kind of training data, which has no specific sound, just chattering voices?
Regards