Hello, I have 26 hours of audio data. This is huge data to manually label every 2s data frame as a scream or not. Is there any idea to do at least a portion of it automatically to save time?
You should calculate the decibel levels on a sliding window to cover the 2second interval that you are using. From there you can put a threshold on what is considered scream or not (Using a standard classification of noise. For example classify screams at night as noise falling in the 50 dB as per the WHO's NIGHT NOISE GUIDELINES FOR EUROPE or similar guideline).
You can write a program to iterate through the audio data every 2 sec. to search for a threshold to automate a label for a scream. You can use GPU hardware to speed up the search.