I am in the process of transcribing audio from YouTube videos as part of my PhD dissertation. Meanwhile I have faced several obstacles on the way among which are noise, double speakers, muffled speech, unidentified words, and more. I will be very grateful, if you be kind and suggest the best ways I can overcome those obstacles and continue my work.

Similar questions and discussions