Hello, I am looking for papers about the pros and cons of CNNs and RNNs, and the advantages of a hybrid CNN-RNN model over the two separate models (if indeed there is an advantage) in speech recognition tasks, or in event detection tasks. Can anyone suggest relevant studies?

Similar questions and discussions