Hello, I am looking for papers about the pros and cons of CNNs and RNNs, and the advantages of a hybrid CNN-RNN model over the two separate models (if indeed there is an advantage) in speech recognition tasks, or in event detection tasks. Can anyone suggest relevant studies?