Though I am quite aware of the paper 'Visualizing and Understanding Convolutional Networks', I am not sure of how we can visualize 3D conv filters and how to interpret them.
As explained in the original paper of C3D by FacebookAI, they perform deconvolution on the intermediate conv feature maps to project them in the image space to visualize what the model is learning. But still things are not clear to me in this regard.