I was wondering if anyone knows of, or has published, a technique that successfully combines shallow (HOG, SIFT, LBP) with deep (GoogLeNet) representations? I am interested in both the image and video cases.
