I want to develop a system based on the neural network that can accurately and fast recognize human actions in real-time, both from live webcam feeds and pre-recorded videos. My goal is to employ state-of-the-art techniques that can handle diverse actions and varying environmental conditions.
I would greatly appreciate any insights, recommendations, or research directions that experts could provide me with.
Thank you so much in advance.