Identifying or segmenting objects in video is one of the complex problems in computer vision. Nowadays, deep learning (DL) based methods give good results on these types of tasks. Since video is very diverse in nature, can a fundamental approach (one without any type of learning) compete with recent DL-based models and provide better segmentation accuracy?
What is your view? Please provide your expert comments and some possible solutions.