Video dataset should contain a single person with moving face gestures so that the trained model does not depend on a single pose. I want to train facenet VGG face retrained models on this dataset using triplet loss for transfer learning .

Similar questions and discussions