Hi all.

I need to evaluate my tracking system how accurate it works. So I need a face video dataset that contains multiple faces in each frame and also the labels contain bounding boxes to represent the coordinate of faces. The format of labels should be like this:

frame1, track_id1, face1 , [x_left_1, y_left_1, w1, h1], face2, track_id2, [x_left_2, y_left_2, w2, h2]

Briefly, I need a face video dataset for face tracking with label files that contain track_id and boxes of faces appearing in each video frame. I mean the dataset should support multiple faces in each frame.

More Saeedeh Bahrami's questions See All
Similar questions and discussions