Hi all,
I am doing a multi-label classification using machine learning algorithms. In order to evaluate the performance of the multi-label classification, I am using label-based and sample-based performance evaluation metrics listed in Article A Tutorial on Multi-Label Learning
.When I compared both types of approaches to performance evaluation, sample-based metrics are lower in value than the label-based metrics. I mean precision and recall of sample-based very low than the precision and recall of label-based method.
Just wanted to understand that is correct behavior or I am doing something wrong.