I want to know how multi-label classification work in detail. At the same time, how does the confusion matrix for multi-label classification is constructed? How does the performance measures such as accuracy, precision, recall, and f-measure is calculated for multi-label classification?