For a balanced dataset, we use a confusion matrix and the performance metrics derived from it: accuracy, precision, recall, and F1-score. But what if the dataset is imbalanced? I suspect these metrics are not enough on their own. Is that right? If so, please share some additional details on what else should be used.
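
To make my concern concrete, here is a minimal Python sketch with made-up labels: 95 negatives, 5 positives, and a hypothetical model that always predicts the majority class. The helper names `confusion_counts` and `metrics` are my own, and scoring precision, recall, and F1 as 0.0 when their denominator is zero is an assumption I made for the illustration, not a standard I am sure of.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    return tp, fp, fn, tn


def metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 from the confusion counts."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    accuracy = (tp + tn) / len(y_true)
    # Assumption: a metric with a zero denominator is reported as 0.0.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up imbalanced data: 95 negatives (0) and 5 positives (1).
y_true = [0] * 95 + [1] * 5
# Hypothetical model that always predicts the majority class.
y_pred = [0] * 100

result = metrics(y_true, y_pred)
print(result)
```

With these toy labels the accuracy is 0.95 even though the model never finds a single positive, so recall and F1 for the minority class are both 0. This is why I doubt that accuracy alone tells the full story on an imbalanced dataset.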