Suppose we have data from three different domains (e.g. security, AI and sport) and we ran three separate case studies or experiments, one per domain, computing Precision, Recall and F-measure for each experiment. How can we estimate the overall Precision, Recall and F-measure for the model? Is a plain arithmetic mean of the per-domain scores suitable, or should we use F1 or a p-value instead? Which one is better?
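To make the options concrete, here is a minimal sketch of the two usual ways to combine per-domain results: macro-averaging (the plain mean of the per-domain scores) and micro-averaging (pooling the raw counts across domains and computing the metrics once). The counts, the `Counts` class and the function names are hypothetical and exist only for illustration. Note that F1 is itself one of the per-experiment metrics and a p-value is a significance test, so neither is an averaging method on its own.

```python
from dataclasses import dataclass


@dataclass
class Counts:
    """Raw confusion counts for one domain (hypothetical container)."""
    tp: int  # true positives
    fp: int  # false positives
    fn: int  # false negatives


def prf(tp, fp, fn):
    """Return (precision, recall, F1), using 0.0 when a denominator is zero."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def macro_average(domains):
    """Plain mean of per-domain scores: every domain weighs the same."""
    scores = [prf(c.tp, c.fp, c.fn) for c in domains.values()]
    n = len(scores)
    return tuple(sum(s[i] for s in scores) / n for i in range(3))


def micro_average(domains):
    """Pool the counts first: larger domains weigh more."""
    tp = sum(c.tp for c in domains.values())
    fp = sum(c.fp for c in domains.values())
    fn = sum(c.fn for c in domains.values())
    return prf(tp, fp, fn)


# Made-up counts, not results from any real experiment.
domains = {
    "security": Counts(tp=80, fp=20, fn=20),
    "AI": Counts(tp=30, fp=10, fn=30),
    "sport": Counts(tp=10, fp=10, fn=0),
}

macro = macro_average(domains)
micro = micro_average(domains)
print("macro P/R/F1:", [round(x, 4) for x in macro])
print("micro P/R/F1:", [round(x, 4) for x in micro])
```

Macro-averaging treats the three domains as equally important regardless of their size, while micro-averaging lets the domain with the most instances dominate. Which is more appropriate depends on whether the claim is about per-domain behaviour or about performance over all instances; reporting the per-domain scores next to the aggregate is usually the safest choice.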
