Hello friends,
I have run into a problem with the evaluation of semantic segmentation. Qualitatively, the ground truth and the prediction look quite similar, but the Dice score is low, about 0.56. How is this possible? If anyone has encountered the same problem, I would greatly appreciate it if you could share your experience and recommendations.
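
For context, here is a minimal sketch of how the Dice score is commonly computed for two binary masks, Dice = 2|P ∩ T| / (|P| + |T|). It is not my actual evaluation code: the function name `dice_score`, the toy masks, and the choice to return 1.0 when both masks are empty are assumptions made for illustration, and plain Python lists are used only to keep it dependency-free.

```python
def dice_score(pred, target):
    """Dice = 2*|P ∩ T| / (|P| + |T|) for two binary masks given as 2D lists of 0/1."""
    # Flatten both masks into 1D lists of ints.
    flat_pred = [int(v) for row in pred for v in row]
    flat_target = [int(v) for row in target for v in row]
    if len(flat_pred) != len(flat_target):
        raise ValueError("masks must have the same number of pixels")

    # Pixels that are foreground in both masks.
    intersection = sum(p & t for p, t in zip(flat_pred, flat_target))
    total = sum(flat_pred) + sum(flat_target)

    # Both masks empty: treated as perfect agreement here (an assumed convention).
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


# Hypothetical toy example: a 2x2 object predicted one pixel to the right.
target = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
pred = [
    [0, 0, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
]
print(dice_score(pred, target))  # prints 0.5
```

In this toy case a one-pixel shift of a small object already gives 0.5, so a low score next to similar-looking masks seems at least arithmetically possible, but I am not sure whether that explains my result.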