There are lots of work going on related to the intersection of consistency regularisation and image segmentation. Can anyone explain to me why consistency regularisation, i.e. finding consistency between two different segmentation map helps the overall learning, and why is it better than a simple encoder-decoder architecture?