"Compared to its predecessors, the main advantage of CNN is that it automatically detects the important features without any human supervision. This is why CNN would be an ideal solution to computer vision and image classification problems."
In contrast to dense neural networks, CNNs offer a distinct advantage in automatically identifying crucial features without the need for human supervision. This characteristic makes CNNs particularly well-suited for addressing computer vision and image classification challenges, providing an optimal solution. Furthermore, CNNs entail a significantly reduced computational burden, as they necessitate the adjustment of a substantially smaller number of weights.
I think, one of the main or primary reasons Convolutional Neural Networks (CNNs) are preferred over Dense Neural Networks (DNNs) for image classification tasks is their inherent ability to efficiently capture spatial hierarchies and patterns within images. CNNs excel at identifying local patterns such as edges, textures, and shapes by using convolutional layers that focus on specific regions of the input image. This hierarchical feature detection, along with weight sharing and pooling layers for translation invariance, enables CNNs to extract and learn complex visual features more effectively than DNNs.
To put it in simple terms, one would say in DNNs, every neuron in one layer is connected to every neuron in the next layer, without considering the spatial relationships present in images. This difference is why CNNs are more effective for tasks like image recognition, they're built to understand local details and patterns within the image.